How to track the model Progress/status when Sagemaker Kernel is dead? – Amazon web services
While training a model on AWS Sagemaker(let us assume training takes 15 hours or more). If our laptop lose internet connection in between, the Kernal on which it is training will die. But the model continues to train (I confirmed…