Hi - With TensorFlow, I would just edit the checkpoints text file when I wanted to resume training from a specific checkpoint. What do I do when using PyTorch (checkpoint.pt looks wild…)? Thanks!
Looks like we always load from “checkpoint.pt”, so you should just be able to copy (or symlink) the checkpoint you want to that name.
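For anyone who wants to script the copy step, here is a minimal sketch. It assumes the saved checkpoints sit in the same run directory as “checkpoint.pt”. The helper name `select_checkpoint` and the file name `checkpoint_0500.pt` are made up for illustration and are not part of the project. A `.pt` file is typically binary (written by `torch.save`), so unlike the TensorFlow text file it is not meant to be edited by hand; swapping the file is the way to go.

```python
import shutil
from pathlib import Path


def select_checkpoint(run_dir, chosen_name, target_name="checkpoint.pt"):
    """Copy the chosen checkpoint over the file name the trainer loads from.

    Hypothetical helper: the directory layout and file names are assumptions.
    """
    run_dir = Path(run_dir)
    chosen = run_dir / chosen_name
    target = run_dir / target_name

    if not chosen.is_file():
        raise FileNotFoundError(f"no such checkpoint: {chosen}")

    # Nothing to do if the target already is (or links to) the chosen file.
    if target.exists() and chosen.resolve() == target.resolve():
        return target

    # Keep whatever is currently in place so it is not lost.
    if target.exists() or target.is_symlink():
        backup = target.with_name(target.name + ".bak")
        target.replace(backup)

    # copy2 preserves timestamps; use os.symlink instead to avoid duplicating
    # a large file.
    shutil.copy2(chosen, target)
    return target
```

Usage would look like `select_checkpoint("results/my_run", "checkpoint_0500.pt")`, after which resuming training should pick up the chosen checkpoint. The previous “checkpoint.pt” is kept as “checkpoint.pt.bak” in case you want to go back.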
We have MLA-1517 in our internal tracker to provide a better interface for this on the command line. It should be a pretty straightforward PR (I think) if anybody reading this wants to try to implement it.