Thanks for your great work.Do you have a plan to release the training code of the knowledge distillation process in Megatron-LM?
Please see our response here, thank you: https://github.com/NVlabs/Minitron/issues/5
· Sign up or log in to comment