About lr and evaluation

#6
by 141forever - opened

Hello, I reproduced the results by following your training parameter settings exactly. I found that with a learning rate of 1e-7 the model learns almost nothing; the learning rate needs to be between 1e-5 and 5e-5 for training to succeed.
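
For anyone who wants to retry this, here is a minimal sketch of how one might pick a few candidate learning rates inside the 1e-5 to 5e-5 range mentioned above. The helper name `log_spaced_learning_rates` and the choice of four candidates are assumptions made for illustration; they are not part of the original training setup, and each value would still need to be passed to whatever training script is being used.

```python
import math


def log_spaced_learning_rates(low, high, num):
    """Return `num` learning rates spaced evenly on a log scale from `low` to `high`.

    Hypothetical helper for illustration only; not taken from the original recipe.
    """
    if low <= 0 or high <= 0:
        raise ValueError("learning rates must be positive")
    if num < 2:
        return [low]
    log_low, log_high = math.log10(low), math.log10(high)
    step = (log_high - log_low) / (num - 1)
    return [10 ** (log_low + i * step) for i in range(num)]


# Range reported to work in this thread; the count of 4 is an arbitrary choice.
candidates = log_spaced_learning_rates(1e-5, 5e-5, 4)
for lr in candidates:
    print(f"candidate learning rate: {lr:.2e}")
```

Log spacing is used here because learning rates are usually compared by order of magnitude, so evenly spaced exponents cover the range more sensibly than evenly spaced raw values.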

BTW, may I ask if it's possible to share the parameter settings and code for the evaluation?

Hugging Face H4 org

Thanks for letting us know about the learning rate!

As for the eval code, I just created a gist with the script we were using internally. Hope it's useful.

https://gist.github.com/cmpatino/2270db038f93e8714f8fb213ff60f48f

cmpatino changed discussion status to closed
