About lr and evaluation
#6
by
141forever
- opened
Hello, I have reproduced the code results strictly according to your traing parameter settings. I found that when the learning rate is set to 1e-7, it is difficult to learn anything, so the learning rate needs to be set between 1e-5 and 5e-5 for successful training.
BTW, may I ask if it's possible to share the parameter settings and code for the evaluation?
Thanks for letting us know about the learning rate!
About the eval code, I just created a gist with the script we were using internally. Hope it's useful
https://gist.github.com/cmpatino/2270db038f93e8714f8fb213ff60f48f
cmpatino
changed discussion status to
closed