Cleaned up code, added multi-seed training wrapper and PyTorch profiler training option, updated Gradio demo, revised research paper to reflect the new training techniques and their results; architecture.md now explains all designs and decisions
90a2698
OliverPerrin committed on
Medium training run
b93250a
OliverPerrin committed on
Major improvements: real titles, anti-overfitting, better demo UX
0fe274c
OliverPerrin committed on
Add English language filter to all data downloads
8573220
OliverPerrin committed on
Fixed compiling issue, added length penalty, and attempting to freeze encoder layers 0-5 to lower the trainable parameter count and preserve T5's language understanding.
baf3026
OliverPerrin committed on
Clean up codebase and fix training bugs
1601799
OliverPerrin committed on
Update LexiMind: improved training, model architecture, and evaluation