Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
Zayd Muhammad Kawakibi Zuhri PRO
zaydzuhri
AI & ML interests
I really like watching loss go down
Recent Activity
updated a dataset about 18 hours ago
zaydzuhri/selective-copy-mad
published a dataset about 18 hours ago
zaydzuhri/selective-copy-mad
updated a dataset about 22 hours ago
zaydzuhri/fuzzy-recall-mad
Organizations
None yet