AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

[SPAM] Deleted

3
#289 opened 4 days ago by
sarthak-saxena
stas 
posted an update 6 days ago
view post
Post
163
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into
HuggingFace Trainer, Accelerate and TRL

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration.
christopher 
in bigscience/bloom 11 days ago

pretokenizer Regex issues?

8
#278 opened over 1 year ago by
hpcpony
christopher 
in bigscience/bloom 16 days ago

Test PR

#286 opened 16 days ago by
FIRSTACCOUNT69

Test discussion

#287 opened 16 days ago by
FIRSTACCOUNT69

Test discussion

#288 opened 16 days ago by
FIRSTACCOUNT69
albertvillanova 
posted an update 18 days ago
view post
Post
1929
🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0
albertvillanova 
posted an update about 1 month ago
view post
Post
1763
5 years already working in democratizing AI 🤗
Grateful to be part of such an awesome team making it happen every day.