Oh, thank you!
Stefano Fiorucci PRO
anakin87
AI & ML interests
Language Models: orchestration, post-training, GRPO, synthetic data...
Contributing to Haystack LLM framework
Recent Activity
replied to their post about 12 hours ago
How does LLM training with RL Environments work?
It all starts with Reinforcement Learning with Verifiable Rewards (RLVR):
- question asked
- model generates reasoning + answer
- answer checked against ground truth
- reward drives RL training
In this setup, the environment is simple: fixed questions and answers, rollout logic, reward(s)
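The RLVR loop above can be sketched in a few lines. This is a minimal illustration, not any specific library's API; the answer-extraction convention (`#### answer`) and function names are assumptions:

```python
def extract_answer(completion: str) -> str:
    """Pull the final answer out of a 'reasoning ... #### answer' completion."""
    return completion.split("####")[-1].strip()

def verifiable_reward(completion: str, ground_truth: str) -> float:
    """1.0 if the extracted answer matches the ground truth, else 0.0.

    This scalar is what drives the RL update: no learned judge,
    just a deterministic check against a fixed answer.
    """
    return 1.0 if extract_answer(completion) == ground_truth else 0.0

# Example: a math question with a known ground-truth answer
completion = "2 + 2 means adding two twos. #### 4"
print(verifiable_reward(completion, "4"))  # 1.0
```

Because the check is deterministic, the reward is cheap to compute and hard to game compared to a learned reward model.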
Consider a more complex tic-tac-toe env
It adds:
- dynamic game generation/handling
- tunable opponent skill
- multi-turn interactions
(envs can also include tools)
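A toy sketch of what such an environment's interface might look like. The class and method names are illustrative assumptions (loosely following the common `reset`/`step` convention), and the "opponent" here is just a random player; a real env would implement actual opponent skill levels:

```python
import random

class TicTacToeEnv:
    """Toy multi-turn environment sketch: dynamic games, tunable opponent.

    Unlike a fixed Q&A dataset, the env generates fresh games on reset,
    exposes an opponent-skill knob, and interacts over multiple turns.
    """

    def __init__(self, opponent_skill=0.5, seed=None):
        self.opponent_skill = opponent_skill  # 0.0 = random, 1.0 = strong (illustrative)
        self.rng = random.Random(seed)
        self.board = [" "] * 9

    def reset(self):
        """Start a new dynamically generated game; return the observation."""
        self.board = [" "] * 9
        return self.board

    def step(self, move):
        """Apply the model's move, then the opponent's.

        Returns (observation, reward, done).
        """
        if self.board[move] != " ":
            return self.board, -1.0, True  # illegal move ends the episode
        self.board[move] = "X"
        empty = [i for i, c in enumerate(self.board) if c == " "]
        if empty:
            # a skilled opponent would search; this sketch just plays randomly
            self.board[self.rng.choice(empty)] = "O"
        done = " " not in self.board
        return self.board, 0.0, done
```

Tools (function calls the model can make mid-episode) would slot into the same `step` loop as additional actions.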
---
What happens at training?
We use Group Relative Policy Optimization (GRPO) with a tic-tac-toe env.
No critic model needed: the group itself is the baseline.
Simpler than PPO.
1. Rollout generation: from the same board, the model plays N games via sampling
2. Each game scored with deterministic rewards (win, format, ...)
3. Mean score computed across the group
4. Each rollout's advantage = its score minus the group mean
5. Model updated to favor trajectories above the baseline
Repeat
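The advantage step (steps 3 and 4 above) is simple enough to show directly. A minimal sketch, assuming plain per-rollout scalar rewards; some GRPO variants also normalize by the group's standard deviation:

```python
def group_relative_advantages(rewards):
    """Advantage of each rollout = its reward minus the group mean.

    The group mean plays the role of the baseline that PPO would
    estimate with a separate critic model.
    """
    baseline = sum(rewards) / len(rewards)
    return [r - baseline for r in rewards]

# N = 4 games played from the same starting board, scored deterministically
rewards = [1.0, 0.0, 0.0, 1.0]
print(group_relative_advantages(rewards))  # [0.5, -0.5, -0.5, 0.5]
```

Rollouts above the group mean get positive advantages and are reinforced; those below get pushed down.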
For a deep dive, check out
https://github.com/anakin87/llm-rl-environments-lil-course
a free hands-on course on RL environments for LLMs