5 11 43

MoonTide

MoonTideF

AI & ML interests

NLP,CV

Recent Activity

upvoted a paper about 8 hours ago

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

updated a model about 1 month ago

MoonTideF/Llama-GenSyntax

published a model about 1 month ago

MoonTideF/Llama-GenSyntax

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

Paper • 2606.03108 • Published 10 days ago • 7

updated a model about 1 month ago

MoonTideF/Llama-GenSyntax

8B • Updated May 10 • 5

published a model about 1 month ago

MoonTideF/Llama-GenSyntax

8B • Updated May 10 • 5

liked a dataset about 1 month ago

OpenRaiser/Intern-Atlas

Viewer • Updated May 1 • 9.15M • 1.14k • 8

liked a model 2 months ago

openbmb/VoxCPM2

Text-to-Speech • 2B • Updated Apr 16 • 263k • 1.39k

updated a collection 4 months ago

OCR

Collection

3 items • Updated Feb 3

liked a Space 4 months ago

Qwen3-TTS Demo

🎙

1.95k

Generate speech from text using voice design, cloning or presets

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a collection 4 months ago

high-quality Chinese training datasets

Collection

a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 13 items • Updated May 22, 2025 • 24

liked a Space 5 months ago

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

upvoted an article 5 months ago

Article

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

driaforall

•

Sep 11, 2025

• 26

upvoted a paper 5 months ago

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

upvoted a collection 5 months ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

commented a paper 6 months ago

Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings

Paper • 2509.10534 • Published Sep 5, 2025 • 4 •

upvoted a paper 6 months ago

Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings

Paper • 2509.10534 • Published Sep 5, 2025 • 4

upvoted a paper 7 months ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98

commented a paper 7 months ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98 •

New activity in GSAI-ML/LLaDA-8B-Instruct 8 months ago

Question about the chat template which ignores add_generation_prompt

👍 2

#12 opened 12 months ago by

xukp20

liked a dataset 9 months ago

opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Jan 28 • 958M • 24.5k • 75

MoonTide

AI & ML interests

Recent Activity

Organizations

MoonTideF's activity

Qwen3-TTS Demo

SmolLM - blazingly fast and remarkably powerful

The Smol Training Playbook

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Question about the chat template which ignores add_generation_prompt