tm23

tm23hgf

AI & ML interests

None yet

Recent Activity

liked a Space 3 days ago

HuggingFaceFW/finephrase

Organizations

None yet

liked a Space 3 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Explore synthetic data experiments on a virtual bookshelf

updated a model 14 days ago

tm23hgf/anime-sdxl-lora

Updated 14 days ago • 12

published a model 14 days ago

tm23hgf/anime-sdxl-lora

Updated 14 days ago • 12

commented on Strand-Rust-Coder-v1: Rust Coding Model Fine-Tuned on Peer-Ranked Synthetic Data 17 days ago

Awesome work. I am going to start some research on reasoning SLMs for Rust and wanted to know: is the dataset publicly released?

liked a Space 17 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked a Space 23 days ago

GPU Budget Negotiation Arena

Simulate GPU budget negotiations and view results

updated a Space 26 days ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

updated a dataset 27 days ago

tm23hgf/socialnet-sft

Viewer • Updated 27 days ago • 14.6k • 86

published a dataset 27 days ago

tm23hgf/socialnet-sft

Viewer • Updated 27 days ago • 14.6k • 86

published a Space 27 days ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

updated a model about 1 month ago

tm23hgf/Qwen3-1.7B-Wordle-SFT

2B • Updated Apr 18 • 2

published a model about 1 month ago

tm23hgf/Qwen3-1.7B-Wordle-SFT

2B • Updated Apr 18 • 2

updated a Space about 1 month ago

Algo Reasoning Environment

Submit Rust code and reasoning to get a correctness reward

published a Space about 2 months ago

Algo Reasoning Environment

Submit Rust code and reasoning to get a correctness reward

updated a Space about 2 months ago

Algo Reasoning Env

Evaluate algorithmic solutions with automated grading

published a Space about 2 months ago

Algo Reasoning Env

Evaluate algorithmic solutions with automated grading

New activity in BibbyResearch/3blue1brown-manim 5 months ago

Not a good dataset

#2 opened 5 months ago by

commented on Mixture of Experts Explained 6 months ago

The Chinchilla paper actually shows that, for a fixed compute budget, it is better to train a smaller model on more data than to train a larger model for fewer steps.

upvoted an article 6 months ago

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k