AlphaSue

3 22 21

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

upvoted a paper 3 months ago

Agentic Reasoning for Large Language Models

upvoted an article 6 months ago

Jupyter Agents: training LLMs to reason with notebooks

View all activity

Organizations

None yet

liked 2 models about 1 year ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24, 2025 • 680k • • 1.53k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 9.08M • • 13.5k

liked a model over 1 year ago

gair-prox/web-chunk-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 35 • 7

liked a Space over 1 year ago

TxT360: Trillion Extracted Text

📖

134

Explore the TxT360 LLM pre‑training dataset online

liked a model over 1 year ago

jinaai/ReaderLM-v2

Text Generation • 2B • Updated Mar 4, 2025 • 35.9k • 800

liked a Space over 1 year ago

The Ultra-Scale Playbook

🌌

3.93k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset over 1 year ago

microsoft/RedStone

Updated Dec 5, 2024 • 9 • 35

liked a model over 1 year ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 10

liked a dataset over 1 year ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 1.33M • 155

liked 2 models almost 2 years ago

nvidia/quality-classifier-deberta

0.2B • Updated Sep 22, 2025 • 3.02k • 76

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • Updated Nov 16, 2023 • 712k • • 179

liked a dataset about 2 years ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 16.6k • 868

liked a model about 2 years ago

Snowflake/snowflake-arctic-embed-m

liked a Space about 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

liked 4 datasets about 2 years ago

liked a Space over 2 years ago

ControlNet V1.1

📉

1.19k

Generate edited images using edge, pose, and other guides

liked a model almost 3 years ago

TheBloke/Llama-2-7B-Chat-GGML

Text Generation • Updated Sep 27, 2023 • 153 • 873

AlphaSue

AI & ML interests

Recent Activity

Organizations

AlphaSue's activity

TxT360: Trillion Extracted Text

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

ControlNet V1.1