38 1 150

JaheimLee

JaheimLee

AI & ML interests

None yet

Recent Activity

new activity 26 days ago

Jackrong/Qwopus3.5-27B-v3:MTP Speculation

liked a model about 2 months ago

voidful/Qwen3.5-27B-gemini-3.1-opus-4.6-reasoning

new activity 2 months ago

Sehyo/Qwen3.5-397B-A17B-NVFP4:missing think tag

View all activity

Organizations

New activity in Jackrong/Qwopus3.5-27B-v3 26 days ago

MTP Speculation

👍 1

#11 opened about 1 month ago by

memtalow

liked a model about 2 months ago

voidful/Qwen3.5-27B-gemini-3.1-opus-4.6-reasoning

Image-Text-to-Text • 27B • Updated Mar 26 • 22 • 11

New activity in Sehyo/Qwen3.5-397B-A17B-NVFP4 2 months ago

missing think tag

#2 opened 2 months ago by

fouvy

liked 3 datasets 4 months ago

liked a model 4 months ago

QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ

Text Generation • 162B • Updated Jan 5 • 54 • 3

New activity in 0xSero/GLM-4.7-REAP-50-W4A16 4 months ago

REAP-55 quant version

👍 2

#7 opened 4 months ago by

JaheimLee

liked a model 5 months ago

RESMP-DEV/Qwen3-Next-80B-A3B-Thinking-NVFP4

Text Generation • Updated Oct 11, 2025 • 82 • 10

liked a Space 6 months ago

The Smol Training Playbook

📚

3.15k

The secrets to building world-class LLMs

liked 2 Spaces 7 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.33k

Explore and download the FineWeb web‑text dataset

The Ultra-Scale Playbook

🌌

3.83k

The ultimate guide to training LLM on large GPU Clusters

New activity in DevQuasar/Qwen.Qwen3-Next-80B-A3B-Instruct-FP8 8 months ago

VLLM compatibility?

#1 opened 8 months ago by

aidendle94

liked a model 8 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17, 2025 • 268k • • 1.01k

New activity in cyankiwi/GLM-4.5-Air-AWQ-4bit 9 months ago

Does this actually work with VLLM?

#1 opened 9 months ago by

sirus

liked 2 models 10 months ago

Multiverse4FM/Multiverse-32B

Text Generation • 33B • Updated Jun 13, 2025 • 4 • 10

tencent/Hunyuan-A13B-Instruct-GPTQ-Int4

Text Generation • 80B • Updated Jul 11, 2025 • 181 • 51

liked a model 11 months ago

Tongyi-Zhiwen/QwenLong-L1-32B-AWQ

33B • Updated May 29, 2025 • 11 • 10

New activity in Qwen/Qwen3-32B-FP8 12 months ago

Is this a QAT model?

#2 opened about 1 year ago by

Downtown-Case

liked a model 12 months ago

RedHatAI/Qwen3-32B-FP8-dynamic

Text Generation • 33B • Updated May 13, 2025 • 20.5k • 15

JaheimLee

AI & ML interests

Recent Activity

Organizations

JaheimLee's activity

MTP Speculation

missing think tag

REAP-55 quant version

The Smol Training Playbook

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook

VLLM compatibility?

Does this actually work with VLLM?

Is this a QAT model?