295B total / 21B active / 256K context · Fused fast-and-slow thinking in a single model · First model trained on Hunyuan's rebuilt pretraining + RL infra (Feb-Apr)
Benchmarks:
- SWE-Bench Verified, Terminal-Bench 2.0, BrowseComp, WideSearch: competitive results, particularly strong on agentic tool use
- Top score on Tsinghua's 2026 Spring math PhD qualifying exam
- Strong context-learning and instruction-following on Tencent's CL-bench / CL-bench-Life
Our lab recently released a paper introducing ShadowPEFT, a new Parameter-Efficient Fine-Tuning (PEFT) paradigm tailored for edge-computing scenarios.
Unlike traditional approaches such as LoRA and its variants, which inject trainable parameters directly into the Transformer's weights and therefore require tight coupling with the backbone, ShadowPEFT enhances the frozen large base model by adding a lightweight, centralized, pretrainable, and detachable Shadow network. This shadow network runs in parallel with the base model and delivers learned corrections to each decoder layer. Because the shadow module is architecturally decoupled from the backbone, it can be independently trained, stored, and deployed, which benefits edge computing and edge-cloud collaborative computing scenarios.
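For intuition, here is a minimal PyTorch sketch of how a parallel, detachable correction module could be wired onto a frozen decoder-only backbone. Everything in it is an illustrative assumption rather than the paper's actual implementation: the names ShadowBlock, ShadowNetwork, and attach_shadow, the bottleneck-MLP correction, the additive forward-hook mechanism, and the `base_model.layers` attribute are all placeholders standing in for whatever ShadowPEFT actually does.

```python
import torch
import torch.nn as nn


class ShadowBlock(nn.Module):
    """Hypothetical per-layer correction: a small bottleneck MLP (an assumption,
    not the architecture from the paper)."""

    def __init__(self, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)
        self.act = nn.GELU()
        # Zero-init the up-projection so the attached shadow starts as a no-op.
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        return self.up(self.act(self.down(h)))


class ShadowNetwork(nn.Module):
    """One centralized, detachable module holding a correction block per decoder layer."""

    def __init__(self, num_layers: int, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.blocks = nn.ModuleList(
            [ShadowBlock(hidden_size, bottleneck) for _ in range(num_layers)]
        )

    def forward(self, layer_idx: int, hidden: torch.Tensor) -> torch.Tensor:
        return self.blocks[layer_idx](hidden)


def attach_shadow(base_model: nn.Module, shadow: ShadowNetwork):
    """Hook each decoder layer so its output is nudged by the shadow's correction.
    Returns the hook handles so the shadow can later be detached via handle.remove().
    Assumes the backbone exposes its decoder layers as `base_model.layers`."""
    handles = []
    for idx, layer in enumerate(base_model.layers):
        def hook(module, inputs, output, idx=idx):
            hidden = output[0] if isinstance(output, tuple) else output
            corrected = hidden + shadow(idx, hidden)
            return (corrected, *output[1:]) if isinstance(output, tuple) else corrected
        handles.append(layer.register_forward_hook(hook))
    return handles
```

Under this reading, fine-tuning would freeze every backbone parameter and optimize only `shadow.parameters()`; removing the hook handles (and shipping the shadow checkpoint on its own) restores the untouched base model, which is what makes a decoupled module of this kind easy to store, swap, and deploy separately on edge devices.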