Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
35.2
TFLOPS
1
2
Shahnawaz Ahmed
swaze
Follow
StefanAK's profile picture
salomons's profile picture
tommulder's profile picture
3 followers
ยท
2 following
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
about 16 hours ago
Cosmos-Reason2
reacted
to
JonnaMat
's
post
with ๐
3 days ago
๐ FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference ๐ Check out our latest FlashHead-enabled model: https://huggingface.co/embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead ๐งฉ Seamless integration with vllm: ``` docker run --rm -it \ --network host \ --shm-size=8g \ --ulimit memlock=-1 \ --ulimit stack=67108864 \ --runtime=nvidia \ --name=vllm-serve \ -e HF_TOKEN=hf_*** \ -e HF_HOME=/root/.cache/huggingface \ embedl/vllm:latest-jetson-orin-flashhead \ vllm serve "embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead" \ --max-model-len 8192 \ --gpu-memory-utilization 0.75 \ --max-num-seqs 2 \ --trust-remote-code ```
liked
a Space
6 days ago
jane-street/droppedaneuralnet
View all activity
Organizations
swaze
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
6 days ago
Running
53
droppedaneuralnet
๐
53
Check your neural net reconstruction permutation
liked
a model
3 months ago
embedl/Llama-3.2-1B-Instruct-FlashHead-W4A16
0.7B
โข
Updated
Dec 16, 2025
โข
6