Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tranheden's picture
1 11

Tranheden

WilhelmT
swaze's profile picture tommulder's profile picture salomons's profile picture
ยท

AI & ML interests

None yet

Recent Activity

reacted to JonnaMat's post with ๐Ÿ”ฅ about 4 hours ago
โšก Blackwell-native Vision Reasoning at the edge โšก Released a NVFP4A16-variant of nvidia/Cosmos-Reason2-2B: https://huggingface.co/embedl/Cosmos-Reason2-2B-NVFP4A16 ๐Ÿ’– Optimized for Blackwell with minimal accuracy drop compared to its FP16 counterpart. Thorough on-device benchmarks on AGX Thor in the modelcard. ๐Ÿค“ ๐Ÿ“Š Try it out: ``` docker run --rm -it \ --network host \ --shm-size=8g \ --ulimit memlock=-1 \ --ulimit stack=67108864 \ --runtime=nvidia \ --name=vllm-serve \ -e HF_TOKEN=hf_*** \ -e HF_HOME=/root/.cache/huggingface \ nvcr.io/nvidia/vllm:26.01-py3 \ vllm serve "embedl/Cosmos-Reason2-2B-NVFP4A16" \ --host 0.0.0.0 \ --port 8000 \ --tensor-parallel-size 1 \ --max-model-len 16384 \ --gpu-memory-utilization 0.9 ```
new activity 4 days ago
nvidia/Cosmos-Reason2-2B:Cosmos-Reason2-2B running on Jetson Orin Nano Super 8GB
liked a model 11 days ago
embedl/Cosmos-Reason2-2B-W4A16
View all activity

Organizations

Embedl's profile picture

WilhelmT 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs