Hugging Face
VivienneZhang
viviennezhang
14 followers · 4 following
AI & ML interests
None yet
Recent Activity
New activity about 1 month ago in HuggingFaceH4/tau2-bench-data: Add eval.yaml to register TAU2-Bench as a benchmark with NeMo Evaluator
New activity about 1 month ago in gorilla-llm/Berkeley-Function-Calling-Leaderboard: Add eval.yaml to register BFCL as a benchmark with NeMo Evaluator
New activity about 1 month ago in SciCode1/SciCode: Add eval.yaml to register SciCode as a benchmark with NeMo Evaluator
Organizations
Articles
2
Article
49
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator
Article
21
Can Your LLM Think Like a Professional? Introducing ProfBench
models
0
None public yet
datasets
0
None public yet