AI & ML interests
None defined yet.
Recent Activity
Papers
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
Articles
Kimodo
Generate high-quality motions from text prompts
VoMP
Volumetric physics materials for interactive worlds
Music Flamingo
Generate detailed answers about any song or YouTube track
KVPress Leaderboard
KVPress leaderboard: benchmark KV Cache compression methods
Audio Flamingo 3 Demo
Audio Flamingo 3 Demo
Judge's Verdict Leaderboard
Judge's Verdict: Benchmarking LLM as a Judge
Llm Robustness Leaderboard
LLM Robustness leaderboard
Magpietts Demo
Generate natural speech from text in multiple languages
MMOU Eval
Evaluate prediction files against MMOU benchmark data
Cosmos Embed1
Cosmos-Embed1 demo app
ProfBench
Human-annotated rubrics in Professional Tasks
Parakeet-TDT-0.6b-V2
Transcribe audio files into timestamped text
Aic Demo
Configure and estimate AI model performance for deployment
Earth2 Inference Demo
Visualize weather forecasts for any date and time range
Nemotron Speech Streaming
Real-time speech recognition with NVIDIA Triton
Difix3D
Interface to interact with NVIDIA's Difix3D+ model
Parakeet-tdt_ctc-1.1b
Transcribe audio with timestamps
DoMINO with Ahmed Body Dataset - Multi-Scale Neural Operator for CFD
Access JupyterLab for interactive coding
Voice Agent WebRTC + LangGraph
Voice agent with LangGraph, WebRTC, ASR & TTS
NV-Reason-CXR-3B Demo
Analyze chest X-rays and identify abnormalities
Kvpress
kvpress: LLM KV cache compression made easy
Synthda Demo
Short Demo of SynthDa, using the real-real interpolation mtd
Modeling Magnetohydrodynamics with PhysicsNeMo
Access JupyterLab for interactive coding
Canary 1b V2
Transcribe and Translate in 25 European Languages