jiahaowang

wang-jiahao

https://wang-jiahao.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset about 16 hours ago

wang-jiahao/AVSCapBench

Viewer • Updated about 15 hours ago • 1.23k

published a dataset about 18 hours ago

wang-jiahao/AVSCapBench

Viewer • Updated about 15 hours ago • 1.23k

upvoted 3 papers 1 day ago

TVIR: Building Deep Research Agents Towards Text–Visual Interleaved Report Generation

Paper • 2606.02320 • Published 4 days ago • 13

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Paper • 2606.01993 • Published 4 days ago • 13

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 4 days ago • 48

upvoted a paper 17 days ago

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

Paper • 2605.15301 • Published 22 days ago • 22

upvoted a paper about 2 months ago

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published Apr 13 • 38

upvoted a paper 3 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

liked a dataset 4 months ago

gtysssp/audio_benchmarks

Viewer • Updated Jul 30, 2025 • 16 • 159 • 2

liked a Space 4 months ago

BibGuard

🛡

Generate a bibliography health report from .bib and .tex files

authored a paper 5 months ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published Dec 24, 2025 • 25

upvoted a paper 5 months ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published Dec 24, 2025 • 25

upvoted 3 papers 6 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12, 2025 • 32

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published Dec 3, 2025 • 29

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304

liked a model 6 months ago

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22, 2025 • 5.42k • 227

upvoted a paper 7 months ago

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20, 2025 • 20

upvoted a paper 8 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 46

liked a dataset 8 months ago

lmms-lab/common_voice_15

Viewer • Updated Feb 4, 2025 • 43.1k • 194 • 1

liked a model 8 months ago

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22, 2025 • 1.56M • 935
