Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
PanChanghao's picture
3 7 6

PanChanghao

DavidPigeon
·
https://david-pigeon.github.io/
  • DavidPigeon

AI & ML interests

audio synthesis

Recent Activity

upvoted a paper about 20 hours ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
upvoted a paper about 20 hours ago
Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios
upvoted a paper about 20 hours ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue
View all activity

Organizations

Zhejiang University's profile picture

liked a Space 8 days ago
Running
84

ACL Pubcheck

📝
84

Check your PDF for ACL guidelines

liked a Space 4 months ago
Paused
Agents
Featured
1.94k

Qwen3-TTS Demo

🎙
1.94k

Generate custom speech from text, voice descriptions, or samples

liked a model 5 months ago

stepfun-ai/Step-Audio-R1.1

Audio-Text-to-Text • 33B • Updated Feb 14 • 284 • 180
liked a Space 5 months ago
Running
Agents
21

Fun-ASR-Nano

🚀
21

LLM-powered ASR: 31 languages, Chinese dialects, timestamps

liked a model 5 months ago

nvidia/bigvgan_v2_24khz_100band_256x

Audio-to-Audio • Updated Sep 5, 2024 • 97.3k • 22
liked a dataset 10 months ago

OpenSound/CapSpeech

Viewer • Updated Jun 4, 2025 • 20.8M • 1.29k • 24
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs