Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Yu Zhang's picture
13 11 5

Yu Zhang

AaronZ345
prompts-dot-com's profile picture this-username-exists's profile picture littlealan's profile picture
·
https://aaronz345.github.io
  • AaronZ345
  • yuzhang34

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

authored a paper 20 days ago
ALIVE: Animate Your World with Lifelike Audio-Video Generation
authored a paper 20 days ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue
authored a paper 20 days ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
View all activity

Organizations

Zhejiang University's profile picture Zhejiang University's profile picture
AaronZ345 's papers 12
arxiv:2605.30993
arxiv:2605.30940
arxiv:2510.10396
arxiv:2508.10924
arxiv:2507.14534
arxiv:2507.06670
arxiv:2505.14910
arxiv:2504.20630
arxiv:2504.19062
arxiv:2409.15977
arxiv:2409.13832
arxiv:2312.10741
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs