| --- |
| license: cc-by-4.0 |
| tags: |
| - audio |
| - audio-super-resolution |
| - speech |
| - music |
| - flow-matching |
| library_name: pytorch |
| --- |
| |
| # UniverSR - General Audio (Flagship) |
|
|
| Vocoder-free audio super-resolution model that upsamples **8/12/16/24 kHz → 48 kHz** audio using flow matching in the complex STFT domain. Trained on speech, music, and sound effects. |
|
|
| This is the **recommended model** for general use. |
| For speech-only evaluation (e.g. VCTK benchmark), see [universr-speech](https://huggingface.co/woongzip1/universr-speech). |
|
|
| **Paper**: [arXiv:2510.00771](https://arxiv.org/abs/2510.00771) | |
| **Demo**: [woongzip1.github.io/universr-demo](https://woongzip1.github.io/universr-demo/) | **Code**: [github.com/woongzip1/UniverSR](https://github.com/woongzip1/UniverSR) |
|
|
| ## Usage |
|
|
| ```python |
| import torchaudio |
| from universr import UniverSR |
| |
| model = UniverSR.from_pretrained("woongzip1/universr-audio", device="cuda") |
| output = model.enhance("low_res.wav", input_sr=8000) |
| torchaudio.save("output_48k.wav", output.cpu(), 48000) |
| ``` |
|
|
| ## Citation |
|
|
| ```bibtex |
| @inproceedings{choi2026universr, |
| title = {{UniverSR}: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching}, |
| author = {Choi, Woongjib and Lee, Sangmin and Lim, Hyungseob and Kang, Hong-Goo}, |
| booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)}, |
| year = {2026} |
| } |
| ``` |