universr-audio / README.md
woongzip1's picture
Update README.md
1c32948 verified
|
Raw
History Blame Contribute Delete
1.41 kB
---
license: cc-by-4.0
tags:
- audio
- audio-super-resolution
- speech
- music
- flow-matching
library_name: pytorch
---
# UniverSR - General Audio (Flagship)
Vocoder-free audio super-resolution model that upsamples **8/12/16/24 kHz → 48 kHz** audio using flow matching in the complex STFT domain. Trained on speech, music, and sound effects.
This is the **recommended model** for general use.
For speech-only evaluation (e.g. VCTK benchmark), see [universr-speech](https://huggingface.co/woongzip1/universr-speech).
**Paper**: [arXiv:2510.00771](https://arxiv.org/abs/2510.00771) |
**Demo**: [woongzip1.github.io/universr-demo](https://woongzip1.github.io/universr-demo/) | **Code**: [github.com/woongzip1/UniverSR](https://github.com/woongzip1/UniverSR)
## Usage
```python
import torchaudio
from universr import UniverSR
model = UniverSR.from_pretrained("woongzip1/universr-audio", device="cuda")
output = model.enhance("low_res.wav", input_sr=8000)
torchaudio.save("output_48k.wav", output.cpu(), 48000)
```
## Citation
```bibtex
@inproceedings{choi2026universr,
title = {{UniverSR}: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching},
author = {Choi, Woongjib and Lee, Sangmin and Lim, Hyungseob and Kang, Hong-Goo},
booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
year = {2026}
}
```