ONNX conversion of Resnet34-tidyvoiceX-ASV from the TidyVoice Challenge: Cross-Lingual Speaker Verification. Compatible with the ONNX inference script from WeSpeaker and with sherpa-onnx.

Download

wget https://huggingface.co/hr16/tidyvoicex-samresnet34-onnx/resolve/main/tidyvoicex_samresnet34.onnx

Example with sherpa-onnx

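import numpy as np
import sherpa_onnx
import soundfile as sf

# Read the audio as float32 samples, the format accept_waveform expects.
audio, sample_rate = sf.read("your-audio-file.wav", dtype="float32")
if audio.ndim > 1:
    audio = audio.mean(axis=1)  # downmix multi-channel audio to mono

embedding_model = sherpa_onnx.SpeakerEmbeddingExtractor(
    sherpa_onnx.SpeakerEmbeddingExtractorConfig(
        # provider="cuda" needs a GPU-enabled build; switch to "cpu" otherwise.
        model="tidyvoicex_samresnet34.onnx", num_threads=1, provider="cuda"
    )
)
stream = embedding_model.create_stream()
stream.accept_waveform(sample_rate=sample_rate, waveform=audio)
stream.input_finished()  # signal that no more audio will be fed
spk_emb = np.array(embedding_model.compute(stream))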
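
Since the model targets speaker verification, a typical next step is to compare two embeddings. The following is a minimal sketch, assuming cosine similarity as the score; the helper names `cosine_similarity` and `same_speaker` and the `threshold=0.5` default are illustrative placeholders, not values from this model card, and a real threshold should be tuned on your own trial data.

```python
import numpy as np


def cosine_similarity(emb_a, emb_b):
    """Return the cosine similarity between two embedding vectors."""
    a = np.asarray(emb_a, dtype=np.float32)
    b = np.asarray(emb_b, dtype=np.float32)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0.0:
        raise ValueError("cannot compare a zero-norm embedding")
    return float(np.dot(a, b) / denom)


def same_speaker(emb_a, emb_b, threshold=0.5):
    """Hypothetical decision rule: accept if the score reaches the threshold.

    The 0.5 default is an assumption for illustration only.
    """
    return cosine_similarity(emb_a, emb_b) >= threshold
```

For example, extracting `spk_emb` as above for two recordings and passing both arrays to `same_speaker` gives a boolean accept/reject decision.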