Instructions to use WillHeld/DiVA-llama-3-v0-8b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use WillHeld/DiVA-llama-3-v0-8b with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="WillHeld/DiVA-llama-3-v0-8b", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("WillHeld/DiVA-llama-3-v0-8b", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
| from transformers import AutoModel | |
| import librosa | |
| import wget | |
| from modeling_diva import DiVAModel | |
| filename = wget.download( | |
| "https://github.com/ffaisal93/SD-QA/raw/refs/heads/master/dev/eng/irl/wav_eng/-1008642825401516622.wav" | |
| ) | |
| speech_data, _ = librosa.load(filename, sr=16_000) | |
| model = DiVAModel.from_pretrained("./") | |
| print(model.generate([speech_data])) | |
| print(model.generate([speech_data], ["Reply Briefly Like A Pirate"])) | |
| filename = wget.download( | |
| "https://github.com/ffaisal93/SD-QA/raw/refs/heads/master/dev/eng/irl/wav_eng/-2426554427049983479.wav" | |
| ) | |
| speech_data2, _ = librosa.load(filename, sr=16_000) | |
| print( | |
| model.generate( | |
| [speech_data, speech_data2], | |
| ["Reply Briefly Like A Pirate", "Reply Briefly Like A New Yorker"], | |
| ) | |
| ) | |