SmolVLM2-500M-Video-Instruct โ€” Ternary Quantized (tritplane3)

Ternary-quantized version of HuggingFaceTB/SmolVLM2-500M-Video-Instruct โ€” a compact video-understanding VLM.

Specifications

Property Value
Base Model HuggingFaceTB/SmolVLM2-500M-Video-Instruct
Parameters 500M
Quantization tritplane3 (225 linear layers)
Full-model effective bits 11.50
Compression ratio 1.39ร—
Avg reconstruction error 0.1413
Vision Encoder FP16 (preserved)

Collection

Part of ternary-models.

GitHub: github.com/Asad-Ismail/ternary-models

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for AsadIsmail/SmolVLM2-500M-Video-Instruct-ternary

Collection including AsadIsmail/SmolVLM2-500M-Video-Instruct-ternary