metadata
title: Marlin 2B Video Understanding
emoji: 🎬
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.10.0
app_file: app.py
short_description: Dense video captions and timestamp search
python_version: '3.10'
startup_duration_timeout: 1h
models:
  - NemoStation/Marlin-2B

Marlin 2B Video Understanding

ZeroGPU Gradio Server demo for NemoStation/Marlin-2B, a 2B-parameter video vision-language model (VLM) for dense video captioning and natural-language temporal grounding.

The app exposes two model-card workflows:

  • Caption: returns Marlin's parsed scene paragraph plus timestamped events.
  • Find: resolves an event query into a parsed start/end time span.
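Both workflows end in a parsing step that turns model text into structured times. The sketch below shows one way that step might look. It assumes a hypothetical line format of `<start> - <end> <description>` with `M:SS` or `H:MM:SS` stamps; Marlin's actual output format is not documented here, and `Event`, `to_seconds`, and `parse_events` are illustrative names, not part of the app.

```python
import re
from dataclasses import dataclass

# ASSUMPTION: one event per line, "<start> - <end> <description>",
# with times written as M:SS or H:MM:SS. The real model output may differ.
_EVENT_RE = re.compile(
    r"^\s*(?P<start>\d+(?::\d{2}){1,2})\s*-\s*"
    r"(?P<end>\d+(?::\d{2}){1,2})\s+(?P<text>.+?)\s*$"
)


@dataclass(frozen=True)
class Event:
    start: float  # seconds from the beginning of the video
    end: float    # seconds from the beginning of the video
    text: str


def to_seconds(stamp: str) -> float:
    """Convert 'M:SS' or 'H:MM:SS' into seconds."""
    seconds = 0
    for part in stamp.split(":"):
        seconds = seconds * 60 + int(part)
    return float(seconds)


def parse_events(raw: str) -> list[Event]:
    """Keep lines that match the assumed format; skip everything else."""
    events = []
    for line in raw.splitlines():
        match = _EVENT_RE.match(line)
        if match is None:
            continue
        start = to_seconds(match["start"])
        end = to_seconds(match["end"])
        if end >= start:  # drop spans that run backwards
            events.append(Event(start, end, match["text"]))
    return events
```

Under the same assumption, a Find result would be a single such span, so `parse_events(reply)[0]` could serve as the start/end pair when the reply contains exactly one matching line.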

The model is loaded once at module startup. Inference runs through Gradio Server @app.api endpoints backed by ZeroGPU, and the custom frontend turns timestamped results into a synced video timeline.
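To illustrate the synced timeline, here is a minimal sketch of how a player position could be mapped to the event that should be highlighted. It is written in Python for consistency with `app.py`; the actual frontend code is not shown in this README and may work differently. The function name `active_event_index` and the `(start, end, text)` tuple layout are assumptions made for this example.

```python
from bisect import bisect_right


def active_event_index(events, current_time):
    """Return the index of the event covering current_time, or None.

    ASSUMPTION: events is a list of (start, end, text) tuples, sorted by
    start time and non-overlapping. Spans are treated as half-open:
    start <= t < end.
    """
    starts = [event[0] for event in events]
    # Last event whose start is <= current_time.
    i = bisect_right(starts, current_time) - 1
    if i >= 0 and current_time < events[i][1]:
        return i
    return None


timeline = [(0.0, 5.0, "intro"), (5.0, 12.0, "door opens"), (20.0, 30.0, "credits")]
print(active_event_index(timeline, 6.5))
print(active_event_index(timeline, 15.0))
```

The binary search keeps each lookup at O(log n), which matters only for long videos with many events; a linear scan would be just as reasonable for short clips.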