Papers
VoLo: A Physical Orchestrator for Open-Vocabulary Long-Horizon Manipulation
GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors
Articles
Cosmos Embed1
Cosmos-Embed1 demo app
Aic Demo
Configure and estimate AI model performance for deployment
Earth2 Inference Demo
Visualize weather forecasts for any date and time range
Nemotron Speech Streaming
Real-time speech recognition with NVIDIA Triton
Difix3D
Interface to interact with NVIDIA's Difix3D+ model
Parakeet-tdt_ctc-1.1b
Transcribe audio with timestamps
DoMINO with Ahmed Body Dataset - Multi-Scale Neural Operator for CFD
Access JupyterLab for interactive coding
Voice Agent WebRTC + LangGraph
Voice agent with LangGraph, WebRTC, ASR & TTS
Kvpress
kvpress: LLM KV cache compression made easy
Synthda Demo
Short demo of SynthDa using the real-real interpolation method
Modeling Magnetohydrodynamics with PhysicsNeMo
Access JupyterLab for interactive coding
Canary 1b V2
Transcribe and Translate in 25 European Languages
Canary 1B Flash
Canary 1B Flash demo
Audio Flamingo 3 Chat
Audio Flamingo 3 demo for multi-turn multi-audio chat
Speech IQ Leaderboard
Speech agentic IQ leaderboard
Addit
Add objects to images using text prompts
Canary-Qwen-2.5B
Transcribe audio and generate responses based on prompts
Plan2Align Test-Time Multilingual Translation
Translate text into multiple languages using various methods
Audio Flamingo 2
Audio Flamingo 2 Demo
PartPacker
Part-level image-to-3D generation.
Cosmos-Predict2 14B
Text-to-Image world model with Cosmos2
Tp 1 Dgx Node Estimator
For NVIDIA TRDC estimation
LOTUS VLM Bias
Leaderboard for societal bias and user preference
Describe Anything
Describe any selected part of an image