AGI

qq94244365 's Collections

AGI

AGI-pic

updated Mar 2

Upvote

Running on Zero

Agents

113

MiniGPT-v2

🚀

113

Chat with images and get visual answers
teknium/Mistral-Trismegistus-7B

Text Generation • Updated Nov 12, 2023 • 294 • • 239
Runtime error

Agents

Featured

345

Latent Consistency Models

⚡

345
Runtime error

Agents

Featured

2.77k

XTTS

🐸

2.77k

Generate speech from text using a reference voice
Build error

Agents

365

VALL E X

🎙

365

Generate audio from text using voice prompts
Build error

Agents

216

LLaMA Board

🦙

216

Fine-tuning large language model with Gradio UI
Running on Zero

Agents

Featured

5.07k

MusicGen

🎵

5.07k

Generate music from a text description and optional melody
Runtime error

Agents

Featured

1.43k

MagicAnimate

💃

1.43k

Generate animated videos from images and motion sequences
Runtime error

Featured

517

Seamless M4T v2

📞

517

Translate speech and text between languages
Running on Zero

MCP

Featured

2.02k

Stable Video Diffusion 1.1

📺

2.02k

Generate a short video from a single image
Runtime error

Agents

Featured

235

Video LLaVA

📚

235
Build error

Agents

Featured

166

Mustango

🐢

166
Runtime error

Agents

559

OpenAI TTS New

📊

559
Running

Featured

399

3D Arena

🏢

399

Vote for 3D creations and view the leaderboard
Running

Featured

229

Distil Whisper Web

👀

229

Transcribe audio files to text instantly
Runtime error

Featured

286

Zero123++ Demo Space

🌒

286
Runtime error

Agents

Featured

108

InstaFlow

🐨

108
Running on Zero

MCP

Featured

1.35k

CLIP Interrogator 2

🕵

1.35k

Generate detailed Stable Diffusion prompts from any image
Running on Zero

Agents

Featured

826

ZoeDepth

🦀

826

Predict depth map from a single image
Paused

Agents

41

LooseControl

📚

41
Runtime error

Agents

Featured

300

Enhance This DemoFusion SDXL

🔍

300

Creative Upscaler High-Res Image Generation DemoFusion SDXL
axiong/PMC_LLaMA_13B

Text Generation • Updated Aug 28, 2023 • 650 • 33
axiong/pmc_llama_instructions

Viewer • Updated Nov 23, 2023 • 514k • 167 • 33
med-flamingo/med-flamingo

Updated Aug 1, 2023 • 57
wikimedia/wikisource

Viewer • Updated Dec 8, 2023 • 1.66M • 2.08k • 84
Running

Agents

2.81k

OutfitAnyone

🏢

2.81k

Generate virtual try‑on images for any model and clothing
Pixel Aligned Language Models

Paper • 2312.09237 • Published Dec 14, 2023 • 16
Running

Agents

190

Gemini Playground

💬

190

Chat with Gemini Pro and upload images for responses
Runtime error

Agents

262

Singing Voice Conversion

🎼

262

Transform your voice into a singer's
Running

Agents

60

Text To Speech

🔥

60

Generate speech from text with different voices
Runtime error

Agents

28

Text To Audio

🌖

28
Build error

Agents

52

NaturalSpeech2

🎧

52

Generate speech with cloned timbre
Runtime error

Agents

Featured

207

AnyDoor Online

👁

207

Teleport objects into new backgrounds using masks
Runtime error

Agents

59

MotionCtrl

📊

59
Runtime error

Agents

118

MotionGPT

🏃

118

Generate human motion from text or audio
MotionGPT: Human Motion as a Foreign Language

Paper • 2306.14795 • Published Jun 26, 2023 • 28
Paused

Agents

435

GPT-Academic

😻

435

Generate academic responses using GPT
Runtime error

Agents

Featured

94

M2UGen Demo

💻

94
Runtime error

Agents

Featured

63

VCoder

✌

63
Running on Zero

Agents

Featured

1.01k

IP-Adapter-FaceID

🧑

1.01k

Generate AI images featuring your own face
Runtime error

Agents

Featured

267

AnyText

👁

267

Generate images with text and edit existing images
osunlp/Mind2Web

Viewer • Updated Oct 19, 2025 • 253 • 7.1k • 126
Paused

144

FaceChain

🏆

144

Display Hugging Face status and loading animation
Runtime error

229

Dreamtalk

😛

229

Animate a portrait from audio speech
Runtime error

104

I2VGen-XL

🔥

104
Runtime error

Agents

Featured

953

ReplaceAnything

📚

953

Replace objects in images using prompts or reference images
Running on Zero

Agents

Featured

1.95k

PhotoMaker

📷

1.95k

Generate personalized photos of a person from a prompt
Running on Zero

Agents

474

Resemble Enhance

🚀

474

Enhance your audio with denoising and quality boost
Build error

Agents

6

DiffusionGPT

👁

6

Generate images from text prompts
Build error

Agents

15

DiffusionAgent XL

🐢

15
Running on Zero

Agents

Featured

3.61k

InstantID

😻

3.61k

Generate personalized images preserving your face identity
Running

42

DuckDB NSQL 7B

🏢

42

Generate DuckDB SQL queries from natural language
Running on Zero

Agents

Featured

212

InstructIR

💻

212

Enhance images with custom text instructions
Running on Zero

MCP

Featured

572

Image to Music v2

🎺

572

Get a music sample inspired by the mood of an image
Running

Agents

Featured

879

BRIA RMBG 1.4

💻

879

Remove background from images instantly
Runtime error

Agents

Featured

490

YOLO World

🔥

490

Detect objects in images or videos
Running

Featured

561

Vision Arena (Testing VLMs side-by-side)

🖼

561

Explore Vision Arena visual AI demo online
Running on Zero

Agents

Featured

1.68k

Stable Cascade

👁

1.68k

Generate high‑resolution images from text prompts
Runtime error

Agents

Featured

68

Diffusion Transformers (DiT)

🚀

68
Running on Zero

Agents

Featured

467

SDXL Lightning

⚡

467

Super-fast image generation on SDX
Build error

Agents

Featured

259

YOLO-World + EfficientSAM

🔥

259

Detect and segment objects in images or videos
Running on Zero

Agents

Featured

127

Differential Diffusion

😻

127

Edit images with custom change maps using AI
Running

Featured

56

YOLOv9 Object Detection w/ Transformers.js

🖼

56

In-browser object detection w/ YOLOv9 and Transformers.js
Build error

Agents

75

Depth Anything Video

👁

75

Generate depth maps for video frames
Running on Zero

Agents

Featured

566

Depth Anything

🌖

566

Generate depth map from a single image
Running on Zero

Agents

Featured

477

MeloTTS

🗣

477

Fast, efficient, & multilingual text-to-speech
Running on Zero

Agents

Featured

1.14k

Playground V2.5

🌍

1.14k

Generate highly aesthetic images
Running

Agents

123

MoMask

🎭

123

Generate 3D human motion from text prompts
Running on Zero

Agents

Featured

655

PhotoMaker Style

📷

655

Generate personalized stylized portraits from your photos
Runtime error

Agents

56

TCD

📈

56

Official Demo Space for Trajectory Consistency Distillation
Running on Zero

Agents

821

TripoSR

🐳

821

Generate a 3D model from a single image
Running

42

Magi Demo

🏢

42

Generate comic transcriptions from images
Running on Zero

Agents

1.4k

Animagine XL 3.1

🌍

1.4k

The most opinionated, anime-themed SDXL model
Build error

Agents

Featured

206

Img2img Turbo Sketch

📚

206
Running on Zero

Agents

141

APISR

🏃

141

Enhance low‑resolution anime images with AI upscaling
Runtime error

Agents

Featured

166

DynamiCrafter

🐨

166

Generate animated videos from images and text prompts
Running on Zero

Agents

290

DynamiCrafter

🐨

290

Animate an image into a video using a text prompt
Build error

Agents

9

DragAPart

🏢

9
Running

11.1k

AI Comic Factory

👩

11.1k

Create your own AI comic with a single prompt
Running

85

GRM

🏆

85

Display a live demo website
Configuration error

Agents

74

AnyV2V

🎥

74

Video Editing
Configuration error

Agents

49

DesignEdit

🌿

49
Running on Zero

Agents

Featured

1.54k

InstructPix2Pix

🚀

1.54k

Edit images using text instructions
Running on Zero

Agents

Featured

848

Parler-TTS

🥖

848

High-fidelity Text-To-Speech
Running on Zero

Agents

118

MagicTime

🚀

118

MagicTime: Time-lapse Video Generation Models as Metamorphic
Running on Zero

Agents

46

CustomNet

🐠

46

Generate customized scenes with your object and viewpoint
Running on Zero

Agents

244

PixArt Sigma

👁

244

Generate high-res images from text prompts
Running

Agents

47

Sd3 Api

😻

47

Generate images from text prompts
Running on Zero

Agents

Featured

1.58k

InstantMesh

📚

1.58k

Create a 3D model from an image in 10 seconds!
Runtime error

Agents

243

Hyper SDXL 1Step T2I

🐠

243

Generate images from text prompts
Running on Zero

Agents

Featured

2.11k

IDM VTON

👕

2.11k

High-fidelity Virtual Try-on
Runtime error

Agents

1.37k

IC Light

📈

1.37k

Relight photos with AI using custom lighting prompts
Running

295

Phi-3 WebGPU

🚀

295

A private and powerful AI that runs locally in your browser
Paused

Agents

Featured

315

PaliGemma Demo

🤲

315

Annotate and describe images with text prompts
Running

Agents

Featured

101

Yolov10

📉

101

Detect objects in images with customizable YOLOv10 models
Runtime error

Agents

74

Open Sora Plan V1.1.0

⚡

74
Runtime error

Agents

Featured

338

Chattts Zero

🐢

338

Generate audio from text with tuning options
Running on Zero

Agents

Featured

1.06k

ToonCrafter

😻

1.06k

Generate animated video from two images and a prompt
Running on Zero

Agents

Featured

2.37k

Bark

🐶

2.37k

Generate realistic speech and sounds from typed text
Running on Zero

Agents

Featured

126

MimicBrush

🐨

126

Edit image regions using a reference picture
Runtime error

Agents

74

SD3 ControlNet

⚡

74

Generate images using a reference image and text prompt
Running

Agents

120

ChatTTS Speaker

🌍

120

Download and preview ChatTTS speaker embeddings
Running on Zero

Agents

Featured

259

SD3 Long Captioner

🏃

259

Generate detailed captions for your images
Running on Zero

Agents

242

MassivelyMultilingualTTS

🌍

242

Generate natural speech in 7000+ languages
Running on Zero

Agents

Featured

199

Flash SD3

⚡

199

Generate high‑quality images from text prompts in seconds
Running on Zero

Agents

Featured

844

Florence 2

📉

844

Generate captions, detections, and segmentations for any image
Runtime error

Agents

Featured

111

ExVideo SVD 128f V1

🐨

111

Generate a video from an image
Runtime error

Agents

163

InternLM XComposer

🏢

163

Display a web page
Paused

272

Llm Pricing

📊

272

Display a React app with TypeScript
Paused

Agents

133

FoleyCrafter

📚

133

Generate audio for silent videos
Running on Zero

Agents

1.21k

PhotoMaker V2

📷

1.21k

Generate personalized portrait images from your photos and prompts
Running on Zero

Agents

3.75k

Live Portrait

🤪

3.75k

Apply the motion of a video on a portrait
Build error

Agents

Featured

137

Diffree

🖼

137
Paused

Agents

13

ViPer

😻

13

Generate personalized images based on comments
Running on Zero

Agents

Featured

1.19k

Stable Fast 3D

🎮

1.19k

Generate a 3D mesh from a single image
Running on Zero

MCP

2.86k

Background Removal

🌘

2.86k

Remove backgrounds from images instantly
Running on Zero

Agents

Featured

163

LongWriter

💬

163

LLM for long context
Running on CPU Upgrade

Agents

10.1k

Kolors Virtual Try-On

👕

10.1k

Generate virtual try‑on images of clothes on a person
Running on Zero

Agents

1.04k

CogVideoX-5B

🎥

1.04k

Text-to-Video
Running on Zero

Agents

66

Svd Keyframe Interpolation

🐨

66

Generate a smooth video between two keyframe images
Runtime error

Agents

Featured

697

Fish Audio S1

🏆

697

Convert text to natural-sounding speech audio
Running on Zero

Agents

516

Finegrain Object Cutter

✂

516

Create HD cutouts from any image with just a prompt
Running on Zero

Agents

Featured

360

GOT Online

💬

360

Extract and format text from images with advanced OCR
Runtime error

Agents

21

Dream Machine

🦀

21
Running

Agents

458

PDF2Audio

📚

458

Generate audio‑ready script from documents
Runtime error

Agents

Featured

390

Llama-Vision-11B

🚀

390

Chat with Llama about images and text
Runtime error

Agents

8

Llama 3.2 90b Text Preview Groq

🌖

8
Paused

Agents

1.02k

Whisper Turbo

🤯

1.02k

Transcribe audio or YouTube videos into text
Build error

Agents

Featured

1.09k

Open NotebookLM

🎙

1.09k

Personalised Podcasts For All - Available in 13 Languages
Running

Agents

72

Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature

🚀

72

Create custom AI podcasts from text, URLs, PDFs, and images
Running on Zero

Agents

Featured

313

PMRF

🖼

313

A gradio demo for Posterior-Mean Rectified Flow (PMRF)
Running on Zero

Agents

Featured

2.88k

F5-TTS

🗣

2.88k

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Running on Zero

Agents

258

MaskGCT TTS Demo

😻

258

MaskGCT TTS Demo
Configuration error

Agents

Featured

149

Fish Agent

💬

149

An end-to-end (e2e) Voice Language Model by Fish Audio.
Paused

Agents

Featured

1.73k

Qwen2.5 Coder Artifacts

🐢

1.73k

Generate and preview app code from a text description
Build error

Agents

451

SeedEdit-APP-V1.0

🎨

451

Generate and edit images using text instructions
Runtime error

Agents

1.13k

OOTDiffusion

🥼

1.13k

High-quality virtual try-on ~ Your cyber fitting room
Running on Zero

Agents

Featured

2.26k

MagicQuill

🪶

2.26k

Edit images with scribble‑based color and edge control
Running on Zero

Agents

Featured

941

OminiControl

🌍

941

Generate new images from a subject photo and text prompt
Paused

Agents

1.17k

IC Light V2-Vary

📈

1.17k

Execute custom code from environment variable
Running on Zero

Agents

64

TryOffDiff

🔥

64

Extract garment images from everyday images!
Running on Zero

Agents

109

Janus Pro 7b

🌍

109

A unified multimodal understanding and generation model.
Runtime error

Agents

Featured

2.02k

Chat With Janus-Pro-7B

🌍

2.02k

A unified multimodal understanding and generation model.
Running on Zero

Agents

315

Llasa 3b Tts

🔥

315

Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Running on Zero

Agents

97

Paligemma2 Mix

🌖

97

Generate text answers or segment objects from images
Runtime error

Featured

464

Gemini Co-Drawing

✏

464

Gemini 2.0 native image generation co-doodling
Running on Zero

Agents

Featured

688

Di♪♪Rhythm

🎶

688

Blazingly Fast and Embarrassingly Simple Song Generation
Running on Zero

Agents

Featured

1.1k

InfiniteYou-FLUX

📸

1.1k

Flexible Photo Recrafting While Preserving Your Identity
Build error

Agents

Featured

76

Text2Human

🏃

76

Generate human images from text descriptions
Runtime error

Agents

Featured

1.1k

GFPGAN

😁

1.1k

Enhance and restore old photos and AI-generated faces
Paused

Agents

43

LiveCC

🐠

43

LiveCC-7B-Instruct
Running on Zero

Agents

Featured

662

ACE Step

😻

662

A Step Towards Music Generation Foundation Model
Running

243

MedGemma - Radiology Explainer Demo

🩺

243

Radiology Image & Report Explainer Demo. Built with MedGemma
Paused

Agents

120

PlayDiffusion

🎨

120

Generate modified audio from text and voice
Running

Agents

Featured

354

MiniMax M1

💬

354

Generate web code from your description and view it live
Running

Agents

Featured

251

PaddleOCR-VL Online Demo

📈

251

Extract text, tables, formulas, and charts from images
Running on A10G

Agents

4

Tx1 Demo

🚀

4

Upload your anndata, get Tx1 embeddings in minutes
Running

Agents

Featured

405

Qwen3 TTS Demo

🚀

405

Generate spoken audio from your text in many voices
Running on Zero

Agents

Featured

1.96k

Qwen3-TTS Demo

🎙

1.96k

Generate speech from text using voice design, cloning or presets

Upvote

Collection guide
Browse collections

AGI

MiniGPT-v2

Latent Consistency Models

XTTS

VALL E X

LLaMA Board

MusicGen

MagicAnimate

Seamless M4T v2

Stable Video Diffusion 1.1

Video LLaVA

Mustango

OpenAI TTS New

3D Arena

Distil Whisper Web

Zero123++ Demo Space

InstaFlow

CLIP Interrogator 2

ZoeDepth

LooseControl

Enhance This DemoFusion SDXL

OutfitAnyone

Gemini Playground

Singing Voice Conversion

Text To Speech

Text To Audio

NaturalSpeech2

AnyDoor Online

MotionCtrl

MotionGPT

GPT-Academic

M2UGen Demo

VCoder

IP-Adapter-FaceID

AnyText

FaceChain

Dreamtalk

I2VGen-XL

ReplaceAnything

PhotoMaker

Resemble Enhance

DiffusionGPT

DiffusionAgent XL

InstantID

DuckDB NSQL 7B

InstructIR

Image to Music v2

BRIA RMBG 1.4

YOLO World

Vision Arena (Testing VLMs side-by-side)

Stable Cascade

Diffusion Transformers (DiT)

SDXL Lightning

YOLO-World + EfficientSAM

Differential Diffusion

YOLOv9 Object Detection w/ Transformers.js

Depth Anything Video

Depth Anything

MeloTTS

Playground V2.5

MoMask

PhotoMaker Style

TCD

TripoSR

Magi Demo

Animagine XL 3.1

Img2img Turbo Sketch

APISR

DynamiCrafter

DynamiCrafter

DragAPart

AI Comic Factory

GRM

AnyV2V

DesignEdit

InstructPix2Pix

Parler-TTS

MagicTime

CustomNet

PixArt Sigma