🚀 Introducing Robust-U1: Teaching MLLMs to Self-Recover Corrupted Visual Content

Multimodal Large Language Models (MLLMs) have achieved impressive visual understanding, yet they remain highly brittle under real-world corruptions such as noise, blur, compression artifacts, and adverse weather.

Standard MLLMs suffer dramatic performance drops, and existing robustness solutions come with fundamental limits: black‑box feature alignment lacks interpretability, while white‑box text reasoning cannot restore the lost pixel‑level visual details. This raises a crucial question:

🧐 Can MLLMs recover corrupted visual content by themselves?

If the answer is yes, we can move beyond merely “compensating” for corruption and instead build a more intrinsic, generalizable form of resilience. Robust-U1 is our answer to that question.

💡 Paper: https://arxiv.org/abs/2606.08063
🔗 Code: github.com/jqtangust/Robust-U1
🌍 Demo: Jiaqi-hkust/Robust-U1

liked a model 5 days ago

michaelw9999/Qwen3.6-35B-A3B-NVFP4-MTP-GGUF

Text Generation • 36B • Updated 5 days ago • 2.13k • 4

liked a model 7 days ago

michaelw9999/Qwopus3.6-27B-v2-MTP-NVFP4-GGUF

27B • Updated 7 days ago • 1.75k • 6

reacted to sergiopaniego's post with 🔥 7 days ago

OpenEnv has a new home: github.com/huggingface/OpenEnv

Starting today, it's coordinated by a committee that includes Meta-PyTorch, Reflection, Unsloth, Modal, Prime Intellect, Nvidia, Mercor, Fleet AI, and Hugging Face.

Frontier labs train their models and their harnesses together. Claude knows Claude Code. GPT-5.5 knows Codex. That's not an accident; it's training. Open-source models deserve the same magic, but pulling that off requires infrastructure that belongs to everyone, not one lab.

OpenEnv is that layer: one API, any harness, any trainer, any environment.

Rewards and training loops stay in TRL, Unsloth, wherever you already work. OpenEnv is the socket they all plug into.

Get involved!

Full announcement: https://huggingface.co/blog/openenv-agentic-rl

New activity in michaelw9999/Qwen3.6-27B-NVFP4-MXFP6-MTP-GGUF 10 days ago

Loading model speed

#2 opened 13 days ago by IsiiTheFuture

liked a model 11 days ago

michaelw9999/Qwen3.6-27B-NVFP4-SMALL-MTP-GGUF

27B • Updated 12 days ago • 1.16k • 1

liked 3 models 12 days ago

liked a model 14 days ago

michaelw9999/Qwen3.5-9B-NVFP4-MTP-GGUF

9B • Updated 14 days ago • 2.91k • 3

reacted to RakshitAralimatti's post with ❤️ 14 days ago

Reading engineering and research blogs from OpenAI, Anthropic, DeepMind, Meta and others has genuinely leveled up my understanding of AI systems and helped me in my day-to-day work. But keeping track of 20+ sites manually is a pain.

So I built AI Blogs Tracker, a Streamlit app that scrapes the actual blog listing pages (not search results) of 20+ top AI companies and surfaces titles, dates, and links in one clean feed. You can filter by source or by date, star posts to a reading list, or add your own custom sources.

One click. ~30 seconds. Everything in one place.

🔗 GitHub link - https://github.com/rakshit2020/Tech-Blogs-Tracker-of-Top-AI-Companies-Agent

liked a model 14 days ago

michaelw9999/Qwen3.6-27B-NVFP4-MXFP6-MTP-GGUF

Updated 15 days ago • 2.22k • 5

liked a model 15 days ago

nilayparikh/Qwen3.6-27B-Text-NVFP4-MTP-GGUF

Text Generation • Updated 17 days ago • 3.73k • 8

liked a model 17 days ago

sakamakismile/Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP

Text Generation • 17B • Updated 17 days ago • 65.1k • 59

reacted to prithivMLmods's post with ❤️ 18 days ago

PiD (Pixel Diffusion Decoder), an all-in-one demo for image-edit upscaling and image-generation upscaling, is now live on Spaces! FLUX.2-Klein powers the improvements in realistic image generation and editing, image generation is paired with Z-Image, and upscaling is enabled by default!

🤗 Space: prithivMLmods/PiD-Image-Upscaler
🔗 Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

🤗 To learn more, visit the app page or the respective model pages.