Graysonjohnson's picture

Graysonjohnson

graysonjohnson3

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

liked a dataset 17 days ago

DCAgent3/medagentbench_g1_diverse_tezos_top4_316_8b_20260602_100929

upvoted a paper 17 days ago

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 3 days ago • 70

upvoted a paper 17 days ago

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

Paper • 2605.26293 • Published 25 days ago • 6

upvoted a paper 19 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 23 days ago • 426

upvoted a paper 29 days ago

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

Paper • 2605.20682 • Published about 1 month ago • 84

upvoted 3 papers about 1 month ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

Paper • 2605.13511 • Published May 13 • 33

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 105

upvoted a paper about 2 months ago

The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability

Paper • 2604.17698 • Published Apr 20 • 4

upvoted 2 papers 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 507

upvoted 3 papers 3 months ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published Apr 2 • 32

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 343

upvoted 3 papers 4 months ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525