junmingyang

jmyang

https://junming-yang.github.io/

junming-yang

AI & ML interests

LLM Alignment, VLM

Recent Activity

authored a paper 10 days ago

Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models

authored a paper 10 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

upvoted a paper 10 days ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Organizations

None yet

authored 2 papers 10 days ago

Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models

Paper • 2511.10656 • Published Nov 3, 2025

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 12 days ago • 44

upvoted a paper 10 days ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Paper • 2606.07074 • Published 15 days ago • 12

upvoted a paper 11 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 12 days ago • 44

upvoted a paper 18 days ago

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

Paper • 2605.29796 • Published 23 days ago • 25

upvoted a paper 24 days ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published 26 days ago • 34

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 11 days ago • 3.02M • 4.96k

upvoted a paper 2 months ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 203

updated a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2

upvoted a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2

updated a model 4 months ago

jmyang/MetaAPO-Qwen2.5-7B

0.5B • Updated Feb 28 • 5 • 1

published a model 4 months ago

jmyang/MetaAPO-Qwen2.5-7B

0.5B • Updated Feb 28 • 5 • 1

updated a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2

updated a model 4 months ago

jmyang/Qwen2.5-7B-rm

1B • Updated Feb 28 • 1

published a model 4 months ago

jmyang/Qwen2.5-7B-rm

1B • Updated Feb 28 • 1

updated a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2