HF Papers

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

2026-06-18

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction

2026-06-18

Kairos: A Native World Model Stack for Physical AI

2026-06-18

Guava: An Effective and Universal Harness for Embodied Manipulation

2026-06-18

The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

2026-06-18

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

2026-06-18

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

2026-06-18

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

2026-06-18

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

2026-06-18

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

2026-06-18

Native Active Perception as Reasoning for Omni-Modal Understanding

2026-06-18

Sumi: Open Uniform Diffusion Language Model from Scratch

2026-06-18

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

2026-06-18

MaineCoon: Pursuing A Real-Time Audio-Visual Social World Model

2026-06-18

Beyond Alignment: Value Diversity as a Collective Property in Multicultural Agent Systems

2026-06-18

CEO-Bench: Can Agents Play the Long Game?

2026-06-18

ViT-Up: Faithful Feature Upsampling for Vision Transformers

2026-06-18

PAIWorld: A 3D-Consistent World Foundation Model for Robotic Manipulation

2026-06-18

SciOrch: Learning to Orchestrate Expert LLMs for Solving Frontier Multimodal Scientific Reasoning Tasks

2026-06-18

RODS: Reward-Driven Online Data Synthesis for Multi-Turn Tool-Use Agents

2026-06-18