HF Papers

AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models

AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models

2026-05-19

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

2026-05-18

PhysBrain 1.0 Technical Report

PhysBrain 1.0 Technical Report

2026-05-18

MMSkills: Towards Multimodal Skills for General Visual Agents

MMSkills: Towards Multimodal Skills for General Visual Agents

2026-05-18

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

2026-05-18

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

2026-05-18

DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo

DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo

2026-05-18

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

2026-05-18

InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation

InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation

2026-05-18

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

2026-05-18

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

2026-05-18

ReactiveGWM: Steering NPC in Reactive Game World Models

ReactiveGWM: Steering NPC in Reactive Game World Models

2026-05-18

Hölder Policy Optimisation

Hölder Policy Optimisation

2026-05-18

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

2026-05-18

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

2026-05-18

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

CM-EVS: Sparse Panoramic RGB-D-Pose Data for Complete Scene Coverage

2026-05-18

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control

2026-05-18

Unlocking Dense Metric Depth Estimation in VLMs

Unlocking Dense Metric Depth Estimation in VLMs

2026-05-18

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

2026-05-18

Steered LLM Activations are Non-Surjective

Steered LLM Activations are Non-Surjective

2026-05-18