HF Papers

High-Fidelity Two-Step Image Generation via Teacher-Aligned End-to-End Distillation

High-Fidelity Two-Step Image Generation via Teacher-Aligned End-to-End Distillation

2026-06-12

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

2026-06-12

SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling

SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling

2026-06-12

Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

2026-06-12

EvoBrowseComp: Benchmarking Search Agents on Evolving Knowledge

EvoBrowseComp: Benchmarking Search Agents on Evolving Knowledge

2026-06-12

MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

2026-06-12

ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages

ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages

2026-06-12

Surflo: Consistent 3D Surface Flow Model with Global State

Surflo: Consistent 3D Surface Flow Model with Global State

2026-06-12

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior

Rethinking Psychometric Evaluation of LLMs: When and Why Self-Reports Predict Behavior

2026-06-12

Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents

Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents

2026-06-12

MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

2026-06-12

See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents

See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents

2026-06-12

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

2026-06-12

WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

2026-06-12

PianoKontext: Expressive Performance Rendering from Deadpan Context

PianoKontext: Expressive Performance Rendering from Deadpan Context

2026-06-12

IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder

IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder

2026-06-12

The Cold-Start Safety Gap in LLM Agents

The Cold-Start Safety Gap in LLM Agents

2026-06-12

ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs

ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs

2026-06-12

A Stationary (and Therefore Compatible) Representation is All You Need

A Stationary (and Therefore Compatible) Representation is All You Need

2026-06-12

WebChallenger: A Reliable and Efficient Generalist Web Agent

WebChallenger: A Reliable and Efficient Generalist Web Agent

2026-06-12