HF Papers

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

2026-06-01

RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

2026-06-01

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

2026-06-01

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

2026-06-01

Count Anything

2026-06-01

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

2026-06-01

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

2026-06-01

FRAPPE: Full Input, Residual Output Autoencoding with Projection Pursuit Encoder

2026-06-01

SurGe: Improved Surface Geometry in Point Maps

2026-06-01

DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization

2026-06-01

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models

2026-06-01

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

2026-06-01

MAAT: Multi-phase Adapter-Aware Targeted Unlearning

2026-06-01

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction

2026-06-01

When Confidence Misleads: Suffix Anchoring and Anchor-Proximity Confidence Modulation for Diffusion Language Models

2026-06-01

iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning

2026-06-01

One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation

2026-06-01

AlphaTransit: Learning to Design City-scale Transit Routes

2026-06-01

From Model Scaling to System Scaling: Scaling the Harness in Agentic AI

2026-06-01

Benchmarking Composed Image Retrieval for Applied Earth Observation

2026-06-01