HF Papers

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

2026-06-03

Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

2026-06-03

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

2026-06-03

PlatonicNav: Unveiling Semantic Correspondence in Navigation with Platonic Topological Maps

2026-06-03

Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging

2026-06-03

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

2026-06-03

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

2026-06-03

MERIT: Learning Disentangled Music Representations for Audio Similarity

2026-06-03

Value-Aware Stochastic KV Cache Eviction for Reasoning Models

2026-06-03

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

2026-06-03

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

2026-06-03

A Multi-AI-agent Framework Enabling End-to-end Finite Element Analysis for Solid Mechanics Problems

2026-06-03

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

2026-06-03

ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

2026-06-03

αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

2026-06-03

AURA: Action-Gated Memory for Robot Policies at Constant VRAM

2026-06-03

Prior Availability in Industrial Visual Sim-to-Real: A Review of CAD-Guided and CAD-Unavailable Regimes

2026-06-03

BA-T: An Iterative Transformer for Two-View Bundle Adjustment

2026-06-03

Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

2026-06-03

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

2026-06-03