HF Papers

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

2026-03-10

NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving

NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving

2026-03-10

Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned

Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned

2026-03-10

OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning

OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning

2026-03-10

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

2026-03-10

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

2026-03-10

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs

2026-03-10

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

2026-03-10

Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts

Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts

2026-03-10

Agentic Planning with Reasoning for Image Styling via Offline RL

Agentic Planning with Reasoning for Image Styling via Offline RL

2026-03-10

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

2026-03-10

Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation

Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation

2026-03-10

HydroShear: Hydroelastic Shear Simulation for Tactile Sim-to-Real Reinforcement Learning

HydroShear: Hydroelastic Shear Simulation for Tactile Sim-to-Real Reinforcement Learning

2026-03-10

CAST: Modeling Visual State Transitions for Consistent Video Retrieval

CAST: Modeling Visual State Transitions for Consistent Video Retrieval

2026-03-10

SlowBA: An efficiency backdoor attack towards VLM-based GUI agents

SlowBA: An efficiency backdoor attack towards VLM-based GUI agents

2026-03-10

Variational Flow Maps: Make Some Noise for One-Step Conditional Generation

Variational Flow Maps: Make Some Noise for One-Step Conditional Generation

2026-03-10

PresentBench: A Fine-Grained Rubric-Based Benchmark for Slide Generation

PresentBench: A Fine-Grained Rubric-Based Benchmark for Slide Generation

2026-03-10

LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models

LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models

2026-03-10

MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering

MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering

2026-03-10

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

2026-03-10