HF Papers

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

2026-05-05

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

2026-05-05

Hallucinations Undermine Trust; Metacognition is a Way Forward

2026-05-05

AcademiClaw: When Students Set Challenges for AI Agents

2026-05-05

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

2026-05-05

PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments

2026-05-05

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

2026-05-05

Hierarchical Abstract Tree for Cross-Document Retrieval-Augmented Generation

2026-05-05

Generative Modeling with Orbit-Space Particle Flow Matching

2026-05-05

Perceptual Flow Network for Visually Grounded Reasoning

2026-05-05

Linear-Time Global Visual Modeling without Explicit Attention

2026-05-05

Counting as a minimal probe of language model reliability

2026-05-05

Agentic AI Systems Should Be Designed as Marginal Token Allocators

2026-05-05

Code World Model Preparedness Report

2026-05-05

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?

2026-05-05

Motion-Aware Caching for Efficient Autoregressive Video Generation

2026-05-05

BlenderRAG: High-Fidelity 3D Object Generation via Retrieval-Augmented Code Synthesis

2026-05-05

Assessing Pancreatic Ductal Adenocarcinoma Vascular Invasion: the PDACVI Benchmark

2026-05-05

Prior-Aligned Data Cleaning for Tabular Foundation Models

2026-05-05

A Hybrid Approach for Closing the Sim2real Appearance Gap in Game Engine Synthetic Datasets

2026-05-05