HF Papers

SPRITE: From Static Mockups to Engine-Ready Game UI

2026-04-22

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

2026-04-21

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

2026-04-21

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

2026-04-21

OpenGame: Open Agentic Coding for Games

2026-04-21

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

2026-04-21

EasyVideoR1: Easier RL for Video Understanding

2026-04-21

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

2026-04-21

When Can LLMs Learn to Reason with Weak Supervision?

2026-04-21

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

2026-04-21

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

2026-04-21

SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

2026-04-21

Crowded in B-Space: Calibrating Shared Directions for LoRA Merging

2026-04-21

The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation

2026-04-21

Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding

2026-04-21

On the Reliability of Computer Use Agents

2026-04-21

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

2026-04-21

GenericAgent: A Token-Efficient Self-Evolving LLM Agent via Contextual Information Density Maximization (V1.0)

2026-04-21

VoxMind: An End-to-End Agentic Spoken Dialogue System

2026-04-21

Agents Explore but Agents Ignore: LLMs Lack Environmental Curiosity

2026-04-21