AI Native Daily Paper Digest – 20250613
1. ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning 🔑 Keywords: ReasonMed, LLMs, Error Refiner, Chain-of-Thought, Medical Reasoning 💡 Category: […]
AI Native Daily Paper Digest – 20250612
1. Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models 🔑 Keywords: Reinforcement Learning, Self-Confidence, Large Language Models, AI-generated summary, […]
AI Native Daily Paper Digest – 20250611
1. Geopolitical biases in LLMs: what are the “good” and the “bad” countries according to contemporary language models 🔑 Keywords: LLMs, geopolitical […]
AI Native Daily Paper Digest – 20250610
1. Reinforcement Pre-Training 🔑 Keywords: Reinforcement Pre-Training, language model accuracy, scalable method, next-token prediction, reinforcement learning 💡 Category: Reinforcement Learning 🌟 Research […]
AI Native Daily Paper Digest – 20250609
1. Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA 🔑 Keywords: EvergreenQA, LLMs, multilingual QA dataset, […]
AI Native Daily Paper Digest – 20250606
1. SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training 🔑 Keywords: Video Restoration, AI-Generated, Adaptive Window Attention, Feature Matching Loss 💡 Category: […]
AI Native Daily Paper Digest – 20250605
1. MiMo-VL Technical Report 🔑 Keywords: MiMo-VL-7B-SFT, MiMo-VL-7B-RL, Multimodal Reasoning, Mixed On-policy Reinforcement Learning, Chain-of-Thought 💡 Category: Multi-Modal Learning 🌟 Research Objective: […]
AI Native Daily Paper Digest – 20250604
1. UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation 🔑 Keywords: Unified Generative Framework, Visual-Language Models, Image Perception, Image Manipulation […]
AI Native Daily Paper Digest – 20250603
1. Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning 🔑 Keywords: RLVR, token entropy patterns, high-entropy […]
AI Native Daily Paper Digest – 20250602
1. ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models 🔑 Keywords: Prolonged reinforcement learning, Reasoning strategies, Language models, AI-generated […]
AI Native Daily Paper Digest – 20250530
1. Table-R1: Inference-Time Scaling for Table Reasoning 🔑 Keywords: Table Reasoning, Distillation, RLVR, Generalization, LLMs 💡 Category: Knowledge Representation and Reasoning 🌟 […]
AI Native Daily Paper Digest – 20250529
1. The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models 🔑 Keywords: Policy Entropy, Entropy Dynamics, LLMs, Reinforcement Learning, Exploration 💡 […]