AI Native Daily Paper Digest – 20250617
1. MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention 🔑 Keywords: MiniMax-M1, Mixture-of-Experts, Reinforcement Learning, AI Native, CISPO 💡 Category:…
AI Native Daily Paper Digest – 20250616
1. Effective Red-Teaming of Policy-Adherent Agents 🔑 Keywords: CRAFT, policy-adherent agents, adversarial users, policy-aware persuasive strategies, tau-break 💡 Category: Human-AI…
AI Native Daily Paper Digest – 20250613
1. ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning 🔑 Keywords: ReasonMed, LLMs, Error Refiner, Chain-of-Thought, Medical Reasoning…
AI Native Daily Paper Digest – 20250612
1. Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models 🔑 Keywords: Reinforcement Learning, Self-Confidence, Large Language Models,…
AI Native Daily Paper Digest – 20250611
1. Geopolitical biases in LLMs: what are the “good” and the “bad” countries according to contemporary language models 🔑 Keywords:…
AI Native Daily Paper Digest – 20250610
1. Reinforcement Pre-Training 🔑 Keywords: Reinforcement Pre-Training, language model accuracy, scalable method, next-token prediction, reinforcement learning 💡 Category: Reinforcement Learning…
AI Native Daily Paper Digest – 20250609
1. Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA 🔑 Keywords: EvergreenQA, LLMs, multilingual…
AI Native Daily Paper Digest – 20250606
1. SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training 🔑 Keywords: Video Restoration, AI-Generated, Adaptive Window Attention, Feature Matching Loss…
AI Native Daily Paper Digest – 20250605
1. MiMo-VL Technical Report 🔑 Keywords: MiMo-VL-7B-SFT, MiMo-VL-7B-RL, Multimodal Reasoning, Mixed On-policy Reinforcement Learning, Chain-of-Thought 💡 Category: Multi-Modal Learning 🌟…