AI Native Daily Paper Digest – 20250912
1. VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model 🔑 Keywords: VLA-Adapter, Bridge Attention, lightweight Policy module, state-of-the-art performance, fast inference speed […]
AI Native Daily Paper Digest – 20250911
1. A Survey of Reinforcement Learning for Large Reasoning Models 🔑 Keywords: Reinforcement Learning, Large Language Models, Artificial SuperIntelligence, DeepSeek-R1, Reasoning Abilities […]
AI Native Daily Paper Digest – 20250910
1. Parallel-R1: Towards Parallel Thinking via Reinforcement Learning 🔑 Keywords: Parallel thinking, Reinforcement learning, Large language models, Progressive curriculum, Cold-start problem 💡 […]
AI Native Daily Paper Digest – 20250909
1. Reverse-Engineered Reasoning for Open-Ended Generation 🔑 Keywords: REER, deep reasoning, reverse engineering, gradient-free, DeepWriting-20K 💡 Category: Knowledge Representation and Reasoning 🌟 […]
AI Native Daily Paper Digest – 20250908
1. Why Language Models Hallucinate 🔑 Keywords: Language Models, Hallucinations, Benchmark Scoring, Uncertainty, Trustworthy AI Systems 💡 Category: Natural Language Processing 🌟 […]
AI Native Daily Paper Digest – 20250905
1. Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth 🔑 Keywords: Drivelology, LLMs, NLP, benchmark dataset, pragmatic understanding 💡 Category: Natural Language […]
AI Native Daily Paper Digest – 20250904
1. Open Data Synthesis For Deep Research 🔑 Keywords: AI-generated summary, Deep Research, Hierarchical Constraint Satisfaction Problems, dual-agent system, reasoning trajectories 💡 […]
AI Native Daily Paper Digest – 20250903
1. The Landscape of Agentic Reinforcement Learning for LLMs: A Survey 🔑 Keywords: Agentic reinforcement learning, Large language models, POMDPs, Decision-making agents, […]
AI Native Daily Paper Digest – 20250902
1. PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning 🔑 Keywords: PVPO, computational cost, data pre-sampling, reinforcement learning, State-Of-The-Art (SOTA) 💡 Category: […]
AI Native Daily Paper Digest – 20250901
1. R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning 🔑 Keywords: auto-thinking, multimodal large language models, bi-mode […]
AI Native Daily Paper Digest – 20250829
1. Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning 🔑 Keywords: Reward Hacking, GRPO, Text-to-Image Generation, Preference Fitting, Semantic Consistency […]
AI Native Daily Paper Digest – 20250828
1. Beyond Transcription: Mechanistic Interpretability in ASR 🔑 Keywords: Interpretability methods, ASR, logit lens, semantic biases, repetition hallucinations 💡 Category: Foundations of […]