AI Native Daily Paper Digest – 20250909
1. Reverse-Engineered Reasoning for Open-Ended Generation 🔑 Keywords: REER, deep reasoning, reverse engineering, gradient-free, DeepWriting-20K 💡 Category: Knowledge Representation and Reasoning 🌟 […]
AI Native Daily Paper Digest – 20250908
1. Why Language Models Hallucinate 🔑 Keywords: Language Models, Hallucinations, Benchmark Scoring, Uncertainty, Trustworthy AI Systems 💡 Category: Natural Language Processing 🌟 […]
AI Native Daily Paper Digest – 20250905
1. Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth 🔑 Keywords: Drivelology, LLMs, NLP, benchmark dataset, pragmatic understanding 💡 Category: Natural Language […]
AI Native Daily Paper Digest – 20250904
1. Open Data Synthesis For Deep Research 🔑 Keywords: AI-generated summary, Deep Research, Hierarchical Constraint Satisfaction Problems, dual-agent system, reasoning trajectories 💡 […]
AI Native Daily Paper Digest – 20250903
1. The Landscape of Agentic Reinforcement Learning for LLMs: A Survey 🔑 Keywords: Agentic reinforcement learning, Large language models, POMDPs, Decision-making agents, […]
AI Native Daily Paper Digest – 20250902
1. PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning 🔑 Keywords: PVPO, computational cost, data pre-sampling, reinforcement learning, State-Of-The-Art (SOTA) 💡 Category: […]
AI Native Daily Paper Digest – 20250901
1. R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning 🔑 Keywords: auto-thinking, multimodal large language models, bi-mode […]
AI Native Daily Paper Digest – 20250829
1. Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning 🔑 Keywords: Reward Hacking, GRPO, Text-to-Image Generation, Preference Fitting, Semantic Consistency […]
AI Native Daily Paper Digest – 20250828
1. Beyond Transcription: Mechanistic Interpretability in ASR 🔑 Keywords: Interpretability methods, ASR, logit lens, semantic biases, repetition hallucinations 💡 Category: Foundations of […]
AI Native Daily Paper Digest – 20250825
1. AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs 🔑 Keywords: Memory-augmented Markov Decision Process (M-MDP), Neural case-selection policy, Episodic memory, Continuous learning […]
AI Native Daily Paper Digest – 20250822
1. Intern-S1: A Scientific Multimodal Foundation Model 🔑 Keywords: Multimodal Mixture-of-Experts, reinforcement learning, scientific domains, open-source models, Artificial General Intelligence 💡 Category: […]
AI Native Daily Paper Digest – 20250821
1. From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models 🔑 Keywords: FinCDM, AI Native, Large Language […]