AI Native Daily Paper Digest – 20250203
1. s1: Simple test-time scaling π Keywords: Test-time scaling, Language modeling, OpenAI, Reasoning performance, Budget forcing π‘ Category: Natural Language Processing π […]
AI Native Daily Paper Digest – 20250131
1. GuardReasoner: Towards Reasoning-based LLM Safeguards π Keywords: LLMs, GuardReasoner, reasoning, guard models, safety-critical applications π‘ Category: Knowledge Representation and Reasoning π […]
AI Native Daily Paper Digest – 20250130
1. Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate π Keywords: Supervised Fine-Tuning (SFT), Critique Fine-Tuning (CFT), GPT-4o, […]
AI Native Daily Paper Digest – 20250129
1. SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training π Keywords: supervised fine-tuning, reinforcement learning, model generalization, text-based rule […]
AI Native Daily Paper Digest – 20250128
1. Baichuan-Omni-1.5 Technical Report π Keywords: Baichuan-Omni-1.5, omni-modal, audio generation, MLLM π‘ Category: Multi-Modal Learning π Research Objective: – To introduce Baichuan-Omni-1.5, […]
AI Native Daily Paper Digest – 20250127
1. Humanity’s Last Exam π Keywords: Benchmarks, Large Language Models, LLM Capabilities, Humanity’s Last Exam, Multi-Modal π‘ Category: Natural Language Processing π […]
AI Native Daily Paper Digest – 20250124
1. SRMT: Shared Memory for Multi-agent Lifelong Pathfinding π Keywords: Multi-agent reinforcement learning, Cooperative multi-agent problems, Shared Recurrent Memory Transformer, Coordination, Decentralized […]
AI Native Daily Paper Digest – 20250123
1. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning π Keywords: DeepSeek-R1-Zero, DeepSeek-R1, Reinforcement Learning, Reasoning Models, Open-source π‘ Category: Reinforcement […]
AI Native Daily Paper Digest – 20250122
1. Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training π Keywords: Large Language Models, Intelligent Agent, Self-Training Framework, Error Correction, […]
AI Native Daily Paper Digest – 20250121
1. GameFactory: Creating New Games with Generative Interactive Videos π Keywords: Generative game engines, scene generalization, video diffusion models, action-controllable π‘ Category: […]
AI Native Daily Paper Digest – 20250120
1. Evolving Deeper LLM Thinking π Keywords: Mind Evolution, Inference time compute, Language model, Natural language planning π‘ Category: Natural Language Processing […]
AI Native Daily Paper Digest – 20250117
1. Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps π Keywords: Generative Models, Large Language Models, Diffusion Models, Inference-time Scaling π‘ […]