AI Native Daily Paper Digest – 20250117
1. Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Keywords: Generative Models, Large Language Models, Diffusion Models, Inference-time Scaling […]
AI Native Daily Paper Digest – 20250116
1. Towards Best Practices for Open Datasets for LLM Training
Keywords: Copyright, Legal Challenges, AI Models, Transparency, Open Access
Category: […]
AI Native Daily Paper Digest – 20250115
1. MiniMax-01: Scaling Foundation Models with Lightning Attention
Keywords: MiniMax-01, Lightning Attention, Mixture of Experts, Vision-Language Model
Category: Generative Models […]
AI Native Daily Paper Digest – 20250114
1. The Lessons of Developing Process Reward Models in Mathematical Reasoning
Keywords: Process Reward Models, Large Language Models, Monte Carlo estimation, […]
AI Native Daily Paper Digest – 20250113
1. VideoRAG: Retrieval-Augmented Generation over Video Corpus
Keywords: Retrieval-Augmented Generation, Multi-Modal Learning, VideoRAG, Large Video Language Models, Video Integration
Category: […]
AI Native Daily Paper Digest – 20250110
1. The GAN is dead; long live the GAN! A Modern GAN Baseline
Keywords: GAN Loss, Regularization, AI Native, Modernization, R3GAN […]
AI Native Daily Paper Digest – 20250109
1. rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Keywords: Small Language Models, Math Reasoning, Monte Carlo Tree […]
AI Native Daily Paper Digest – 20250108
1. REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Keywords: Reinforcement Learning, Human Feedback, REINFORCE++, Proximal Policy Optimization, […]
AI Native Daily Paper Digest – 20250107
1. STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Keywords: Video Super-Resolution, Temporal Consistency, T2V Models, GAN, Artifacts […]
AI Native Daily Paper Digest – 20250106
1. EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Keywords: EnerVerse, robotic manipulation, Free Anchor View, 4D Gaussian Splatting, sim-to-real gap […]
AI Native Daily Paper Digest – 20250103
1. 2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Keywords: Vision-Language Models, Multimodal Textbook, Instructional Videos, Image-Text Alignment […]
AI Native Daily Paper Digest – 20250102
1. OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Keywords: Vision-Language Models, GUI agents, OS-Genesis, GUI data synthesis […]