AI Native Daily Paper Digest – 20250402
1. Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation 🔑 Keywords: Any2Caption, Video Generation, Multimodal Large Language Models, Any2CapIns 💡 Category: […]
AI Native Daily Paper Digest – 20250401
1. TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes 🔑 Keywords: Complex Visual Text Generation, TextCrafter, Multi-Visual Text Rendering, Generative Models, […]
AI Native Daily Paper Digest – 20250331
1. AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation 🔑 Keywords: Large Language Models, domain adaptation, vocabulary adaptation, AdaptiVocab, […]
AI Native Daily Paper Digest – 20250328
1. Video-R1: Reinforcing Video Reasoning in MLLMs 🔑 Keywords: Video Reasoning, T-GRPO Algorithm, Multi-Modal Large Language Models, Temporal Modeling, Video-R1 💡 Category: […]
AI Native Daily Paper Digest – 20250327
1. Qwen2.5-Omni Technical Report 🔑 Keywords: Multimodal model, End-to-end, Thinker-Talker architecture, TMRoPE, Streaming 💡 Category: Multi-Modal Learning 🌟 Research Objective: – The […]
AI Native Daily Paper Digest – 20250324
1. When Less is Enough: Adaptive Token Reduction for Efficient Image Representation 🔑 Keywords: Collection, Knowledge Representation, AI Systems 💡 Category: Knowledge […]
AI Native Daily Paper Digest – 20250320
1. φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation 🔑 Keywords: Collection 💡 Category: Knowledge Representation and Reasoning 🌟 Research […]
AI Native Daily Paper Digest – 20250319
1. RWKV-7 “Goose” with Expressive Dynamic State Evolution 🔑 Keywords: Collection 💡 Category: Knowledge Representation and Reasoning 🌟 Research Objective: – To […]
AI Native Daily Paper Digest – 20250317
1. ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Collection 1. Multi-Modal Learning 2. Generative Models 3. Reinforcement Learning 4. Computer Vision […]
AI Native Daily Paper Digest – 20250314
1. CoSTAast: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing 🔑 Keywords: Collection 💡 Category: Foundations of AI 🌟 Research Objective: – Investigate […]
AI Native Daily Paper Digest – 20250313
1. TPDiff: Temporal Pyramid Video Diffusion Model 🔑 Keywords: Collection 💡 Category: Foundations of AI 🌟 Research Objective: – The paper focuses […]
AI Native Daily Paper Digest – 20250312
1. Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Collection 🔑 Keywords: Collection 💡 Category: Foundations of […]