AI Native Daily Paper Digest – 20251015
1. Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model 🔑 Keywords: Vision-language-action models, Spatial Forcing, 3D foundation models, Robotic tasks 💡 […]
AI Native Daily Paper Digest – 20251014
1. QeRL: Beyond Efficiency — Quantization-enhanced Reinforcement Learning for LLMs 🔑 Keywords: QeRL, Quantization, Reinforcement Learning, Large Language Models, Exploration 💡 Category: […]
AI Native Daily Paper Digest – 20251013
1. D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI 🔑 Keywords: Embodied AI, Desktop environments, Sensorimotor interactions, OWA […]
AI Native Daily Paper Digest – 20251010
1. Agent Learning via Early Experience 🔑 Keywords: early experience, reinforcement learning, self-reflection, out-of-domain generalization, AI-generated 💡 Category: Reinforcement Learning 🌟 Research […]
AI Native Daily Paper Digest – 20251009
1. Cache-to-Cache: Direct Semantic Communication Between Large Language Models 🔑 Keywords: Cache-to-Cache, Large Language Models, neural network, direct semantic communication, latency 💡 […]
AI Native Daily Paper Digest – 20251003
1. LongCodeZip: Compress Long Context for Code Language Models 🔑 Keywords: LongCodeZip, Large Language Models, code compression, context pruning 💡 Category: AI […]
AI Native Daily Paper Digest – 20251002
1. DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search 🔑 Keywords: RLVR, DeepSearch, Monte Carlo […]
AI Native Daily Paper Digest – 20251001
1. MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use 🔑 Keywords: MCPMark, LLMs, MCP, AI Agents, Benchmarking 💡 Category: AI […]
AI Native Daily Paper Digest – 20250929
1. LongLive: Real-time Interactive Long Video Generation 🔑 Keywords: Long video generation, Causal attention, KV-recache, Interactive capabilities, INT8-quantized inference 💡 Category: Generative […]
AI Native Daily Paper Digest – 20250925
1. Video models are zero-shot learners and reasoners 🔑 Keywords: Veo 3, Zero-shot capabilities, Generative models, Unified vision foundation models 💡 Category: […]
AI Native Daily Paper Digest – 20250924
1. Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR 🔑 Keywords: Arabic document OCR, vision-language model, decoder-only fine-tuning, WER, domain-specific adaptation 💡 […]
AI Native Daily Paper Digest – 20250923
1. LIMI: Less is More for Agency 🔑 Keywords: AI Native, autonomous agents, agentic intelligence, strategic curation, Agency Efficiency Principle 💡 Category: […]