AI Native Daily Paper Digest – 20250417
1. ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness 🔑…
AI Native Daily Paper Digest – 20250416
1. xVerify: Efficient Answer Verifier for Reasoning Model Evaluations 🔑 Keywords: reasoning models, complex reasoning, xVerify, equivalence judgment, VAR dataset…
AI Native Daily Paper Digest – 20250415
1. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models 🔑 Keywords: InternVL3, multimodal pre-training paradigm, MLLM, V2PE,…
AI Native Daily Paper Digest – 20250411
1. Kimi-VL Technical Report 🔑 Keywords: Mixture-of-Experts (MoE), Vision-Language Model (VLM), Multimodal Reasoning, Long Context Understanding, Reinforcement Learning (RL) 💡…
AI Native Daily Paper Digest – 20250408
1. SmolVLM: Redefining small and efficient multimodal models 🔑 Keywords: Vision-Language Models, Resource-Efficient, On-Device Applications, Tokenization, Multimodal Performance 💡 Category:…
AI Native Daily Paper Digest – 20250407
1. Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving 🔑 Keywords: Multi-SWE-bench, Large Language Models, Reinforcement Learning, AGI 💡 Category: Reinforcement…
AI Native Daily Paper Digest – 20250404
1. Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems 🔑 Keywords: Large Language…
AI Native Daily Paper Digest – 20250403
1. MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization 🔑 Keywords: Masked Image…
AI Native Daily Paper Digest – 20250402
1. Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation 🔑 Keywords: Any2Caption, Video Generation, Multimodal Large Language Models, Any2CapIns…