AI Native Daily Paper Digest – 20250103
1. 2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining 🔑 Keywords: Vision-Language Models, Multimodal Textbook, Instructional Videos, Image-Text Alignment 💡 […]
AI Native Daily Paper Digest – 20250102
1. OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis 🔑 Keywords: Vision-Language Models, GUI agents, OS-Genesis, GUI data synthesis 💡 […]
AI Native Daily Paper Digest – 20241231
1. Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization 🔑 Keywords: Computer Vision, zero-shot task generalization, Explanatory Instructions, vision-language model […]
AI Native Daily Paper Digest – 20241230
1. HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs 🔑 Keywords: OpenAI o1, Reasoning, Medical Domain, Reinforcement Learning, HuatuoGPT-o1 💡 Category: AI in […]
AI Native Daily Paper Digest – 20241226
1. Token-Budget-Aware LLM Reasoning 🔑 Keywords: LLMs, Chain-of-Thought, Reasoning, Token Budget, Efficiency 💡 Category: Natural Language Processing 🌟 Research Objective: – The […]
AI Native Daily Paper Digest – 20241225
1. 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding 🔑 Keywords: 3D scene graph, Large Language Models, semantic […]
AI Native Daily Paper Digest – 20241224
1. RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response 🔑 Keywords: Supervised Fine-Tuning, Noise-Robust Framework, Large Language Models, RobustFT, […]
AI Native Daily Paper Digest – 20241223
1. Parallelized Autoregressive Visual Generation 🔑 Keywords: Autoregressive models, Parallel generation, Visual generation, Inference speed 💡 Category: Generative Models 🌟 Research Objective: […]
AI Native Daily Paper Digest – 20241220
1. Qwen2.5 Technical Report 🔑 Keywords: Qwen2.5, Large Language Models, Reinforcement Learning, Human Preference Alignment 💡 Category: Natural Language Processing 🌟 Research […]
AI Native Daily Paper Digest – 20241219
1. No More Adam: Learning Rate Scaling at Initialization is All You Need 🔑 Keywords: Adaptive Gradient Methods, SGD-SaI, Transformer, ImageNet-1K, GPT-2 […]
AI Native Daily Paper Digest – 20241218
1. Are Your LLMs Capable of Stable Reasoning? 🔑 Keywords: Large Language Models, G-Pass@k, LiveMathBench, Evaluation Metrics 💡 Category: Natural Language Processing […]
AI Native Daily Paper Digest – 20241217
1. Byte Latent Transformer: Patches Scale Better Than Tokens 🔑 Keywords: Byte Latent Transformer, LLM architecture, inference efficiency, scaling, raw bytes 💡 […]