AI Native Daily Paper Digest – 20241231
1. Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization π Keywords: Computer Vision, zero-shot task generalization, Explanatory Instructions, vision-language model […]
AI Native Daily Paper Digest – 20241230
1. HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs π Keywords: OpenAI o1, Reasoning, Medical Domain, Reinforcement Learning, HuatuoGPT-o1 π‘ Category: AI in […]
AI Native Daily Paper Digest – 20241226
1. Token-Budget-Aware LLM Reasoning π Keywords: LLMs, Chain-of-Thought, Reasoning, Token Budget, Efficiency π‘ Category: Natural Language Processing π Research Objective: – The […]
AI Native Daily Paper Digest – 20241225
1. 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding π Keywords: 3D scene graph, Large Language Models, semantic […]
AI Native Daily Paper Digest – 20241224
1. RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response π Keywords: Supervised Fine-Tuning, Noise-Robust Framework, Large Language Models, RobustFT, […]
AI Native Daily Paper Digest – 20241223
1. Parallelized Autoregressive Visual Generation π Keywords: Autoregressive models, Parallel generation, Visual generation, Inference speed π‘ Category: Generative Models π Research Objective: […]
AI Native Daily Paper Digest – 20241220
1. Qwen2.5 Technical Report π Keywords: Qwen2.5, Large Language Models, Reinforcement Learning, Human Preference Alignment π‘ Category: Natural Language Processing π Research […]
AI Native Daily Paper Digest – 20241219
1. No More Adam: Learning Rate Scaling at Initialization is All You Need π Keywords: Adaptive Gradient Methods, SGD-SaI, Transformer, ImageNet-1K, GPT-2 […]
AI Native Daily Paper Digest – 20241218
1. Are Your LLMs Capable of Stable Reasoning? π Keywords: Large Language Models, G-Pass@k, LiveMathBench, Evaluation Metrics π‘ Category: Natural Language Processing […]
AI Native Daily Paper Digest – 20241217
1. Byte Latent Transformer: Patches Scale Better Than Tokens π Keywords: Byte Latent Transformer, LLM architecture, inference efficiency, scaling, raw bytes π‘ […]
AI Native Daily Paper Digest – 20241216
1. Apollo: An Exploration of Video Understanding in Large Multimodal Models π Keywords: Large Multimodal Models, video understanding, Apollo, Scaling Consistency, video-LMMs […]
AI Native Daily Paper Digest – 20241213
1. InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions π Keywords: Specialized Generalist AI, Multimodal Large Language Models, […]