AI Native Daily Paper Digest – 20250704
1. WebSailor: Navigating Super-human Reasoning for Web Agent 🔑 Keywords: WebSailor, LLM, proprietary agents, reasoning capabilities, complex information-seeking tasks 💡 Category: Reinforcement […]
AI Native Daily Paper Digest – 20250703
1. Kwai Keye-VL Technical Report 🔑 Keywords: Multimodal Large Language Models, short-video understanding, vision-language alignment 💡 Category: Multi-Modal Learning 🌟 Research Objective: […]
AI Native Daily Paper Digest – 20250702
1. GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning 🔑 Keywords: Vision-Language Model, Reinforcement Learning, Multimodal Reasoning, Curriculum Sampling, General-Purpose 💡 […]
AI Native Daily Paper Digest – 20250701
1. Ovis-U1 Technical Report 🔑 Keywords: Ovis-U1, multimodal understanding, text-to-image generation, image editing, diffusion-based visual decoder 💡 Category: Generative Models 🌟 Research […]
AI Native Daily Paper Digest – 20250630
1. BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing 🔑 Keywords: BlenderFusion, diffusion model, source masking, simulated object jittering, AI-generated summary 💡 Category: […]
AI Native Daily Paper Digest – 20250626
1. ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation 🔑 Keywords: ShareGPT-4o-Image, Janus-4o, text-to-image, photorealistic, dataset 💡 Category: Generative Models 🌟 Research […]
AI Native Daily Paper Digest – 20250625
1. AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models 🔑 Keywords: AnimaX, video diffusion models, skeleton-based animation, multi-view pose […]
AI Native Daily Paper Digest – 20250624
1. Light of Normals: Unified Feature Representation for Universal Photometric Stereo 🔑 Keywords: Photometric Stereo, Surface Normals, Illumination-Surface Normal Coupling, High-Frequency Geometric […]
AI Native Daily Paper Digest – 20250623
1. Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights 🔑 Keywords: Drag-and-Drop LLMs, Parameter-Efficient Fine-Tuning, LoRA, prompt-conditioned parameter generation, cross-domain generalization 💡 Category: Natural Language Processing […]
AI Native Daily Paper Digest – 20250620
1. Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective 🔑 Keywords: Reinforcement Learning, Large Language Model, RL Reasoning, Cross-Domain Training, […]
AI Native Daily Paper Digest – 20250619
1. Sekai: A Video Dataset towards World Exploration 🔑 Keywords: Sekai, worldwide video dataset, rich annotations, interactive video, world exploration 💡 Category: […]
AI Native Daily Paper Digest – 20250618
1. MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation 🔑 Keywords: Multilingual, Multimodal, LLMs, Benchmark, Financial Domain 💡 Category: […]