AI Native Daily Paper Digest – 20250516
1. Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models 🔑 Keywords: Large reasoning models, AI Native, Reinforcement Learning, Meta-abilities, Performance […]
AI Native Daily Paper Digest – 20250515
1. BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset 🔑 Keywords: Unifying image understanding, Image generation, Diffusion transformer, […]
AI Native Daily Paper Digest – 20250514
1. MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder 🔑 Keywords: autoregressive Transformer, Text-to-Speech (TTS), learnable speaker encoder, zero-shot, Flow-VAE 💡 […]
AI Native Daily Paper Digest – 20250513
1. Seed1.5-VL Technical Report 🔑 Keywords: Vision-Language Foundation Model, Multimodal Understanding, Mixture-of-Experts, State-of-the-Art Performance, GUI Control 💡 Category: Multi-Modal Learning 🌟 Research […]
AI Native Daily Paper Digest – 20250512
1. Bielik v3 Small: Technical Report 🔑 Keywords: parameter-efficient, Polish language processing, generative text models, token efficiency, Adaptive Learning Rate 💡 Category: […]
AI Native Daily Paper Digest – 20250509
1. Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models 🔑 Keywords: Large Multimodal Reasoning Models, Multimodal Reasoning, Cross-modal […]
AI Native Daily Paper Digest – 20250508
1. Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities 🔑 Keywords: multimodal understanding, image generation, autoregressive-based architectures, diffusion-based models, GPT-4o […]
AI Native Daily Paper Digest – 20250507
1. Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning 🔑 Keywords: multimodal Reward Models, CoT reasoning, UnifiedReward-Think, reinforcement fine-tuning 💡 Category: Reinforcement […]
AI Native Daily Paper Digest – 20250506
1. Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play 🔑 Keywords: Voice AI, AI Native, Low-latency conversations, Multilingual speech […]
AI Native Daily Paper Digest – 20250505
1. PixelHacker: Image Inpainting with Structural and Semantic Consistency 🔑 Keywords: Image Inpainting, Latent Categories Guidance, Diffusion-Based Model, PixelHacker, Linear Attention 💡 […]
AI Native Daily Paper Digest – 20250502
1. A Survey of Interactive Generative Video 🔑 Keywords: Interactive Generative Video, generative capabilities, interactive features, control signals, responsive feedback 💡 Category: […]
AI Native Daily Paper Digest – 20250501
1. Sadeed: Advancing Arabic Diacritization Through Small Language Model 🔑 Keywords: Arabic text diacritization, morphological richness, fine-tuned, benchmarking, SadeedDiac-25 💡 Category: Natural […]