AI Native Daily Paper Digest – 20250514
1. MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder 🔑 Keywords: autoregressive Transformer, Text-to-Speech (TTS), learnable speaker encoder, zero-shot, Flow-VAE 💡 […]
AI Native Daily Paper Digest – 20250513
1. Seed1.5-VL Technical Report 🔑 Keywords: Vision-Language Foundation Model, Multimodal Understanding, Mixture-of-Experts, State-of-the-Art Performance, GUI Control 💡 Category: Multi-Modal Learning 🌟 Research […]
AI Native Daily Paper Digest – 20250512
1. Bielik v3 Small: Technical Report 🔑 Keywords: parameter-efficient, Polish language processing, generative text models, token efficiency, Adaptive Learning Rate 💡 Category: […]
AI Native Daily Paper Digest – 20250509
1. Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models 🔑 Keywords: Large Multimodal Reasoning Models, Multimodal Reasoning, Cross-modal […]
AI Native Daily Paper Digest – 20250508
1. Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities 🔑 Keywords: multimodal understanding, image generation, autoregressive-based architectures, diffusion-based models, GPT-4o […]
AI Native Daily Paper Digest – 20250507
1. Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning 🔑 Keywords: multimodal Reward Models, CoT reasoning, UnifiedReward-Think, reinforcement fine-tuning 💡 Category: Reinforcement […]
AI Native Daily Paper Digest – 20250506
1. Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play 🔑 Keywords: Voice AI, AI Native, Low-latency conversations, Multilingual speech […]
AI Native Daily Paper Digest – 20250505
1. PixelHacker: Image Inpainting with Structural and Semantic Consistency 🔑 Keywords: Image Inpainting, Latent Categories Guidance, Diffusion-Based Model, PixelHacker, Linear Attention 💡 […]
AI Native Daily Paper Digest – 20250502
1. A Survey of Interactive Generative Video 🔑 Keywords: Interactive Generative Video, generative capabilities, interactive features, control signals, responsive feedback 💡 Category: […]
AI Native Daily Paper Digest – 20250501
1. Sadeed: Advancing Arabic Diacritization Through Small Language Model 🔑 Keywords: Arabic text diacritization, morphological richness, fine-tuned, benchmarking, SadeedDiac-25 💡 Category: Natural […]
AI Native Daily Paper Digest – 20250430
1. Reinforcement Learning for Reasoning in Large Language Models with One Training Example 🔑 Keywords: Reinforcement Learning, Large Language Models, Mathematical Reasoning, […]
AI Native Daily Paper Digest – 20250429
1. CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges 🔑 Keywords: Large Language Models, Cryptographic Reasoning, AI Native 💡 […]