China AI Native Industry Insights – 20250425 – Baidu | Tencent | ByteDance | more

Explore Baidu’s launch of the Wenxin 4.5 Turbo and X1 Turbo models with an impressive 80% cost reduction, Tencent’s significant upgrade to HunyuanAI 3D featuring a 10x parameter boost and doubled free generation quota, and LiveCC’s advancements in video LLMs through large-scale automatic speech recognition. Discover more in Today’s China AI Native Industry Insights.

1. Baidu Launches Wenxin 4.5 Turbo and X1 Turbo Models with 80% Cost Reduction

🔑 Key Details:
– New Models Released: Baidu launches Wenxin 4.5 Turbo and X1 Turbo with enhanced multimodal capabilities and major price cuts (80% and 50% respectively)
– Ultra-Realistic Digital Human: Supports professional script creation, facial expression control, and interactive livestreaming for e-commerce
– Super Agent App: “Xinxiang” introduced as a universal AI agent capable of autonomous task planning and execution
– MCP Ecosystem Support: China’s first e-commerce search and transaction MCP servers released, built on the Model Context Protocol (MCP) to connect models with external tools and data

💡 How It Helps:
– AI Developers: Lower cost (0.8 yuan per million input tokens / 3.2 yuan per million output tokens) encourages wider experimentation and deployment; see the worked cost example after this list
– Content Creators: “Cangzhou OS” boosts productivity by combining knowledge bases with content creation and playback tools
– Non-Technical Users: “Miaoda” enables no-code development using simple natural language instructions
– E-commerce Teams: Digital humans act as full-service marketing teams with AI-powered content, performance, and user interaction
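
To put the quoted Wenxin 4.5 Turbo pricing in concrete terms, here is a minimal back-of-the-envelope calculation in Python. The per-token rates come from the article; the request sizes and request count are illustrative assumptions, not figures from the announcement.

```python
# Wenxin 4.5 Turbo pricing quoted in the article (yuan per million tokens).
INPUT_PRICE_PER_M = 0.8   # input tokens
OUTPUT_PRICE_PER_M = 3.2  # output tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the cost in yuan for a single request of the given size."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M + \
           (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Illustrative workload (hypothetical numbers): 10,000 requests, each with a
# 2,000-token prompt and a 500-token completion.
per_request = request_cost(2_000, 500)
print(f"Per request: {per_request:.4f} yuan")                # 0.0032 yuan
print(f"10,000 requests: {per_request * 10_000:.2f} yuan")   # 32.00 yuan
```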

🌟 Why It Matters:
Baidu’s price reduction strategy lowers barriers to AI adoption while its focus on practical scenarios—from livestreaming agents to no-code tools—signals a shift from model development to ecosystem growth. With MCP architecture in place, Baidu positions itself as a core infrastructure provider for scalable AI innovation in China.

Original Chinese article: https://mp.weixin.qq.com/s/vsXxuaD0B_yfpnIljDJmtw

English translation via free online service: https://translate.google.com/translate?hl=en&sl=zh-CN&tl=en&u=https%3A%2F%2Fmp.weixin.qq.com%2Fs%2FvsXxuaD0B_yfpnIljDJmtw

Video Credit: The original article

2. Tencent Upgrades HunyuanAI 3D to Ultra-HD with 10x Parameter Boost and Doubled Free Generation Quota

🔑 Key Details:
– Model Size Expansion: Parameters increased from 1B to 10B, with 10x more effective mesh faces.
– Resolution Upgrade: Geometric modeling detail reaches 1024 resolution, transitioning from standard to high-definition quality.
– Texture Enhancements: Now supports 4K textures and bump maps for realistic surface details.
– Free Usage Increase: Free generation quota doubled to 20 times per day.

💡 How It Helps:
– Game Developers: Streamlined 3D pipeline with professional templates saves time and resources in production.
– 3D Animators: Optimized skeletal skinning system with auto-rigging dramatically improves animation workflow efficiency.
– Product Designers: Multi-view input capability with PBR modeling creates more photorealistic product visualizations.
– Enterprise Developers: API is now officially available on Tencent Cloud for integration into business applications (an illustrative sketch follows below).
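
The article states that the API is live on Tencent Cloud but does not document the interface. Purely as an illustration of how such an integration might be wired up, here is a hedged sketch: the endpoint URL, field names, and response shape are hypothetical placeholders, not Tencent Cloud’s actual API schema; consult the official Tencent Cloud documentation for the real signature.

```python
import requests

# Hypothetical integration sketch -- the endpoint, parameters, and response
# fields below are placeholders, NOT Tencent Cloud's actual API schema.
API_ENDPOINT = "https://example-tencent-cloud-endpoint/hunyuan-3d/generate"

def generate_3d_asset(image_path: str, api_key: str) -> dict:
    """Submit a single reference image and return the (assumed) job metadata."""
    with open(image_path, "rb") as f:
        resp = requests.post(
            API_ENDPOINT,
            headers={"Authorization": f"Bearer {api_key}"},
            files={"image": f},
            data={"texture_resolution": "4k"},  # assumed option name
            timeout=60,
        )
    resp.raise_for_status()
    return resp.json()  # e.g. {"job_id": "...", "status": "queued"} (assumed)
```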

🌟 Why It Matters:
This significant upgrade represents a strategic push to democratize 3D content creation while maintaining professional quality. By boosting both accessibility and technical capabilities, Tencent positions HunyuanAI as a versatile solution addressing the growing demand for 3D assets across gaming, e-commerce, social platforms, and industrial applications, further accelerating the adoption of AI-generated 3D content in commercial workflows.

Original Chinese article: https://mp.weixin.qq.com/s/ksqIyJ8EPxqzjcDceOAByQ

English translation via free online service: https://translate.google.com/translate?hl=en&sl=zh-CN&tl=en&u=https%3A%2F%2Fmp.weixin.qq.com%2Fs%2FksqIyJ8EPxqzjcDceOAByQ

Video Credit: The original article

3. LiveCC: Advancing Video LLMs with Automatic Speech Recognition at Scale

🔑 Key Details:
– Cost-Effective Training: LiveCC uses automatic speech recognition (ASR) instead of costly human annotations or proprietary APIs like GPT-4o.
– Novel Streaming Approach: Interleaves ASR words and video frames by timestamp for temporally aligned vision-language modeling (see the sketch after this list).
– Strong Performance: LiveCC-7B-Instruct outperforms larger 72B models in commentary quality and achieves state-of-the-art results on video QA benchmarks.
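
To make the interleaving idea concrete, here is a minimal sketch of how timestamped ASR words might be merged with sampled video frames into one temporally ordered sequence. It is a simplified illustration of the general approach described above, not the authors’ actual preprocessing code; the data structures and field names are assumptions.

```python
from dataclasses import dataclass

@dataclass
class AsrWord:
    text: str
    start: float  # seconds, as produced by a word-level ASR model such as WhisperX

@dataclass
class Frame:
    index: int
    timestamp: float  # seconds at which the frame was sampled

def interleave(words: list[AsrWord], frames: list[Frame]) -> list[str]:
    """Merge ASR words and frame placeholders into one timestamp-ordered token stream."""
    events = [(w.start, w.text) for w in words] + \
             [(f.timestamp, f"<frame_{f.index}>") for f in frames]
    events.sort(key=lambda e: e[0])        # order strictly by time
    return [token for _, token in events]  # drop timestamps, keep the sequence

# Toy example: two frames sampled at 0.0s and 1.0s, words spoken in between.
words = [AsrWord("the", 0.2), AsrWord("ball", 0.5), AsrWord("drops", 1.3)]
frames = [Frame(0, 0.0), Frame(1, 1.0)]
print(interleave(words, frames))
# ['<frame_0>', 'the', 'ball', '<frame_1>', 'drops']
```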

💡 How It Helps:
– AI Researchers: Provides a scalable method to train video LLMs without expensive manual annotations.
– Video Content Creators: Enables real-time video commentary capabilities for enhanced user experiences.
– ML Engineers: Offers access to Live-CC-5M and Live-WhisperX-526K datasets for training similar models.

🌟 Why It Matters:
This research represents a significant step toward democratizing video AI by removing dependencies on expensive resources. By proving ASR-based training can produce state-of-the-art results, LiveCC opens new possibilities for real-time video understanding applications while potentially lowering barriers to entry in multimodal AI development.

Original article: https://showlab.github.io/livecc/

Video Credit: The original article

4. Introducing Hailuo Image’s New Feature: Character Reference!

🔑 Key Details:
– Character Adaptation: Hailuo Image’s “Character Reference” turns a single image into dynamic characters with customizable angles, lighting, and emotions.
– Prompt Control: Users can guide output through natural language prompts to generate expressive, story-driven character visuals.
– Visual Consistency: Maintains character identity across poses and compositions, ideal for animation, comics, and visual storytelling.

💡 How It Helps:
– Creators & Artists: Rapidly generate diverse character illustrations from a single reference for storyboards, games, or comics.
– Designers: Customize characters with detailed prompts without needing 3D rigs or manual redraws.
– Developers: Integrate character generation into creative pipelines for avatar creation or animated content.

🌟 Why It Matters:
By enabling prompt-based, reference-consistent character generation, Hailuo Image bridges the gap between static illustration and dynamic storytelling. This marks a new phase in AI-assisted visual creation—more expressive, controllable, and creator-friendly than ever.

Original article: https://x.com/Hailuo_AI/status/1914845649704772043

Video Credit: Hailuo AI (MiniMax) (@Hailuo_AI on X)

That’s all for today’s China AI Native Industry Insights. Join us at the AI Native Foundation Membership Dashboard for the latest insights on AI Native, or follow our LinkedIn account at AI Native Foundation and our X (Twitter) account at AINativeF.

Copyright © 2025 AI Native Foundation. All rights reserved.