Baichuan-Omni Technical Report 2024-10-14 Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis 2024-10-14 From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning 2024-10-14 EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models 2024-10-14 StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization 2024-10-14 PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness 2024-10-14 SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights 2024-10-14 Semantic Score Distillation Sampling for Compositional Text-to-3D Generation 2024-10-14 Mechanistic Permutability: Match Features Across Layers 2024-10-14 Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining 2024-10-14 KV Prediction for Improved Time to First Token 2024-10-14 ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion 2024-10-14 DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models 2024-10-14 Think While You Generate: Discrete Diffusion with Planned Denoising 2024-10-14 MiRAGeNews: Multimodal Realistic AI-Generated News Detection 2024-10-14 Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting 2024-10-14 SimpleStrat: Diversifying Language Model Generation with Stratification 2024-10-14 I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow 2024-10-14 Mentor-KD: Making Small Language Models Better Multi-step Reasoners 2024-10-14 GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment 2024-10-14 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28