SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity 2025-03-04 From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens 2025-03-04 Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation 2025-03-04 PodAgent: A Comprehensive Framework for Podcast Generation 2025-03-04 Word Form Matters: LLMs’ Semantic Reconstruction under Typoglycemia 2025-03-04 CodeArena: A Collective Evaluation Platform for LLM Code Generation 2025-03-04 Large-Scale Data Selection for Instruction Tuning 2025-03-04 General Reasoning Requires Learning to Reason from the Get-go 2025-03-04 VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation 2025-03-04 Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator 2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model 2025-03-04 CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments 2025-03-04 Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis 2025-03-04 AI-Invented Tonal Languages: Preventing a Machine Lingua Franca Beyond Human Understanding 2025-03-04 DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking 2025-03-03 Chain of Draft: Thinking Faster by Writing Less 2025-03-03 Multi-Turn Code Generation Through Single-Step Rewards 2025-03-03 How far can we go with ImageNet for Text-to-Image generation? 2025-03-03 SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers 2025-03-03 ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents 2025-03-03 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160