GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding 2025-03-14
New Trends for Modern Machine Translation with Large Reasoning Models 2025-03-14
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation 2025-03-14
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models 2025-03-14
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search 2025-03-14
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond 2025-03-14
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k 2025-03-14
Long Context Tuning for Video Generation 2025-03-14
Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark 2025-03-14
Distilling Diversity and Control in Diffusion Models 2025-03-14
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization 2025-03-14
Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo 2025-03-14
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation 2025-03-14
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance 2025-03-14
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation 2025-03-14
Autoregressive Image Generation with Randomized Parallel Decoding 2025-03-14
Quantization for OpenAI’s Whisper Models: A Comparative Analysis 2025-03-14
Discovering Influential Neuron Path in Vision Transformers 2025-03-14
The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation 2025-03-14
ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer 2025-03-14