LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment 2024-12-09 MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale 2024-12-09 EXAONE 3.5: Series of Large Language Models for Real-world Use Cases 2024-12-09 APOLLO: SGD-like Memory, AdamW-level Performance 2024-12-09 SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion 2024-12-09 Moto: Latent Motion Token as the Bridging Language for Robot Manipulation 2024-12-09 GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration 2024-12-09 Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction 2024-12-09 CompCap: Improving Multimodal Large Language Models with Composite Captions 2024-12-09 Mind the Time: Temporally-Controlled Multi-Event Video Generation 2024-12-09 2DGS-Room: Seed-Guided 2D Gaussian Splatting with Geometric Constrains for High-Fidelity Indoor Scene Reconstruction 2024-12-09 PanoDreamer: 3D Panorama Synthesis from a Single Image 2024-12-09 DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling 2024-12-09 BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks 2024-12-09 VisionZip: Longer is Better but Not Necessary in Vision Language Models 2024-12-06 Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection 2024-12-06 Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion 2024-12-06 Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction 2024-12-06 A Noise is Worth Diffusion Guidance 2024-12-06 Evaluating Language Models as Synthetic Data Generators 2024-12-06 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213