SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding 2024-12-16 BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities 2024-12-16 Large Action Models: From Inception to Implementation 2024-12-16 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 2024-12-16 FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion 2024-12-16 ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation 2024-12-16 FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing 2024-12-16 FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers 2024-12-16 Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation 2024-12-16 LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity 2024-12-16 SCBench: A KV Cache-Centric Analysis of Long-Context Methods 2024-12-16 TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies 2024-12-16 SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs 2024-12-16 GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers 2024-12-16 Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images 2024-12-16 InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions 2024-12-13 Phi-4 Technical Report 2024-12-13 Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions 2024-12-13 Multimodal Latent Language Modeling with Next-Token Diffusion 2024-12-13 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM 2024-12-13 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213