Training Large Language Models to Reason in a Continuous Latent Space 2024-12-10
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation 2024-12-10
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation 2024-12-10
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models 2024-12-10
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale 2024-12-10
Robust Multi-bit Text Watermark with LLM-based Paraphrasers 2024-12-10
Global and Dense Embeddings of Earth: Major TOM Floating in the Latent Space 2024-12-10
MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views 2024-12-10
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction 2024-12-10
Gated Delta Networks: Improving Mamba2 with Delta Rule 2024-12-10
Turbo3D: Ultra-fast Text-to-3D Generation 2024-12-10
If You Can’t Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs 2024-12-10
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance 2024-12-10
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling 2024-12-09
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment 2024-12-09
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale 2024-12-09
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases 2024-12-09
APOLLO: SGD-like Memory, AdamW-level Performance 2024-12-09
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion 2024-12-09
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation 2024-12-09