On Domain-Specific Post-Training for Multimodal Large Language Models 2024-12-02
Video Depth without Video Models 2024-12-02
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs 2024-12-02
Timestep Embedding Tells: It’s Time to Cache for Video Diffusion Model 2024-12-02
Trajectory Attention for Fine-grained Video Motion Control 2024-12-02
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion 2024-12-02
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding 2024-12-02
MATATA: a weak-supervised MAthematical Tool-Assisted reasoning for Tabular Applications 2024-12-02
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos 2024-12-02
Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing 2024-12-02
GRAPE: Generalizing Robot Policy via Preference Alignment 2024-12-02
Reverse Thinking Makes LLMs Stronger Reasoners 2024-12-02
LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification 2024-12-02
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers 2024-12-02
Scaling Transformers for Low-Bitrate High-Quality Speech Coding 2024-12-02
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling 2024-12-02
Training Noise Token Pruning 2024-12-02
SpotLight: Shadow-Guided Object Relighting via Diffusion 2024-12-02
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning 2024-11-29
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting 2024-11-29