Inference Optimal VLMs Need Only One Visual Token but Larger Models 2024-11-06
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation 2024-11-06
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents 2024-11-05
“Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization 2024-11-05
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning 2024-11-05
Training-free Regional Prompting for Diffusion Transformers 2024-11-05
Survey of Cultural Awareness in Language Models: Text and Beyond 2024-11-05
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent 2024-11-05
How Far is Video Generation from World Model: A Physical Law Perspective 2024-11-05
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models 2024-11-05
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-05
GenXD: Generating Any 3D and 4D Scenes 2024-11-05
Adaptive Caching for Faster Video Generation with Diffusion Transformers 2024-11-05
DynaSaur: Large Language Agents Beyond Predefined Actions 2024-11-05
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity 2024-11-05
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance 2024-11-05
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models 2024-11-05
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models 2024-11-05
SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF 2024-11-05
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions 2024-11-05