ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks 2025-03-11 DiffCLIP: Differential Attention Meets CLIP 2025-03-11 Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations 2025-03-11 TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models 2025-03-11 A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning 2025-03-11 Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs 2025-03-11 Novel Object 6D Pose Estimation with a Single Reference View 2025-03-11 Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries 2025-03-11 HumanMM: Global Human Motion Recovery from Multi-shot Videos 2025-03-11 RePO: ReLU-based Preference Optimization 2025-03-11 Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning 2025-03-11 Escaping Plato’s Cave: Towards the Alignment of 3D and Text Latent Spaces 2025-03-11 PhiloBERTA: A Transformer-Based Cross-Lingual Analysis of Greek and Latin Lexicons 2025-03-11 Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts 2025-03-11 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding 2025-03-11 What’s in a Latent? Leveraging Diffusion Latent Space for Domain Generalization 2025-03-11 NeuGrasp: Generalizable Neural Surface Reconstruction with Background Priors for Material-Agnostic Object Grasp Detection 2025-03-11 RuCCoD: Towards Automated ICD Coding in Russian 2025-03-10 Unified Reward Model for Multimodal Understanding and Generation 2025-03-10 EuroBERT: Scaling Multilingual Encoders for European Languages 2025-03-10 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160