Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space 2025-05-20
Model Merging in Pre-training of Large Language Models 2025-05-20
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision 2025-05-20
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation 2025-05-20
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA 2025-05-20
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models 2025-05-20
Fractured Chain-of-Thought Reasoning 2025-05-20
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models 2025-05-20
Neuro-Symbolic Query Compiler 2025-05-20
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization 2025-05-20
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning 2025-05-20
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images 2025-05-20
ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models 2025-05-20
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research 2025-05-20
Accelerate TarFlow Sampling with GS-Jacobi Iteration 2025-05-20
R3: Robust Rubric-Agnostic Reward Models 2025-05-20
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation 2025-05-20
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance 2025-05-20
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation 2025-05-20
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning 2025-05-20