On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning 2025-05-26 ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection 2025-05-26 Large Language Models Implicitly Learn to See and Hear Just By Reading 2025-05-26 Interactive Post-Training for Vision-Language-Action Models 2025-05-26 DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation 2025-05-26 Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering 2025-05-26 Value-Guided Search for Efficient Chain-of-Thought Reasoning 2025-05-26 Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA 2025-05-26 Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models 2025-05-26 FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS 2025-05-26 NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning 2025-05-26 FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation 2025-05-26 TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios 2025-05-26 Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks 2025-05-26 Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing 2025-05-26 NovelSeek: When Agent Becomes the Scientist — Building Closed-Loop System from Hypothesis to Verification 2025-05-23 Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models 2025-05-23 Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning 2025-05-23 Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning 2025-05-23 KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models 2025-05-23 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213