GuardReasoner: Towards Reasoning-based LLM Safeguards 2025-01-31 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs 2025-01-31 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch 2025-01-31 MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding 2025-01-31 PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding 2025-01-31 Large Language Models Think Too Fast To Explore Effectively 2025-01-31 WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training 2025-01-31 o3-mini vs DeepSeek-R1: Which One is Safer? 2025-01-31 CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation 2025-01-31 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate 2025-01-30 Atla Selene Mini: A General Purpose Evaluation Model 2025-01-30 Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts 2025-01-30 Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation 2025-01-30 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation 2025-01-30 People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text 2025-01-30 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training 2025-01-29 Optimizing Large Language Model Training Using FP4 Quantization 2025-01-29 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation 2025-01-29 Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling 2025-01-29 Open Problems in Mechanistic Interpretability 2025-01-29 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162