Differential Transformer 2024-10-08 LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations 2024-10-08 VideoGuide: Improving Video Diffusion Models without Training Through a Teacher’s Guide 2024-10-08 FAN: Fourier Analysis Networks 2024-10-08 Named Clinical Entity Recognition Benchmark 2024-10-08 ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery 2024-10-08 Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents 2024-10-08 TLDR: Token-Level Detective Reward Model for Large Vision Language Models 2024-10-08 Presto! Distilling Steps and Layers for Accelerating Music Generation 2024-10-08 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models 2024-10-08 UniMuMo: Unified Text, Music and Motion Generation 2024-10-08 MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion 2024-10-08 MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs 2024-10-08 OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction 2024-10-08 LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning 2024-10-08 TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles 2024-10-08 SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification 2024-10-08 Autonomous Character-Scene Interaction Synthesis from Text Instruction 2024-10-08 What Matters for Model Merging at Scale? 2024-10-08 SePPO: Semi-Policy Preference Optimization for Diffusion Alignment 2024-10-08 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49