OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction 2024-10-08 LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning 2024-10-08 TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles 2024-10-08 SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification 2024-10-08 Autonomous Character-Scene Interaction Synthesis from Text Instruction 2024-10-08 What Matters for Model Merging at Scale? 2024-10-08 SePPO: Semi-Policy Preference Optimization for Diffusion Alignment 2024-10-08 Grounding Language in Multi-Perspective Referential Communication 2024-10-08 Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach 2024-10-08 SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation 2024-10-08 Addition is All You Need for Energy-efficient Language Models 2024-10-07 NL-Eye: Abductive NLI for Images 2024-10-07 Selective Attention Improves Transformer 2024-10-07 Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise 2024-10-07 Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding 2024-10-07 RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models 2024-10-07 A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond 2024-10-07 Erasing Conceptual Knowledge from Language Models 2024-10-07 MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction 2024-10-07 CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction 2024-10-07 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28