NRGBoost: Energy-Based Generative Boosted Trees 2024-10-07 Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning 2024-10-07 MLP-KAN: Unifying Deep Representation and Function Learning 2024-10-07 AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark 2024-10-07 CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs 2024-10-07 GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs 2024-10-07 Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models 2024-10-04 Video Instruction Tuning With Synthetic Data 2024-10-04 Loong: Generating Minute-level Long Videos with Autoregressive Language Models 2024-10-04 LLaVA-Critic: Learning to Evaluate Multimodal Models 2024-10-04 Contrastive Localized Language-Image Pre-Training 2024-10-04 Depth Pro: Sharp Monocular Metric Depth in Less Than a Second 2024-10-04 VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment 2024-10-04 Large Language Models as Markov Chains 2024-10-04 Distilling an End-to-End Voice Assistant Without Instruction Training Data 2024-10-04 Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models 2024-10-04 CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling 2024-10-04 Training Language Models on Synthetic Edit Sequences Improves Code Synthesis 2024-10-04 SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration 2024-10-04 MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis 2024-10-04 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28