GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details 2024-11-06 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge 2024-11-06 Inference Optimal VLMs Need Only One Visual Token but Larger Models 2024-11-06 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation 2024-11-06 AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents 2024-11-05 “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization 2024-11-05 WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning 2024-11-05 Training-free Regional Prompting for Diffusion Transformers 2024-11-05 Survey of Cultural Awareness in Language Models: Text and Beyond 2024-11-05 Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent 2024-11-05 How Far is Video Generation from World Model: A Physical Law Perspective 2024-11-05 DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models 2024-11-05 MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D 2024-11-05 GenXD: Generating Any 3D and 4D Scenes 2024-11-05 Adaptive Caching for Faster Video Generation with Diffusion Transformers 2024-11-05 DynaSaur: Large Language Agents Beyond Predefined Actions 2024-11-05 Sparsing Law: Towards Large Language Models with Greater Activation Sparsity 2024-11-05 PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance 2024-11-05 LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models 2024-11-05 Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models 2024-11-05 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49