Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing 2024-09-26 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion 2024-09-26 NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models 2024-09-26 Game4Loc: A UAV Geo-Localization Benchmark from Game Data 2024-09-26 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors 2024-09-26 HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale 2024-09-26 TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans 2024-09-26 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models 2024-09-25 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling 2024-09-25 Making Text Embedders Few-Shot Learners 2024-09-25 OmniBench: Towards The Future of Universal Omni-Language Models 2024-09-25 Present and Future Generalization of Synthetic Image Detectors 2024-09-25 MonoFormer: One Transformer for Both Diffusion and Autoregression 2024-09-25 Seeing Faces in Things: A Model and Dataset for Pareidolia 2024-09-25 EuroLLM: Multilingual Language Models for Europe 2024-09-25 MaskBit: Embedding-free Image Generation via Bit Tokens 2024-09-25 Improvements to SDXL in NovelAI Diffusion V3 2024-09-25 Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation 2024-09-25 Reward-Robust RLHF in LLMs 2024-09-25 DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control 2024-09-25 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49