Self-Harmonized Chain of Thought 2024-09-12
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models 2024-09-12
MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis 2024-09-12
gsplat: An Open-Source Library for Gaussian Splatting 2024-09-12
Can Large Language Models Unlock Novel Scientific Research Ideas? 2024-09-12
ProteinBench: A Holistic Evaluation of Protein Foundation Models 2024-09-12
Generative Hierarchical Materials Search 2024-09-12
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering 2024-09-12
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories 2024-09-12
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering 2024-09-11
LLaMA-Omni: Seamless Speech Interaction with Large Language Models 2024-09-11
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding 2024-09-11
SongCreator: Lyrics-based Universal Song Generation 2024-09-11
Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis 2024-09-11
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation 2024-09-11
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation 2024-09-11
Towards a Unified View of Preference Learning for Large Language Models: A Survey 2024-09-10
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct 2024-09-10
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs 2024-09-10
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery 2024-09-10