Temporal Reasoning Transfer from Text to Video 2024-10-10
CursorCore: Assist Programming through Aligning Anything 2024-10-10
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs 2024-10-10
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler 2024-10-10
Diversity-Rewarded CFG Distillation 2024-10-10
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching 2024-10-10
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design 2024-10-10
TRACE: Temporal Grounding Video LLM via Causal Event Modeling 2024-10-10
Data Selection via Optimal Control for Language Models 2024-10-10
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints 2024-10-10
Response Tuning: Aligning Large Language Models without Instruction 2024-10-10
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning 2024-10-10
Mixed-Session Conversation with Egocentric Memory 2024-10-10
ING-VP: MLLMs cannot Play Easy Vision-based Games Yet 2024-10-10
FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance 2024-10-10
Multimodal Situational Safety 2024-10-10
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning 2024-10-10
Collective Critics for Creative Story Generation 2024-10-10
Retrieval-Augmented Decision Transformer: External Memory for In-context RL 2024-10-10
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way 2024-10-10