LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models 2024-10-15 MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models 2024-10-15 Toward General Instruction-Following Alignment for Retrieval-Augmented Generation 2024-10-15 Animate-X: Universal Character Image Animation with Enhanced Motion Representation 2024-10-15 MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks 2024-10-15 LiveXiv — A Multi-Modal Live Benchmark Based on Arxiv Papers Content 2024-10-15 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models 2024-10-15 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention 2024-10-15 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations 2024-10-15 VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents 2024-10-15 TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models 2024-10-15 Rethinking Data Selection at Scale: Random Selection is Almost All You Need 2024-10-15 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory 2024-10-15 Tree of Problems: Improving structured problem solving with compositionality 2024-10-15 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies 2024-10-15 TVBench: Redesigning Video-Language Evaluation 2024-10-15 The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling 2024-10-15 Thinking LLMs: General Instruction Following with Thought Generation 2024-10-15 MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models 2024-10-15 ReLU’s Revival: On the Entropic Overload in Normalization-Free Large Language Models 2024-10-15 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28