Hallucinating AI Hijacking Attack: Large Language Models and Malicious Code Recommenders 2024-10-10
Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach 2024-10-10
Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control 2024-10-10
MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders 2024-10-10
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA 2024-10-10
TinyEmo: Scaling down Emotional Reasoning via Metric Projection 2024-10-10
TextToon: Real-Time Text Toonify Head Avatar from Single Video 2024-10-10
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment 2024-10-10
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis 2024-10-10
VHELM: A Holistic Evaluation of Vision Language Models 2024-10-10
Does Spatial Cognition Emerge in Frontier Models? 2024-10-10
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering 2024-10-10
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks 2024-10-10
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling 2024-10-10
$\textbf{Only-IF}$: Revealing the Decisive Effect of Instruction Diversity on Generalization 2024-10-09
LongGenBench: Long-context Generation Benchmark 2024-10-09
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation 2024-10-09
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References 2024-10-09
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search 2024-10-09
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models 2024-10-09