L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? 2024-10-04 Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos 2024-10-04 MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation 2024-10-04 Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations 2024-10-04 Improving Autonomous AI Agents with Reflective Tree Search and Self-Learning 2024-10-04 Intelligence at the Edge of Chaos 2024-10-04 Learning the Latent Rules of a Game from Data: A Chess Story 2024-10-04 Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data 2024-10-04 Contextual Document Embeddings 2024-10-04 Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models 2024-10-04 SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics 2024-10-04 Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning 2024-10-04 Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models 2024-10-04 RATIONALYST: Pre-training Process-Supervision for Improving Reasoning 2024-10-03 PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation 2024-10-03 LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks 2024-10-03 From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging 2024-10-03 Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis 2024-10-03 Not All LLM Reasoners Are Created Equal 2024-10-03 Quantifying Generalization Complexity for Large Language Models 2024-10-03 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28