Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse 2024-10-30 Accelerating Direct Preference Optimization with Prefix Sharing 2024-10-30 Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback 2024-10-30 Task Vectors are Cross-Modal 2024-10-30 Can Language Models Replace Programmers? REPOCOD Says ‘Not Yet’ 2024-10-30 Measuring memorization through probabilistic discoverable extraction 2024-10-30 RARe: Retrieval Augmented Retrieval with In-Context Examples 2024-10-30 Bielik 7B v0.1: A Polish Language Model — Development, Insights, and Evaluation 2024-10-29 GPT-4o System Card 2024-10-29 AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant 2024-10-29 Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction 2024-10-29 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation 2024-10-29 MarDini: Masked Autoregressive Diffusion for Video Generation at Scale 2024-10-29 LongReward: Improving Long-context Large Language Models with AI Feedback 2024-10-29 A Survey of Small Language Models 2024-10-29 GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation 2024-10-29 COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training 2024-10-29 Fast Best-of-N Decoding via Speculative Rejection 2024-10-29 Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines 2024-10-29 LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior 2024-10-29 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49