MotiF: Making Text Count in Image Animation with Motion Focal Loss 2024-12-25
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning 2024-12-25
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response 2024-12-24
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners 2024-12-24
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching 2024-12-24
Diving into Self-Evolving Training for Multimodal Reasoning 2024-12-24
Large Motion Video Autoencoding with Cross-modal Video VAE 2024-12-24
Deliberation in Latent Space via Differentiable Cache Augmentation 2024-12-24
OpenAI o1 System Card 2024-12-24
Outcome-Refining Process Supervision for Code Generation 2024-12-24
Revisiting In-Context Learning with Long Context Language Models 2024-12-24
LearnLM: Improving Gemini for Learning 2024-12-24
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought 2024-12-24
ResearchTown: Simulator of Human Research Community 2024-12-24
PC Agent: While You Sleep, AI Works — A Cognitive Journey into Digital World 2024-12-24
NILE: Internal Consistency Alignment in Large Language Models 2024-12-24
Agent-SafetyBench: Evaluating the Safety of LLM Agents 2024-12-24
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding 2024-12-24
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning 2024-12-24
Parallelized Autoregressive Visual Generation 2024-12-23