MH-MoE: Multi-Head Mixture-of-Experts 2024-11-26
Knowledge Transfer Across Modalities with Natural Language Supervision 2024-11-26
O1 Replication Journey — Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? 2024-11-26
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation 2024-11-26
One Diffusion to Generate Them All 2024-11-26
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI 2024-11-26
VisualLens: Personalization through Visual History 2024-11-26
Factorized Visual Tokenization and Generation 2024-11-26
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry 2024-11-26
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline 2024-11-26
From CISC to RISC: language-model guided assembly transpilation 2024-11-26
SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation 2024-11-26
Cautious Optimizers: Improving Training with One Line of Code 2024-11-26
The Impossible Test: A 2024 Unsolvable Dataset and A Chance for an AGI Quiz 2024-11-26
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis 2024-11-26
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models 2024-11-26
Predicting Emergent Capabilities by Finetuning 2024-11-26
LLMs Do Not Think Step-by-step In Implicit Reasoning 2024-11-26
Find Any Part in 3D 2024-11-26
Edge Weight Prediction For Category-Agnostic Pose Estimation 2024-11-26