NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models 2024-09-26 Game4Loc: A UAV Geo-Localization Benchmark from Game Data 2024-09-26 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors 2024-09-26 HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale 2024-09-26 TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans 2024-09-26 HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models 2024-09-25 MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling 2024-09-25 Making Text Embedders Few-Shot Learners 2024-09-25 OmniBench: Towards The Future of Universal Omni-Language Models 2024-09-25 Present and Future Generalization of Synthetic Image Detectors 2024-09-25 MonoFormer: One Transformer for Both Diffusion and Autoregression 2024-09-25 Seeing Faces in Things: A Model and Dataset for Pareidolia 2024-09-25 EuroLLM: Multilingual Language Models for Europe 2024-09-25 MaskBit: Embedding-free Image Generation via Bit Tokens 2024-09-25 Improvements to SDXL in NovelAI Diffusion V3 2024-09-25 Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation 2024-09-25 Reward-Robust RLHF in LLMs 2024-09-25 DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control 2024-09-25 SLIMER-IT: Zero-Shot NER on Italian Language 2024-09-25 Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts 2024-09-25 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160