From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents 2024-09-06 Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation 2024-09-06 FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation 2024-09-06 Building Math Agents with Multi-Turn Iterative Preference Learning 2024-09-06 Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries 2024-09-06 Statically Contextualizing Large Language Models with Typed Holes 2024-09-06 Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency 2024-09-05 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture 2024-09-05 LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA 2024-09-05 MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark 2024-09-05 Arctic-SnowCoder: Demystifying High-Quality Data in Code Pretraining 2024-09-05 FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation 2024-09-05 Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text 2024-09-05 Affordance-based Robot Manipulation with Flow Matching 2024-09-05 Kvasir-VQA: A Text-Image Pair GI Tract Dataset 2024-09-04 OLMoE: Open Mixture-of-Experts Language Models 2024-09-04 LongRecipe: Recipe for Efficient Long Context Generalization in Large Languge Models 2024-09-04 FLUX that Plays Music 2024-09-04 VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges 2024-09-04 DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos 2024-09-04 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28