Spinning the Golden Thread: Benchmarking Long-Form Generation in Language Models 2024-09-09
GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers 2024-09-09
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing 2024-09-06
Attention Heads of Large Language Models: A Survey 2024-09-06
FuzzCoder: Byte-level Fuzzing Test via Large Language Model 2024-09-06
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation 2024-09-06
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding 2024-09-06
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild 2024-09-06
From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents 2024-09-06
Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation 2024-09-06
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation 2024-09-06
Building Math Agents with Multi-Turn Iterative Preference Learning 2024-09-06
Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries 2024-09-06
Statically Contextualizing Large Language Models with Typed Holes 2024-09-06
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency 2024-09-05
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture 2024-09-05
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA 2024-09-05
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark 2024-09-05
Arctic-SnowCoder: Demystifying High-Quality Data in Code Pretraining 2024-09-05
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation 2024-09-05