Evaluating Multiview Object Consistency in Humans and Image Models 2024-09-10
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data 2024-09-09
Configurable Foundation Models: Building LLMs from a Modular Perspective 2024-09-09
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation 2024-09-09
Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task 2024-09-09
Spinning the Golden Thread: Benchmarking Long-Form Generation in Language Models 2024-09-09
GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers 2024-09-09
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing 2024-09-06
Attention Heads of Large Language Models: A Survey 2024-09-06
FuzzCoder: Byte-level Fuzzing Test via Large Language Model 2024-09-06
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation 2024-09-06
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding 2024-09-06
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild 2024-09-06
From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents 2024-09-06
Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation 2024-09-06
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation 2024-09-06
Building Math Agents with Multi-Turn Iterative Preference Learning 2024-09-06
Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries 2024-09-06
Statically Contextualizing Large Language Models with Typed Holes 2024-09-06
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency 2024-09-05