Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution 2024-09-20 LVCD: Reference-based Lineart Video Colorization with Diffusion Models 2024-09-20 B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests 2024-09-20 Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization 2024-09-20 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion 2024-09-20 StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation 2024-09-20 Language Models Learn to Mislead Humans via RLHF 2024-09-20 FlexiTex: Enhancing Texture Generation with Visual Guidance 2024-09-20 MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions 2024-09-20 Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation 2024-09-20 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt 2024-09-20 CLAIR-A: Leveraging Large Language Models to Judge Audio Captions 2024-09-20 Qwen2.5-Coder Technical Report 2024-09-19 Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution 2024-09-19 LLMs + Persona-Plug = Personalized LLMs 2024-09-19 To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning 2024-09-19 A Controlled Study on Long Context Extension and Generalization in LLMs 2024-09-19 Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey 2024-09-19 GRIN: GRadient-INformed MoE 2024-09-19 Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models 2024-09-19 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28