An adapted large language model facilitates multiple medical tasks in diabetes care 2024-09-24 MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting 2024-09-24 SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending 2024-09-24 A Case Study of Web App Coding with OpenAI Reasoning Models 2024-09-24 Self-Supervised Audio-Visual Soundscape Stylization 2024-09-24 Imagine yourself: Tuning-Free Personalized Image Generation 2024-09-23 YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models 2024-09-23 Prithvi WxC: Foundation Model for Weather and Climate 2024-09-23 MuCodec: Ultra Low-Bitrate Music Codec 2024-09-23 Colorful Diffuse Intrinsic Image Decomposition in the Wild 2024-09-23 Portrait Video Editing Empowered by Multimodal Generative Priors 2024-09-23 V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians 2024-09-23 Temporally Aligned Audio for Video with Autoregression 2024-09-23 Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation 2024-09-23 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments 2024-09-23 Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts 2024-09-23 LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents 2024-09-23 Training Language Models to Self-Correct via Reinforcement Learning 2024-09-20 InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning 2024-09-20 MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines 2024-09-20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28