Diffusion Policy Policy Optimization 2024-09-04 Compositional 3D-aware Video Generation with LLM Director 2024-09-04 LinFusion: 1 GPU, 1 Minute, 16K Image 2024-09-04 ContextCite: Attributing Model Generation to Context 2024-09-04 Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization 2024-09-04 OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model 2024-09-04 GenAgent: Build Collaborative AI Systems with Automated Workflow Generation — Case Studies on ComfyUI 2024-09-04 Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation 2024-09-04 Density Adaptive Attention-based Speech Network: Enhancing Feature Understanding for Mental Health Disorders 2024-09-04 PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action 2024-09-04 Know When to Fuse: Investigating Non-English Hybrid Retrieval in the Legal Domain 2024-09-04 The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts 2024-09-04 VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters 2024-09-03 Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming 2024-09-03 SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding 2024-09-02 UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios 2024-09-02 CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization 2024-09-02 CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis 2024-09-02 VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers 2024-09-02 InkubaLM: A small language model for low-resource African languages 2024-09-02 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28