MedINST: Meta Dataset of Biomedical Instructions 2024-10-24 M-RewardBench: Evaluating Reward Models in Multilingual Settings 2024-10-24 TP-Eval: Tap Multimodal LLMs’ Potential in Evaluation by Customizing Prompts 2024-10-24 LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias 2024-10-24 PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction 2024-10-23 SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes 2024-10-23 Aligning Large Language Models via Self-Steering Optimization 2024-10-23 JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation 2024-10-23 xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs 2024-10-23 Improve Vision Language Model Chain-of-thought Reasoning 2024-10-23 EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search 2024-10-23 MiniPLM: Knowledge Distillation for Pre-Training Language Models 2024-10-23 Mitigating Object Hallucination via Concentric Causal Attention 2024-10-23 Frontiers in Intelligent Colonoscopy 2024-10-23 Math Neurosurgery: Isolating Language Models’ Math Reasoning Abilities Using Only Forward Passes 2024-10-23 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors 2024-10-23 FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors 2024-10-22 CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution 2024-10-22 SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree 2024-10-22 PUMA: Empowering Unified MLLM with Multi-granular Visual Generation 2024-10-22 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49