Category: CVPR 2024 | Make Papers Easy

October 2025

[CogAgent] An AI That Sees Your Screen Like You Do—And Can Use It For You
[MVBench] Beyond Still Frames: The Benchmark Testing if AI Truly Understands Time in Videos
[Deformable 3D Gaussians] Bringing 3D Gaussian Splatting to Life for Real-Time Dynamic Scenes
[LISA] From 'Segment the Car' to 'Segment the Safest Place for a Toddler': LLMs Learn to Reason and See
[Wonder3D] From 2D Snap to 3D Asset in 3 Minutes Diffusion
Real-Time Video Rendering with 4D Gaussian Splatting
[Depth Anything] How 62 Million Unlabeled Photos Created a New State-of-the-Art Vision Model
[MMMU] The AI 'College Exam' That Even Top Models Fail
[RT-DETR] The First End-to-End Detector to Outpace YOLO in Real-Time
[LLaVA-1.5] How Simple Changes Created a State-of-the-Art Vision-Language Model
