CVPR 2024
2025
10
- [CogAgent] An AI That Sees Your Screen Like You Do—And Can Use It For You
- [MVBench] Beyond Still Frames: The Benchmark Testing if AI Truly Understands Time in Videos [Deformable 3D Gaussians] Bringing 3D Gaussian Splatting to Life for Real-Time Dynamic Scenes
- [LISA] From 'Segment the Car' to 'Segment the Safest Place for a Toddler': LLMs Learn to Reason and See
- [Wonder3D] From 2D Snap to 3D Asset in 3 Minutes Diffusion
- Real-Time Video Rendering with 4D Gaussian Splatting [Depth Anything] How 62 Million Unlabeled Photos Created a New State-of-the-Art Vision Model
- [MMMU] The AI 'College Exam' That Even Top Models Fail [RT-DETR] The First End-to-End Detector to Outpace YOLO in Real-Time [LLaVA-1.5] How Simple Changes Created a State-of-the-Art Vision-Language Model
1