CVPR 2024
2025
5
- Real-Time Video Rendering with 4D Gaussian Splatting Depth Anything: How 62 Million Unlabeled Photos Created a New State-of-the-Art Vision Model
- Is GPT-4V a True Expert? A Deep Dive into MMMU, the AI 'College Exam' That Even Top Models Fail RT-DETR: The First End-to-End Detector to Outpace YOLO in Real-Time LLaVA-1.5: How Simple Changes Created a State-of-the-Art Vision-Language Model
1