2025
14
- Smarter, Not Louder: How LLMs Cut Multi-Agent Communication by 53% While Boosting Performance Building the Brain of 6G: A Tutorial on Large AI Models and Agentic AI for Intelligent Communications Teaching AI to Think: A Deep Dive into LLaVA-CoT's Step-by-Step Visual Reasoning Real-Time Video Rendering with 4D Gaussian Splatting Depth Anything: How 62 Million Unlabeled Photos Created a New State-of-the-Art Vision Model From 1 to N: How Scaling AI Agents with 'Behavior Narratives' Unlocks Near-Human Performance
- Sharing is Caring: How a 'Swarm' of Language Models Learns Faster by Sharing Experiences Is GPT-4V a True Expert? A Deep Dive into MMMU, the AI 'College Exam' That Even Top Models Fail RT-DETR: The First End-to-End Detector to Outpace YOLO in Real-Time LLaVA-1.5: How Simple Changes Created a State-of-the-Art Vision-Language Model Sending Pictures with (Almost) Zero Bandwidth? A Breakdown of Multi-Modal Semantic Communication with Intelligent Metasurfaces
- When Tokens Talk Too Much: A Guide to Compressing AI Inputs from Images, Videos, and Audio Blurring to Compress Better: A Deep Dive into Google's Scale-Space Flow for Video Compression Stop Tuning Your Losses: How Uncertainty Can Automatically Balance Multi-Task Learning Models