Paper at a Glance
Paper Title: Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Authors: Wei Pang, Kevin Qinghong Lin, Xiangru Jian, Xi He, Philip Torr
Affiliation: University of Waterloo, National University of Singapore, University of Oxford
Published in: arXiv, 2025
L...
Paper at a Glance
Paper Title: CogAgent: A Visual Language Model for GUI Agents
Authors: Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxiao Dong, Ming Ding, Jie Tang
Affiliation: Tsinghua University, Zhipu AI
Published in: Conference on Compute...
Paper at a Glance
Paper Title: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Authors: Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao
Affiliation: Shanghai AI Laboratory, Chinese Academy of Sciences, T...
Paper at a Glance
Paper Title: Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction
Authors: Ziyi Yang, Xinyu Gao, Wen Zhou, Shaohui Jiao, Yuqing Zhang, Xiaogang Jin
Affiliation: Zhejiang University, ByteDance Inc.
Published in: Conference on Computer Vision and Patte...
Paper at a Glance
Paper Title: Toward Communication-Efficient Holographic Video Transmission Through Semantic Communication and Edge Intelligence
Authors: Han Hu, Kaifeng Song, Rongfei Fan, Cheng Zhan, Xintao Huan, Jie Xu
Affiliation: Beijing Institute of Technology, China; Southwest Univers...
Paper at a Glance
Paper Title: LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model
Authors: Senqiao Yang, Tianyuan Qu, Xin Lai, Zhuotao Tian, Bohao Peng, Shu Liu, Jiaya Jia
Affiliation: The Chinese University of Hong Kong, SmartMore
Published in: arXiv, January 2024...
Paper at a Glance
Paper Title: LISA: Reasoning Segmentation via Large Language Model
Authors: Xin Lai, Zhuotao Tian, Yukang Chen, Yanwei Li, Yuhui Yuan, Shu Liu, Jiaya Jia
Affiliation: The Chinese University of Hong Kong, Harbin Institute of Technology (Shenzhen), SmartMore, Microsoft Research Asia
Published in: Conference on Computer Vision and Pattern Recognition (CVPR...
Paper at a Glance
Paper Title: Less is More: Recursive Reasoning with Tiny Networks
Authors: Alexia Jolicoeur-Martineau
Affiliation: Samsung SAIL Montréal
Published in: arXiv, 2025
Link to Paper: https://arxiv.org/abs/2510.04871
The Gist of It: TL;DR
In one sentence: This paper introduces the T...
Paper at a Glance
Paper Title: ExGRPO: Learning to Reason from Experience
Authors: Runzhe Zhan, Yafu Li, Zhi Wang, Xiaoye Qu, Dongrui Liu, Jing Shao, Derek F. Wong, Yu Cheng
Affiliation: University of Macau, Shanghai AI Laboratory, Nanjing University, The Chinese University of Hong Kong
Publ...
Paper at a Glance
Paper Title: Apriel-1.5-15B-Thinker: Mid-training is all you need
Authors: Shruthan Radhakrishna, Aman Tiwari, Aanjaneya Shukla, Masoud Hashemi, Rishabh Maheshwary, et al.
Affiliation: SLAM Lab, ServiceNow
Published in: arXiv, October 2025
Link to Paper: https://arxiv.org/abs/2...