Latest Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding
Shuquan Lian, Yuhang Wu, Jia Ma, et al.
a month ago

DualSG: A Dual-Stream Explicit Semantic-Guided Multivariate Time Series Forecasting Framework
Kuiye Ding, Fanda Fan, Yao Wang, et al.
a month ago

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token
Compression across Images, Videos, and Audios
Kele Shao, Keda Tao, Kejia Zhang, et al.
a month ago

SmallThinker: A Family of Efficient Large Language Models Natively
Trained for Local Deployment
Yixin Song, Zhenliang Xue, Dongliang Wei, et al.
a month ago

Reconstructing 4D Spatial Intelligence: A Survey
Yukang Cao, Jiahao Lu, Zhisheng Huang, et al.
a month ago

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for
Multi-Task Learning
Zedong Wang, Siyuan Li, Dan Xu
a month ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World
Shorts
Yuying Ge, Yixiao Ge, Chen Li, et al.
a month ago

Agentic Reinforced Policy Optimization
Guanting Dong, Hangyu Mao, Kai Ma, et al.
a month ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Huan-ang Gao, Jiayi Geng, Wenyue Hua, et al.
a month ago

SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
Keyan Ding, Jing Yu, Junjie Huang, et al.
a month ago

Specification Self-Correction: Mitigating In-Context Reward Hacking
Through Test-Time Refinement
V\u00edctor Gallego
a month ago

PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving
Maciej K. Wozniak, Lianhang Liu, Yixi Cai, et al.
a month ago

Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI
Jiangkai Wu, Zhiyuan Ren, Liming Liu, et al.
a month ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI
Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, et al.
a month ago

Deep Researcher with Test-Time Diffusion
Rujun Han, Yanfei Chen, Zoey CuiZhu, et al.
a month ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
Jiale Chen, Torsten Hoefler, Dan Alistarh
a month ago

MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
Siyi Xun, Yue Sun, Jingkun Chen, et al.
a month ago

OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Xuetian Chen, Yinghao Chen, Xinfeng Yuan, et al.
a month ago

Hierarchical Budget Policy Optimization for Adaptive Reasoning
Shangke Lyu, Linjuan Wu, Yuchen Yan, et al.
a month ago

Captain Cinema: Towards Short Movie Generation
Junfei Xiao, Ceyuan Yang, Lvmin Zhang, et al.
a month ago

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
Xingyu Wu, Yuchen Yan, Shangke Lyu, et al.
a month ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models
Hang Yan, Fangzhi Xu, Rongman Xu, et al.
a month ago

NABLA: Neighborhood Adaptive Block-Level Attention
Dmitrii Mikhailov, Aleksey Letunovskiy, Maria Kovaleva, et al.
a month ago

Group Sequence Policy Optimization
Chujie Zheng, Shixuan Liu, Mingze Li, et al.
a month ago

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
Yicheng Bao, Guanxu Chen, Mingkang Chen, et al.
a month ago

Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Mutian Yang, Jiandong Gao, Ji Wu
a month ago

Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Chuanhao Yan, Fengdi Che, Xuhan Huang, et al.
a month ago

RAVine: Reality-Aligned Evaluation for Agentic Search
Yilong Xu, Xiang Long, Zhi Zheng, et al.
a month ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain
Reasoning via Reinforcement Learning
Yu Li, Zhuoshi Pan, Honglin Lin, et al.
a month ago

DesignLab: Designing Slides Through Iterative Detection and Correction
Jooyeol Yun, Heng Wang, Yotaro Shimose, et al.
a month ago