Latest Papers
Cutting-edge AI research papers, updated daily to help you keep up with the latest AI trends

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers
Jingze Zhu, Yongliang Wu, Wenbo Zhu, et al.
a month ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Sangmin Bae, Yujin Kim, Reza Bayat, et al.
a month ago

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once
Zhuoshi Pan, Qizhi Pei, Yu Li, et al.
a month ago

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments
Mingxian Lin, Wei Huang, Yitang Li, et al.
a month ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Mingqi Wu, Zhihao Zhang, Qiaole Dong, et al.
a month ago

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation
Youliang Zhang, Zhaoyang Li, Duomin Wang, et al.
a month ago

VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
Xuzhao Li, Xuchen Li, Shiyu Hu, et al.
a month ago

Sidechain conditioning and modeling for full-atom protein sequence design with FAMPNN
Talal Widatalla, Richard W. Shuai, Brian Hie, et al.
a month ago

One Token to Fool LLM-as-a-Judge
Yulai Zhao, Haolin Liu, Dian Yu, et al.
a month ago

From One to More: Contextual Part Latents for 3D Generation
Shaocong Dong, Lihe Ding, Xiao Chen, et al.
a month ago

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
Yana Wei, Liang Zhao, Jianjian Sun, et al.
a month ago

Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective
Hangjie Yuan, Weihua Chen, Jun Cen, et al.
a month ago

Neural-Driven Image Editing
Pengfei Zhou, Jie Xia, Xiaopeng Peng, et al.
a month ago

KV Cache Steering for Inducing Reasoning in Small Language Models
Max Belitsky, Dawid J. Kopiczko, Michael Dorkenwald, et al.
a month ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models
Luke Rivard, Sun Sun, Hongyu Guo, et al.
a month ago

CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering
Zhengqing Wang, Yuefan Wu, Jiacheng Chen, et al.
a month ago

Test-Time Scaling with Reflective Generative Model
Zixiao Wang, Yuxin Wang, Xiaorui Wang, et al.
a month ago

System-of-systems Modeling and Optimization: An Integrated Framework for Intermodal Mobility
Paul Saves, Jasper Bussemaker, Rémi Lafage, et al.
a month ago

All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
Chaitanya K. Joshi, Xiang Fu, Yi-Lun Liao, et al.
a month ago

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
JingLi Lin, Chenming Zhu, Runsen Xu, et al.
a month ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
Haochen Wang, Xiangtai Li, Zilong Huang, et al.
a month ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents
Yu Wang, Xi Chen
a month ago

Skywork-R1V3 Technical Report
Wei Shen, Jiangbo Pei, Yi Peng, et al.
a month ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting
Vera Soboleva, Aibek Alanov, Andrey Kuznetsov, et al.
a month ago

Scaling RL to Long Videos
Yukang Chen, Wei Huang, Baifeng Shi, et al.
a month ago

Critiques of World Models
Eric Xing, Mingkai Deng, Jinyu Hou, et al.
a month ago

Is Diversity All You Need for Scalable Robotic Manipulation?
Modi Shi, Li Chen, Jin Chen, et al.
a month ago

Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts
Guokan Shang, Hadi Abdine, Ahmad Chamma, et al.
a month ago

GTA1: GUI Test-time Scaling Agent
Yan Yang, Dongxu Li, Yutong Dai, et al.
a month ago

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
Rongsheng Wang, Junying Chen, Ke Ji, et al.
a month ago