Latest Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

DesignLab: Designing Slides Through Iterative Detection and Correction
Jooyeol Yun, Heng Wang, Yotaro Shimose, et al.
a month ago

Yume: An Interactive World Generation Model
Xiaofeng Mao, Shaoheng Lin, Zhen Li, et al.
a month ago

Pixels, Patterns, but No Poetry: To See The World like Humans
Hongcheng Gao, Zihao Huang, Lin Xu, et al.
a month ago

Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
Xinyao Liu, Diping Song
a month ago

HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study
Mandar Pitale, Jelena Frtunikj, Abhinaw Priyadershi, et al.
a month ago

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
Ang Li, Charles Wang, Kaiyu Yue, et al.
a month ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking
Reasoning
Junhao Shen, Haiteng Zhao, Yuzhe Gu, et al.
a month ago

Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers
Wongi Jeong, Kyungryeol Lee, Hoigi Seo, et al.
a month ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science
Reasoning
Run-Ze Fan, Zengzhi Wang, Pengfei Liu
a month ago

Step-Audio 2 Technical Report
Boyong Wu, Chao Yan, Chen Hu, et al.
a month ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning
Hongyin Luo, Nathaniel Morgan, Tina Li, et al.
a month ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Xiaoyang Chen, Yunhao Chen, Zeren Chen, et al.
a month ago

Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning
Mian Ibad Ali Shah, Enda Barrett, Karl Mason
a month ago

NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining
Maksim Kuprashevich, Grigorii Alekseenko, Irina Tolstykh, et al.
a month ago

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling
Hayeon Kim, Ji Ha Jang, Se Young Chun
a month ago

WebShaper: Agentically Data Synthesizing via Information-Seeking
Formalization
Zhengwei Tao, Jialong Wu, Wenbiao Yin, et al.
a month ago

The Invisible Leash: Why RLVR May Not Escape Its Origin
Fang Wu, Weihao Xuan, Ximing Lu, et al.
a month ago

GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Fei Tang, Zhangxuan Gu, Zhengxi Lu, et al.
a month ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via
Context-Aware Multi-Stage Policy Optimization
Xingxuan Li, Yao Xiao, Dianwen Ng, et al.
a month ago

Design of intrinsically disordered region binding proteins
Kejia Wu, et al
a month ago

An All-Atom Generative Model for Designing Protein Complexes
Ruizhe Chen, Dongyu Xue, Xiangxin Zhou, et al.
a month ago

RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services
Fei Zhao, Chonggang Lu, Yue Wang, et al.
a month ago

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
Quang-Binh Nguyen, Minh Luu, Quang Nguyen, et al.
a month ago

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
Gen Luo, Wenhan Dou, Wenhao Li, et al.
a month ago

Franca: Nested Matryoshka Clustering for Scalable Visual Representation
Learning
Shashanka Venkataramanan, Valentinos Pariza, Mohammadreza Salehi, et al.
a month ago

A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges
in Russian Speech Generative Models
Kirill Borodin, Nikita Vasiliev, Vasiliy Kudryavtsev, et al.
a month ago

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs
Zichen Wen, Jiashu Qu, Dongrui Liu, et al.
a month ago

PrefPalette: Personalized Preference Modeling with Latent Attributes
Shuyue Stella Li, Melanie Sclar, Hunter Lang, et al.
a month ago

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
Xiaoya Li, Xiaofei Sun, Albert Wang, et al.
a month ago

AnyCap Project: A Unified Framework, Dataset, and Benchmark for
Controllable Omni-modal Captioning
Yiming Ren, Zhiqiang Lin, Yu Li, et al.
a month ago