HyperAIHyperAI

Latest Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers
LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers
Jingze Zhu, Yongliang Wu, Wenbo Zhu, et al.
a month ago
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive
  Token-Level Computation
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Sangmin Bae, Yujin Kim, Reza Bayat, et al.
a month ago
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems
  at Once
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once
Zhuoshi Pan, Qizhi Pei, Yu Li, et al.
a month ago
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments
Mingxian Lin, Wei Huang, Yitang Li, et al.
a month ago
Reasoning or Memorization? Unreliable Results of Reinforcement Learning
  Due to Data Contamination
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Mingqi Wu, Zhihao Zhang, Qiaole Dong, et al.
a month ago
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual
  Dyadic Interactive Human Generation
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation
Youliang Zhang, Zhaoyang Li, Duomin Wang, et al.
a month ago
VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
Xuzhao Li, Xuchen Li, Shiyu Hu, et al.
a month ago
Sidechain conditioning and modeling for full-atom protein sequence design with FAMPNN
Sidechain conditioning and modeling for full-atom protein sequence design with FAMPNN
Talal Widatalla, Richard W. Shuai, Brian Hie, et al.
a month ago
One Token to Fool LLM-as-a-Judge
One Token to Fool LLM-as-a-Judge
Yulai Zhao, Haolin Liu, Dian Yu, et al.
a month ago
From One to More: Contextual Part Latents for 3D Generation
From One to More: Contextual Part Latents for 3D Generation
Shaocong Dong, Lihe Ding, Xiao Chen, et al.
a month ago
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for
  Visual Reasoning
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
Yana Wei, Liang Zhao, Jianjian Sun, et al.
a month ago
Lumos-1: On Autoregressive Video Generation from a Unified Model
  Perspective
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective
Hangjie Yuan, Weihua Chen, Jun Cen, et al.
a month ago
Neural-Driven Image Editing
Neural-Driven Image Editing
Pengfei Zhou, Jie Xia, Xiaopeng Peng, et al.
a month ago
KV Cache Steering for Inducing Reasoning in Small Language Models
KV Cache Steering for Inducing Reasoning in Small Language Models
Max Belitsky, Dawid J. Kopiczko, Michael Dorkenwald, et al.
a month ago
NeuralOS: Towards Simulating Operating Systems via Neural Generative
  Models
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models
Luke Rivard, Sun Sun, Hongyu Guo, et al.
a month ago
CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive
  Neural Rendering
CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering
Zhengqing Wang, Yuefan Wu, Jiacheng Chen, et al.
a month ago
Test-Time Scaling with Reflective Generative Model
Test-Time Scaling with Reflective Generative Model
Zixiao Wang, Yuxin Wang, Xiaorui Wang, et al.
a month ago
System-of-systems Modeling and Optimization: An Integrated Framework for Intermodal Mobility
System-of-systems Modeling and Optimization: An Integrated Framework for Intermodal Mobility
Paul Saves, Jasper Bussemaker, R\u00e9mi Lafage, et al.
a month ago
All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
All-atom Diffusion Transformers: Unified generative modelling of molecules and materials
Chaitanya K. Joshi, Xiang Fu, Yi-Lun Liao, et al.
a month ago
OST-Bench: Evaluating the Capabilities of MLLMs in Online
  Spatio-temporal Scene Understanding
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
JingLi Lin, Chenming Zhu, Runsen Xu, et al.
a month ago
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and
  Methodology
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
Haochen Wang, Xiangtai Li, Zilong Huang, et al.
a month ago
MIRIX: Multi-Agent Memory System for LLM-Based Agents
MIRIX: Multi-Agent Memory System for LLM-Based Agents
Yu Wang, Xi Chen
a month ago
Skywork-R1V3 Technical Report
Skywork-R1V3 Technical Report
Wei Shen, Jiangbo Pei, Yi Peng, et al.
a month ago
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
Vera Soboleva, Aibek Alanov, Andrey Kuznetsov, et al.
a month ago
Scaling RL to Long Videos
Scaling RL to Long Videos
Yukang Chen, Wei Huang, Baifeng Shi, et al.
a month ago
Critiques of World Models
Critiques of World Models
Eric Xing, Mingkai Deng, Jinyu Hou, et al.
a month ago
Is Diversity All You Need for Scalable Robotic Manipulation?
Is Diversity All You Need for Scalable Robotic Manipulation?
Modi Shi, Li Chen, Jin Chen, et al.
a month ago
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts
Guokan Shang, Hadi Abdine, Ahmad Chamma, et al.
a month ago
GTA1: GUI Test-time Scaling Agent
GTA1: GUI Test-time Scaling Agent
Yan Yang, Dongxu Li, Yutong Dai, et al.
a month ago
MedGen: Unlocking Medical Video Generation by Scaling
  Granularly-annotated Medical Videos
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
Rongsheng Wang, Junying Chen, Ke Ji, et al.
a month ago