HyperAIHyperAI

Latest Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding
UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding
Shuquan Lian, Yuhang Wu, Jia Ma, et al.
a month ago
DualSG: A Dual-Stream Explicit Semantic-Guided Multivariate Time Series Forecasting Framework
DualSG: A Dual-Stream Explicit Semantic-Guided Multivariate Time Series Forecasting Framework
Kuiye Ding, Fanda Fan, Yao Wang, et al.
a month ago
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token
  Compression across Images, Videos, and Audios
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Kele Shao, Keda Tao, Kejia Zhang, et al.
a month ago
SmallThinker: A Family of Efficient Large Language Models Natively
  Trained for Local Deployment
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment
Yixin Song, Zhenliang Xue, Dongliang Wei, et al.
a month ago
Reconstructing 4D Spatial Intelligence: A Survey
Reconstructing 4D Spatial Intelligence: A Survey
Yukang Cao, Jiahao Lu, Zhisheng Huang, et al.
a month ago
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for
  Multi-Task Learning
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Zedong Wang, Siyuan Li, Dan Xu
a month ago
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World
  Shorts
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts
Yuying Ge, Yixiao Ge, Chen Li, et al.
a month ago
Agentic Reinforced Policy Optimization
Agentic Reinforced Policy Optimization
Guanting Dong, Hangyu Mao, Kai Ma, et al.
a month ago
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Huan-ang Gao, Jiayi Geng, Wenyue Hua, et al.
a month ago
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
Keyan Ding, Jing Yu, Junjie Huang, et al.
a month ago
Specification Self-Correction: Mitigating In-Context Reward Hacking
  Through Test-Time Refinement
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement
V\u00edctor Gallego
a month ago
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving
Maciej K. Wozniak, Lianhang Liu, Yixi Cai, et al.
a month ago
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI
Jiangkai Wu, Zhiyuan Ren, Liming Liu, et al.
a month ago
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI
  Agents
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
Xuehui Wang, Zhenyu Wu, JingJing Xie, et al.
a month ago
Deep Researcher with Test-Time Diffusion
Deep Researcher with Test-Time Diffusion
Rujun Han, Yanfei Chen, Zoey CuiZhu, et al.
a month ago
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
Jiale Chen, Torsten Hoefler, Dan Alistarh
a month ago
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
Siyi Xun, Yue Sun, Jingkun Chen, et al.
a month ago
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?
Xuetian Chen, Yinghao Chen, Xinfeng Yuan, et al.
a month ago
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Shangke Lyu, Linjuan Wu, Yuchen Yan, et al.
a month ago
Captain Cinema: Towards Short Movie Generation
Captain Cinema: Towards Short Movie Generation
Junfei Xiao, Ceyuan Yang, Lvmin Zhang, et al.
a month ago
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
Xingyu Wu, Yuchen Yan, Shangke Lyu, et al.
a month ago
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
Hang Yan, Fangzhi Xu, Rongman Xu, et al.
a month ago
NABLA: Neighborhood Adaptive Block-Level Attention
NABLA: Neighborhood Adaptive Block-Level Attention
Dmitrii Mikhailov, Aleksey Letunovskiy, Maria Kovaleva, et al.
a month ago
Group Sequence Policy Optimization
Group Sequence Policy Optimization
Chujie Zheng, Shixuan Liu, Mingze Li, et al.
a month ago
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
Yicheng Bao, Guanxu Chen, Mingkang Chen, et al.
a month ago
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory
Mutian Yang, Jiandong Gao, Ji Wu
a month ago
Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Chuanhao Yan, Fengdi Che, Xuhan Huang, et al.
a month ago
RAVine: Reality-Aligned Evaluation for Agentic Search
RAVine: Reality-Aligned Evaluation for Agentic Search
Yilong Xu, Xiang Long, Zhi Zheng, et al.
a month ago
Can One Domain Help Others? A Data-Centric Study on Multi-Domain
  Reasoning via Reinforcement Learning
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning
Yu Li, Zhuoshi Pan, Honglin Lin, et al.
a month ago
DesignLab: Designing Slides Through Iterative Detection and Correction
DesignLab: Designing Slides Through Iterative Detection and Correction
Jooyeol Yun, Heng Wang, Yotaro Shimose, et al.
a month ago