HyperAIHyperAI

Latest Papers

Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

DesignLab: Designing Slides Through Iterative Detection and Correction
DesignLab: Designing Slides Through Iterative Detection and Correction
Jooyeol Yun, Heng Wang, Yotaro Shimose, et al.
a month ago
Yume: An Interactive World Generation Model
Yume: An Interactive World Generation Model
Xiaofeng Mao, Shaoheng Lin, Zhen Li, et al.
a month ago
Pixels, Patterns, but No Poetry: To See The World like Humans
Pixels, Patterns, but No Poetry: To See The World like Humans
Hongcheng Gao, Zihao Huang, Lin Xu, et al.
a month ago
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
Xinyao Liu, Diping Song
a month ago
HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study
HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study
Mandar Pitale, Jelena Frtunikj, Abhinaw Priyadershi, et al.
a month ago
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
Ang Li, Charles Wang, Kaiyu Yue, et al.
a month ago
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking
  Reasoning
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Junhao Shen, Haiteng Zhao, Yuzhe Gu, et al.
a month ago
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers
Wongi Jeong, Kyungryeol Lee, Hoigi Seo, et al.
a month ago
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science
  Reasoning
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Run-Ze Fan, Zengzhi Wang, Pengfei Liu
a month ago
Step-Audio 2 Technical Report
Step-Audio 2 Technical Report
Boyong Wu, Chao Yan, Chen Hu, et al.
a month ago
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning
Hongyin Luo, Nathaniel Morgan, Tina Li, et al.
a month ago
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Xiaoyang Chen, Yunhao Chen, Zeren Chen, et al.
a month ago
Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning
Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning
Mian Ibad Ali Shah, Enda Barrett, Karl Mason
a month ago
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining
Maksim Kuprashevich, Grigorii Alekseenko, Irina Tolstykh, et al.
a month ago
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling
Hayeon Kim, Ji Ha Jang, Se Young Chun
a month ago
WebShaper: Agentically Data Synthesizing via Information-Seeking
  Formalization
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
Zhengwei Tao, Jialong Wu, Wenbiao Yin, et al.
a month ago
The Invisible Leash: Why RLVR May Not Escape Its Origin
The Invisible Leash: Why RLVR May Not Escape Its Origin
Fang Wu, Weihao Xuan, Ximing Lu, et al.
a month ago
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Fei Tang, Zhangxuan Gu, Zhengxi Lu, et al.
a month ago
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via
  Context-Aware Multi-Stage Policy Optimization
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Xingxuan Li, Yao Xiao, Dianwen Ng, et al.
a month ago
Design of intrinsically disordered region binding proteins
Design of intrinsically disordered region binding proteins
Kejia Wu, et al
a month ago
An All-Atom Generative Model for Designing Protein Complexes
An All-Atom Generative Model for Designing Protein Complexes
Ruizhe Chen, Dongyu Xue, Xiangxin Zhou, et al.
a month ago
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services
Fei Zhao, Chonggang Lu, Yue Wang, et al.
a month ago
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
Quang-Binh Nguyen, Minh Luu, Quang Nguyen, et al.
a month ago
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
Gen Luo, Wenhan Dou, Wenhao Li, et al.
a month ago
Franca: Nested Matryoshka Clustering for Scalable Visual Representation
  Learning
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
Shashanka Venkataramanan, Valentinos Pariza, Mohammadreza Salehi, et al.
a month ago
A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges
  in Russian Speech Generative Models
A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models
Kirill Borodin, Nikita Vasiliev, Vasiliy Kudryavtsev, et al.
a month ago
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs
Zichen Wen, Jiashu Qu, Dongrui Liu, et al.
a month ago
PrefPalette: Personalized Preference Modeling with Latent Attributes
PrefPalette: Personalized Preference Modeling with Latent Attributes
Shuyue Stella Li, Melanie Sclar, Hunter Lang, et al.
a month ago
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
Xiaoya Li, Xiaofei Sun, Albert Wang, et al.
a month ago
AnyCap Project: A Unified Framework, Dataset, and Benchmark for
  Controllable Omni-modal Captioning
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning
Yiming Ren, Zhiqiang Lin, Yu Li, et al.
a month ago