Deep High-Resolution Representation Learning for Visual Recognition

Abstract

High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. It has two key characteristics: (i) it connects the high-to-low resolution convolution streams in parallel; (ii) it repeatedly exchanges information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that HRNet is a stronger backbone for computer vision problems. All the code is available at https://github.com/HRNet.
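The two characteristics described in the abstract (parallel resolution streams and repeated cross-resolution exchange) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how the exchange is performed, so the sketch assumes 2x2 average pooling as a stand-in for downsampling, nearest-neighbour repetition for upsampling, and element-wise summation for fusion. The names `downsample`, `upsample`, `exchange`, and `run_streams` are hypothetical, and the streams carry plain 2D grids rather than learned feature maps.

```python
from typing import List, Tuple

Grid = List[List[float]]


def downsample(x: Grid) -> Grid:
    """Halve resolution with 2x2 average pooling (assumed stand-in for a learned downsampling)."""
    h, w = len(x) // 2, len(x[0]) // 2
    return [
        [
            (x[2 * i][2 * j] + x[2 * i][2 * j + 1]
             + x[2 * i + 1][2 * j] + x[2 * i + 1][2 * j + 1]) / 4.0
            for j in range(w)
        ]
        for i in range(h)
    ]


def upsample(x: Grid) -> Grid:
    """Double resolution by repeating each value in a 2x2 block (nearest neighbour)."""
    out: Grid = []
    for row in x:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def add(a: Grid, b: Grid) -> Grid:
    """Element-wise sum of two grids of equal shape."""
    return [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]


def exchange(high: Grid, low: Grid) -> Tuple[Grid, Grid]:
    """One exchange step: each stream receives the other stream, resampled to its own resolution."""
    new_high = add(high, upsample(low))
    new_low = add(low, downsample(high))
    return new_high, new_low


def run_streams(high: Grid, low: Grid, steps: int) -> Tuple[Grid, Grid]:
    """Keep both streams alive in parallel and exchange information `steps` times."""
    for _ in range(steps):
        high, low = exchange(high, low)
    return high, low
```

The point of the sketch is structural: the high-resolution grid is never discarded and later recovered, as it would be in a serial high-to-low encoder. It keeps its full size at every step while still receiving information from the coarser stream.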

