Command Palette
Search for a command to run...
Sanghyun Byun Kayvan Shah Ayushi Gang Christopher Apton Jacob Song Woo Seong Chung

Abstract
Many state-of-the-art computer vision architectures leverage U-Net for its adaptability and efficient feature extraction. However, the multi-resolution convolutional design often leads to significant computational demands, limiting deployment on edge devices. We present a streamlined alternative: a 1D convolutional encoder that retains accuracy while enhancing its suitability for edge applications. Our novel encoder architecture achieves semantic segmentation through channel-wise 1D convolutions combined with pixel-unshuffle operations. By incorporating PixelShuffle, known for improving accuracy in super-resolution tasks while reducing computational load, OneNet captures spatial relationships without requiring 2D convolutions, reducing parameters by up to 47%. Additionally, we explore a fully 1D encoder-decoder that achieves a 71% reduction in size, albeit with some accuracy loss. We benchmark our approach against U-Net variants across diverse mask-generation tasks, demonstrating that it preserves accuracy effectively. Although focused on image segmentation, this architecture is adaptable to other convolutional applications. Code for the project is available at https://github.com/shbyun080/OneNet .
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-segmentation-on-msd-heart | OneNete,4 | mIoU: 6.6 |
| image-segmentation-on-oxfordpets | OneNete,4-C | Dice Score: 0.967 |
| image-segmentation-on-pascal-voc | OneNeted,4 | mIoU: 14.9 |
| image-segmentation-on-pascal-voc | OneNete,4-S | mAP0.5: 52.75 |
| image-segmentation-on-pascal-voc | OneNete,4-C | mIoU: 63.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.