Command Palette
Search for a command to run...
Rethinking Recurrent Neural Networks and Other Improvements for Image Classification
Nguyen Huu Phong Bernardete Ribeiro

Abstract
Over the long history of machine learning, which dates back several decades, recurrent neural networks (RNNs) have been used mainly for sequential data and time series and generally with 1D information. Even in some rare studies on 2D images, these networks are used merely to learn and generate data sequentially rather than for image recognition tasks. In this study, we propose integrating an RNN as an additional layer when designing image recognition models. We also develop end-to-end multimodel ensembles that produce expert predictions using several models. In addition, we extend the training strategy so that our model performs comparably to leading models and can even match the state-of-the-art models on several challenging datasets (e.g., SVHN (0.99), Cifar-100 (0.9027) and Cifar-10 (0.9852)). Moreover, our model sets a new record on the Surrey dataset (0.949). The source code of the methods provided in this article is available at https://github.com/leonlha/e2e-3m and http://nguyenhuuphong.me.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-cifar-10 | E2E-3M | Percentage correct: 98.52 |
| image-classification-on-cifar-100 | E2E-3M | Percentage correct: 90.27 |
| image-classification-on-fashion-mnist | E2E-3M | Percentage error: 4.08 |
| image-classification-on-icassava-19 | E2E-3M | Top-1 Accuracy: 0.9368 |
| image-classification-on-surrey-asl | E2E-3M | Accuracy (%): 94.90 |
| image-classification-on-svhn | E2E-M3 | Percentage error: 1.0 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.