Command Palette
Search for a command to run...
{Evgenii Pishchik}
Abstract
Non-linear activation functions are one of the main parts of deep neural network architectures. The choice of the activation function can affect model speed, performance and convergence. Most popular activation functions don't have any trainable parameters and don't alter during the training. We propose different activation functions with and without trainable parameters. Said activation functions have a number of advantages and disadvantages. We'll be testing the performance of said activation functions and comparing the results with widely known activation function ReLU. We assume that the activation functions with trainable parameters can outperform functions without ones, because the trainable parameters allow the model to "select'' the type of each of the activation functions itself, however, this strongly depends on the architecture of the deep neural network and the activation function itself.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-cifar-10 | ResNet-44 (Trainable Activations) | Percentage correct: 90.5 |
| image-classification-on-cifar-10 | ResNet-56 (Trainable Activations) | Percentage correct: 88.8 |
| image-classification-on-cifar-10 | ResNet-8 (Trainable Activations) | Percentage correct: 86.5 |
| image-classification-on-cifar-10 | ResNet-32 (Trainable Activations) | Percentage correct: 90.9 |
| image-classification-on-cifar-10 | ResNet-14 (Trainable Activations) | Percentage correct: 89.0 |
| image-classification-on-cifar-10 | ResNet-26 (Trainable Activations) | Percentage correct: 91.1 |
| image-classification-on-cifar-10 | ResNet-20 (Trainable Activations) | Percentage correct: 90.4 |
| image-classification-on-mnist | DNN-3 (Trainable Activations) | Accuracy: 97.0 Percentage error: 3.0 Trainable Parameters: 386719 |
| image-classification-on-mnist | DNN-2 (Trainable Activations) | Accuracy: 96.4 Percentage error: 3.6 Trainable Parameters: 311651 |
| image-classification-on-mnist | DNN-5 (Trainable Activations) | Accuracy: 97.2 Percentage error: 2.8 Trainable Parameters: 575051 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.