# Activation Function Comparison
## Description
An interactive visualization comparing sigmoid, tanh, ReLU, and Leaky ReLU activation functions.
## Learning Objectives
- Compare shapes and output ranges of common activation functions
- Understand derivatives and gradient flow through different activations
- Recognize saturation regions and vanishing gradient problems
- Identify the dying neuron problem in ReLU and how Leaky ReLU addresses it
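As a quick illustration of why saturation matters, the sketch below (a standalone example, not part of the visualization) backpropagates through a stack of sigmoid layers; since the sigmoid derivative never exceeds 0.25, the gradient factor shrinks geometrically with depth.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)  # never exceeds 0.25

# Best case for sigmoid: every pre-activation sits at x = 0, where the
# derivative is maximal (0.25). Even then the gradient factor collapses.
grad = 1.0
for layer in range(1, 11):
    grad *= sigmoid_grad(0.0)
    print(f"after layer {layer:2d}: gradient factor = {grad:.2e}")
```

After ten layers the factor is at most 0.25¹⁰ ≈ 9.5 × 10⁻⁷, which is the vanishing-gradient effect the derivative toggle makes visible.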
## How to Use
- Adjust x: Slide to change the input value and see function outputs
- Show Derivatives: Toggle to display derivative curves (dashed lines)
- Leaky ReLU α: Adjust the negative slope parameter for Leaky ReLU
- Highlight Saturation: Toggle to show saturation zones (yellow regions)
- Comparison Mode: View all functions overlaid on a single plot
## Key Concepts
### Sigmoid
- Output range: (0, 1)
- Saturates at extremes (vanishing gradient)
- Used in binary classification output layers
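A minimal NumPy sketch of sigmoid and its derivative (separate from the tool itself) makes the saturation concrete: at |x| = 10 both the output and the gradient are within about 5 × 10⁻⁵ of their limits.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_derivative(x):
    s = sigmoid(x)
    return s * (1.0 - s)  # maximum value 0.25, reached at x = 0

for x in (-10.0, -2.0, 0.0, 2.0, 10.0):
    print(f"x = {x:+5.1f}  sigmoid = {sigmoid(x):.5f}  derivative = {sigmoid_derivative(x):.6f}")
```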
### Tanh
- Output range: (-1, 1)
- Zero-centered (unlike sigmoid), which keeps gradient updates better balanced
- Still suffers from vanishing gradients
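The same kind of standalone sketch for tanh shows the zero-centered outputs and a derivative that peaks at 1.0 (rather than sigmoid's 0.25) but still vanishes in the tails.

```python
import numpy as np

def tanh_derivative(x):
    t = np.tanh(x)
    return 1.0 - t ** 2  # peaks at 1.0 when x = 0, vanishes at the extremes

for x in (-5.0, -1.0, 0.0, 1.0, 5.0):
    print(f"x = {x:+4.1f}  tanh = {np.tanh(x):+.5f}  derivative = {tanh_derivative(x):.6f}")
```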
### ReLU (Rectified Linear Unit)
- Output range: [0, ∞)
- Fast to compute; does not saturate for positive inputs, so gradients there do not vanish
- Can have "dying neurons" (stuck at zero)
- Most common for hidden layers
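A short sketch of ReLU and its derivative (an assumed example, not the tool's code) shows where dying neurons come from: the derivative is exactly zero for every negative input, so a neuron whose pre-activation stays negative receives no gradient and cannot recover.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def relu_derivative(x):
    # 1 for positive inputs, 0 for negative inputs (and 0 at x = 0 here)
    return (x > 0).astype(float)

x = np.array([-3.0, -0.5, 0.5, 3.0])
print(relu(x))             # [0.   0.   0.5  3. ]
print(relu_derivative(x))  # [0. 0. 1. 1.]
```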
### Leaky ReLU
- Output range: (-∞, ∞)
- Small negative slope prevents dying neurons
- Keeps ReLU's speed and non-saturation while preserving gradient flow for negative inputs, as the sketch below shows
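A matching sketch for Leaky ReLU (the alpha argument here mirrors the α slider above; the code is an assumption, not the tool's implementation) shows how the small negative slope keeps the gradient nonzero everywhere.

```python
import numpy as np

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)

def leaky_relu_derivative(x, alpha=0.01):
    return np.where(x > 0, 1.0, alpha)  # never exactly zero, unlike ReLU

x = np.array([-3.0, -0.5, 0.5, 3.0])
print(leaky_relu(x))             # [-0.03  -0.005  0.5    3.   ]
print(leaky_relu_derivative(x))  # [0.01 0.01 1.   1.  ]
```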
## Interactive Features
- 2×2 Grid View: Compare all four functions simultaneously
- Comparison Mode: Overlay all functions on one plot
- Real-time Derivatives: See gradient values at any input
- Property Table: Quick reference for key characteristics
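For readers who want to reproduce the comparison-mode overlay offline, here is a minimal matplotlib sketch; the styling and the α = 0.1 slope are arbitrary choices, not taken from the visualization.

```python
import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-5, 5, 400)
curves = {
    "sigmoid": 1.0 / (1.0 + np.exp(-x)),
    "tanh": np.tanh(x),
    "ReLU": np.maximum(0.0, x),
    "Leaky ReLU (α = 0.1)": np.where(x > 0, x, 0.1 * x),
}

for name, y in curves.items():
    plt.plot(x, y, label=name)

plt.axhline(0, color="gray", linewidth=0.5)
plt.axvline(0, color="gray", linewidth=0.5)
plt.legend()
plt.title("Activation functions overlaid on one plot")
plt.show()
```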