Courage to Learn ML: Tackling Vanishing and Exploding Gradients (Part 2)
A Comprehensive Survey on Activation Functions, Weights Initialization, Batch Normalization, and Their Applications in PyTorch

Source: Towards Data Science