Softmax activation

The softmax function takes as input a vector z of K real numbers and normalizes it into a probability distribution of K probabilities, each proportional to the exponential of the corresponding input. Every output component lies strictly between 0 and 1, and the components add up to 1.
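As a minimal sketch of the normalization described above, the following uses plain Python lists and the standard library only. The function name `softmax` and the max-subtraction step are illustrative choices, not something the note prescribes; subtracting the maximum leaves the result unchanged mathematically and only guards against overflow in `exp`.

```python
import math

def softmax(z):
    """Map a list of K real numbers to K probabilities that sum to 1."""
    # Shifting by the maximum does not change the output,
    # but keeps exp() from overflowing on large inputs.
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    # Each output is proportional to the exponential of its input.
    return [e / total for e in exps]

probs = softmax([1.0, 2.0, 3.0])
print(probs)       # three probabilities, largest for the largest input
print(sum(probs))  # approximately 1
```

Because the outputs are proportional to the exponentials, increasing one input by 1 multiplies its share relative to the others by e.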

It is often used as the final layer of a neural-network-based classifier, where it turns the network's raw scores into class probabilities. Such networks are commonly trained with a cross-entropy loss.
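To illustrate how a softmax output layer pairs with cross-entropy, here is a small sketch for a single example with one correct class. The name `cross_entropy`, the argument `target_index`, and the sample scores are hypothetical; the loss is computed as the negative log of the softmax probability of the correct class, written in log-sum-exp form to avoid taking the log of a value that has underflowed to zero.

```python
import math

def cross_entropy(logits, target_index):
    """Negative log of the softmax probability assigned to the correct class."""
    # log(softmax(z)[t]) = z[t] - log(sum_k exp(z[k]));
    # the max shift keeps exp() numerically safe.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(v - m) for v in logits))
    return log_sum_exp - logits[target_index]

scores = [2.0, 1.0, 0.1]          # hypothetical raw outputs of the final layer
print(cross_entropy(scores, 0))   # smaller loss: class 0 has the highest score
print(cross_entropy(scores, 2))   # larger loss: class 2 has the lowest score
```

The loss is small when the network puts most of its probability on the correct class and grows as that probability shrinks.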

[Embedded drawing: Softmax function. Linked note: Kronecker Delta]
