Categorical Cross Entropy

These are some additional notes that I am taking on the incredible book by David Foster on Generative Deep Learning

The calculation for this is:

$$\Huge - \overset{n}{\underset{i = 1}{\Sigma}} y_i\space log(p_i)$$

This simply shows the log of the prediction times the ground truth. Then you sum all of this up for each data point.