Q15 – Softmax and Cross Entropy

The softmax function for m classes is given by

p_i = \frac{e^{x_i}}{\sum_{j=1}^m e^{x_j}} \text{ for } i = 1\ldots m.

It transforms a vector (x_i) of real values into a probability mass vector for a categorical distribution. It is often used in conjunction with the cross-entropy loss

L(x, y) = - \sum_{i=1}^m y_i \log p_i,

where y = (y_i) is the target distribution, typically a one-hot vector.
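As a quick illustration, here is a hedged NumPy sketch of these two definitions (the function names `softmax` and `cross_entropy` are mine, not from the post); the max-subtraction trick is a standard stabilization that leaves p_i unchanged:

```python
import numpy as np

def softmax(x):
    # Subtracting max(x) avoids overflow in exp; the ratio p_i is unchanged.
    z = np.exp(x - np.max(x))
    return z / z.sum()

def cross_entropy(x, y):
    # L(x, y) = -sum_i y_i log p_i with p = softmax(x)
    p = softmax(x)
    return -np.sum(y * np.log(p))

x = np.array([2.0, 1.0, 0.1])
y = np.array([1.0, 0.0, 0.0])  # one-hot target on class 1
p = softmax(x)
loss = cross_entropy(x, y)
```

For a one-hot y, the loss reduces to minus the log-probability assigned to the true class.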

  1. Find a simplified expression for p_i when m = 2.
  2. Differentiate p_i with respect to x_k.
  3. Differentiate L with respect to x_k.
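The answers to parts 1 and 3 can be checked numerically. The sketch below (my own, under the assumption that y is a valid probability vector) verifies that for m = 2 the softmax reduces to a sigmoid of the difference, p_1 = \sigma(x_1 - x_2), and that the analytic gradient of the loss is \partial L / \partial x_k = p_k - y_k, by comparing against a central finite difference:

```python
import numpy as np

def softmax(x):
    z = np.exp(x - np.max(x))
    return z / z.sum()

def loss(x, y):
    return -np.sum(y * np.log(softmax(x)))

# Part 1 check: for m = 2, p_1 = sigmoid(x_1 - x_2)
x2 = np.array([0.7, -0.3])
sigmoid_diff = 1.0 / (1.0 + np.exp(-(x2[0] - x2[1])))

# Part 3 check: dL/dx_k = p_k - y_k
rng = np.random.default_rng(0)
x = rng.normal(size=4)
y = np.zeros(4)
y[1] = 1.0  # one-hot target

analytic = softmax(x) - y  # claimed closed-form gradient

# Central-difference numerical gradient for comparison
eps = 1e-6
numeric = np.array([
    (loss(x + eps * np.eye(4)[k], y) - loss(x - eps * np.eye(4)[k], y)) / (2 * eps)
    for k in range(4)
])
```

The m = 2 case explains why binary logistic regression only needs a single logit: one of the two scores can be fixed at zero without loss of generality.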
