The centroids are for the unlabelled data are obtained as follows
$$c_i^U = \alpha \sum_{i=1}^o \frac{f_{c,\iota}^U}{\lVert {f_{c,\iota}^U} \rVert_2} \\ \alpha = \frac{1}{M} \sum_{j=1}^M \lVert {c_j^L} \rVert_2$$
The weight update for $w_n$, the columns of $W$(i.e. learning rate $\eta$ times loss gradient with respect to $w_n$) can be shown to be:
$$\Delta w_n = -\eta\nabla_{w_n}l = -\eta\sum_{f \in I_n}(1-p_n)f + \eta\sum_{f \notin I_n}p_n'f $$
$$ p_n = \frac{\exp y_n}{\sum_{n'=1}^{N}{\exp y_{n'}}} \text{ }\text{(i.e. predicted probabilty that the class of the example is $n$)}$$
$$ y = W^Tf$$