It might be a naive question but I am really struggling with this. I have looked at number of papers and articles that present the formula of the Mutual information and the Conditional Mutual information.. they usually write it in two ways :
and
Are those formulas actually equivalent ? and does PX(x)
actually means the same as P(x)
?