Bayesian statistics

This is just a note to self.

\[P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}\]

$P(A|B)$ is called the posterior probability.
$P(B|A)$ is called the likelihood.
$P(A)$ is called the prior probability.
$P(B)$ is called the marginal likelihood (also known as the evidence).
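
As a quick numerical check of the formula, here is a minimal sketch that computes the posterior from the other three quantities. The function name `posterior`, its parameter names, and the numbers are all made up for illustration; the marginal likelihood $P(B)$ is obtained from the law of total probability, assuming only two cases, $A$ and not $A$.

```python
def posterior(prior, likelihood, likelihood_given_not_a):
    """Return P(A|B) given P(A), P(B|A), and P(B|not A)."""
    # Marginal likelihood P(B) = P(B|A) P(A) + P(B|not A) P(not A).
    marginal = likelihood * prior + likelihood_given_not_a * (1.0 - prior)
    # Bayes' theorem: posterior = likelihood * prior / marginal likelihood.
    return likelihood * prior / marginal


# Hypothetical numbers: P(A) = 0.01, P(B|A) = 0.95, P(B|not A) = 0.05.
p = posterior(prior=0.01, likelihood=0.95, likelihood_given_not_a=0.05)
print(p)  # roughly 0.16
```

Even with a high likelihood, the small prior keeps the posterior low, which is the usual reason to write the prior down explicitly.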

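Maximum likelihood estimation (MLE) chooses the $A$ that maximizes the likelihood $\mathcal{L} = P(B|A)$, or equivalently, minimizes $-\log \mathcal{L}$. Maximum a posteriori (MAP) estimation instead maximizes the posterior $\mathcal{P} = P(A|B)$, i.e. minimizes $-\log \mathcal{P} = -\log P(B|A) - \log P(A) + \log P(B)$. Since $P(B)$ does not depend on $A$, it drops out of the minimization, so MAP is MLE with the extra prior term $-\log P(A)$. With a flat prior the two estimates coincide.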
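
To see the difference between the two estimates, here is a small sketch for a coin-flip model, where $A$ is the coin's bias $\theta$ and $B$ is the observed flips. The Beta(2, 2) prior, the counts, and all function names are assumptions chosen for illustration, and the minimization is a plain grid search rather than a proper optimizer.

```python
import math


def neg_log_likelihood(theta, heads, tails):
    # -log P(B|A) for independent coin flips (constant terms dropped).
    return -(heads * math.log(theta) + tails * math.log(1.0 - theta))


def neg_log_prior(theta, a, b):
    # -log P(A) for a Beta(a, b) prior on theta (normalization constant dropped).
    return -((a - 1.0) * math.log(theta) + (b - 1.0) * math.log(1.0 - theta))


def argmin_on_grid(objective, n=9999):
    # Evaluate the objective on an evenly spaced grid strictly inside (0, 1).
    grid = [(i + 1) / (n + 1) for i in range(n)]
    return min(grid, key=objective)


heads, tails = 7, 3   # hypothetical data B
a, b = 2.0, 2.0       # hypothetical prior, mildly favouring theta = 0.5

# MLE: minimize -log L only.
theta_mle = argmin_on_grid(lambda t: neg_log_likelihood(t, heads, tails))

# MAP: minimize -log L - log P(A); P(B) is constant in theta and is omitted.
theta_map = argmin_on_grid(
    lambda t: neg_log_likelihood(t, heads, tails) + neg_log_prior(t, a, b)
)

print(theta_mle, theta_map)
```

For this model the closed forms are heads / (heads + tails) for MLE and (heads + a - 1) / (heads + tails + a + b - 2) for MAP, so the grid search should land near 0.7 and 2/3 respectively: the prior pulls the MAP estimate towards 0.5.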