Statistics

See also: Statistics.

Probability

Distributions

Distributions

Estimation

Design of experiments (DOE)

Maximum likelihood estimation (MLE) finds parameters to maximize P(data).

Linear regression

Binary logistic regression

Multinomial logistic regression

KL distance or relative entropy from Q to P: \(\KL(p\|q) = \E_p \log\frac{p}q\).

Testing

https://en.wikipedia.org/wiki/Bayesian_probability

Curse of dimensionality

Graphical models

Variational inference

https://en.m.wikipedia.org/wiki/Metropolis–Hastings_algorithm
https://en.m.wikipedia.org/wiki/Rejection_sampling#Adaptive_rejection_sampling
https://en.m.wikipedia.org/wiki/Inverse_transform_sampling
https://en.m.wikipedia.org/wiki/Box–Muller_transform

https://jotterbach.github.io/content/posts/tsne/2016-05-23-TSNE/
https://strathprints.strath.ac.uk/52372/1/Connor_etal_LNCS2013_Evaluation_Jensen_Shannon_distance_over_sparse_data.pdf
https://github.com/cran/entropy/blob/master/R/KL.plugin.R
https://github.com/cran/entropy/blob/master/R/entropy.empirical.R
https://www.stefanom.io/notes/2021/02/25/concept_drift.html
http://web.archive.org/web/20150121224302/https://www.tsc.uc3m.es/~fernando/bare_conf3.pdf
https://notesonai.com/Jensen%E2%80%93Shannon+Divergence

Variational inference
https://arxiv.org/abs/1606.05908
https://arxiv.org/abs/1312.6114
https://arxiv.org/abs/2108.13083