(see lecture notes http://www.ics.uci.edu/~smyth/courses/cs277/public_slides/text_classification.pdf)
Assume that data are generated by a d-sided die.