# with short integer indices, and keep raw words for non-dictionary terms. def build_dictionary(texts, max_entries=256): Build a dictionary of the most frequent words from a list of HTTP bodies.