Words ranking and Hirsch index for identifying the core of the hapaxes in political texts
This paper presents a quantitative analysis of the content of official political speeches. We study a set of about one thousand speeches delivered by US Presidents, from Washington to Trump. In particular, we investigate the relevance of rare words, i.e. those said only once in a given speech, the so-called hapaxes. We implement a rank-size procedure of Zipf–Mandelbrot type to discuss the regularity of the hapaxes’ frequencies over the whole set of speeches.
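As an illustration only, the quantities named above can be sketched in a few lines of Python. The sketch rests on several assumptions that are not stated in the text: the tokenization rule, the reading of a hapax's "frequency" as the number of speeches in which the word occurs exactly once, the three-parameter form f(r) = alpha / (r + beta)^gamma for the Zipf–Mandelbrot law, and the use of the Hirsch index as the largest h such that h hapaxes each have frequency at least h. All function and parameter names are hypothetical.

```python
import re
from collections import Counter


def hapaxes(text):
    """Return the set of words that occur exactly once in a single speech.

    Assumption: words are lowercase runs of letters and apostrophes.
    """
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    return {word for word, count in counts.items() if count == 1}


def hapax_frequencies(speeches):
    """Count, for each word, the number of speeches in which it is a hapax.

    Assumption: this is what "frequency of a hapax" means over the corpus.
    """
    freq = Counter()
    for speech in speeches:
        freq.update(hapaxes(speech))
    return freq


def zipf_mandelbrot(rank, alpha, beta, gamma):
    """Rank-size law f(r) = alpha / (r + beta) ** gamma (assumed form)."""
    return alpha / (rank + beta) ** gamma


def h_index(values):
    """Largest h such that at least h of the values are >= h."""
    ranked = sorted(values, reverse=True)
    h = 0
    for rank, value in enumerate(ranked, start=1):
        if value >= rank:
            h = rank
        else:
            break
    return h
```

With a corpus loaded as a list of strings, `hapax_frequencies(speeches).most_common()` gives the ranked sizes to which `zipf_mandelbrot` could be fitted, and `h_index(hapax_frequencies(speeches).values())` gives one possible cut-off for a "core" of hapaxes under the assumptions above.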