#26 +V
Chapter 5 Facts
Number of paragraphs: 329
Number of sentences: 937
Number of tokens: 8,382
Number of unique tokens: 1,848
Number of speakers: 13
Grace : 1420 tokens
Stratt : 855 tokens
Dimitri : 215 tokens
Ms. Xi : 138 tokens
Minister Voigt : 102 tokens
African Diplomat : 67 tokens
Steve (the army guy) : 25 tokens
Air Force Pilot : 22 tokens
Air Force Guide : 18 tokens
U.S. Navy Man : 16 tokens
Translator of Dr. Matsuka : 16 tokens
Helicopter Pilot : 8 tokens
American Woman : 6 tokens
Direct speech: 34.72% of tokens
Earth: 5 sections; 100.00% of tokens
Words unusually frequent for Earth sections:
light, Venus, star, Astrophage, Xi.
Words unusually infrequent or lacking for Earth sections:
she, DuBois, Lokken, mission, her.
For the sentences count, segmentation was performed using spaCy. Tokenization is just based on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are case-insensitive.
Speaker identification was done manually.
Unusually frequent or infrequent words are based on log-likelihood of lemmas (lemmatization by spaCy).