#62 I+V
Chapter 11 Facts
Number of paragraphs: 143
Number of sentences: 445
Number of tokens: 4,093
Number of unique tokens: 1,156
Number of speakers: 6
Stratt : 289 tokens
Grace : 193 tokens
Theodore Canton : 154 tokens
Justice Spencer : 112 tokens
Bailiff : 47 tokens
Rocky : 2 tokens
Direct speech: 19.47% of tokens
Space: 2 sections; 80.80% of tokens
Earth: 1 section; 19.20% of tokens
Words unusually frequent for Earth sections:
justice, Spencer, bailiff, defense, Canton.
Words unusually infrequent or lacking for Earth sections:
Astrophage, there, know, look, about.
Words unusually frequent for Space sections:
tape, measure, he, clock, sound.
Words unusually infrequent or lacking for Space sections:
Astrophage, take, Taumoeba, they, good.
For the sentence count, segmentation was performed using spaCy. Tokens are split only on whitespace, em-dashes, en-dashes, and ellipses. Unique-token counts are case-insensitive.
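The tokenization rule above can be sketched roughly as follows. This is a minimal illustration, not the script that produced the counts: the function names are hypothetical, treating three ASCII periods as an ellipsis is an assumption, and punctuation is left attached to tokens because the note names no other delimiters.

```python
import re

# Delimiters named in the note: whitespace, em-dash (U+2014),
# en-dash (U+2013), and ellipsis (U+2026).
# Assumption: three ASCII periods also count as an ellipsis.
DELIMITERS = re.compile(r"\s+|\u2014|\u2013|\u2026|\.\.\.")


def tokenize(text):
    """Split text on the delimiters and drop empty strings."""
    return [piece for piece in DELIMITERS.split(text) if piece]


def count_tokens(text):
    """Return (number of tokens, number of case-insensitive unique tokens)."""
    tokens = tokenize(text)
    unique = {token.lower() for token in tokens}
    return len(tokens), len(unique)
```

Under this sketch, "The" and "the" count as one unique token, while "what?" and "what" stay distinct because punctuation is not stripped.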
Speaker identification was done manually.
Unusually frequent or infrequent words are identified by the log-likelihood of their lemmas (lemmatization by spaCy).
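For readers unfamiliar with the log-likelihood measure, here is a minimal sketch of one common formulation (the two-corpus G2 statistic). The exact formula and the reference corpus used for these lists are not stated, so both are assumptions, and the function and parameter names are hypothetical.

```python
import math


def log_likelihood(count_target, count_reference, size_target, size_reference):
    """G2 statistic for one lemma across two corpora (assumed formulation).

    count_*: occurrences of the lemma in each corpus.
    size_*:  total number of tokens in each corpus.
    """
    total = count_target + count_reference
    combined_size = size_target + size_reference
    # Expected counts if the lemma were spread evenly across both corpora.
    expected_target = size_target * total / combined_size
    expected_reference = size_reference * total / combined_size
    g2 = 0.0
    for observed, expected in (
        (count_target, expected_target),
        (count_reference, expected_reference),
    ):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


def signed_keyness(count_target, count_reference, size_target, size_reference):
    """Positive if the lemma is relatively more frequent in the target corpus."""
    g2 = log_likelihood(count_target, count_reference, size_target, size_reference)
    higher_in_target = count_target / size_target > count_reference / size_reference
    return g2 if higher_in_target else -g2
```

Sorting lemmas by the signed score would put unusually frequent words at the top and unusually infrequent or missing words at the bottom, which matches the shape of the lists above.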