#154 +I+
Chapter 26 Facts
Number of paragraphs: 92
Number of sentences: 355
Number of tokens: 3,358
Number of unique tokens: 1,102
Number of speakers: 2
Stratt : 644 tokens
Grace : 87 tokens
Direct speech: 21.77% of tokens
Space: 2 sections; 69.92% of tokens
Earth: 1 sections; 30.08% of tokens
Words unusually frequent for Earth sections:
food, war, famine, history, door.
Words unusually infrequent or lacking for Earth sections:
Stratt, he, light, a, need.
Words unusually frequent for Space sections:
farm, powder, ish, will, matter.
Words unusually infrequent or lacking for Space sections:
you, question, this, Rocky, second.
For the sentences count, segmentation was performed using spaCy. Tokenization is just based on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are case-insensitive.
Speaker identification was done manually.
Unusually frequent or infrequent words are based on log-likelihood of lemmas (lemmatization by spaCy).