#9 Iλ
Chapter 2 Facts
Number of paragraphs: 185
Number of sentences: 551
Number of tokens: 4,586
Number of unique tokens: 1,311
Number of speakers: 8
Dr. Browne : 328 tokens
Marissa : 284 tokens
Grace : 262 tokens
Sandra Elias : 172 tokens
Computer : 32 tokens
Waiter : 10 tokens
JPL Probe Checker : 4 tokens
Shocked JPL Person : 3 tokens
Direct speech: 23.88% of tokens
Space: 3 sections; 60.49% of tokens
Earth: 2 sections; 39.51% of tokens
Words unusually frequent for Earth sections:
Browne, reporter, Marissa, ArcLight, Petrova.
Words unusually infrequent or lacking for Earth sections:
Stratt, Astrophage, light, I, you.
Words unusually frequent for Space sections:
pendulum, centrifuge, kid, apartment, toilet.
Words unusually infrequent or lacking for Space sections:
he, Rocky, his, Astrophage, you.
For the sentences count, segmentation was performed using spaCy. Tokenization is just based on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are case-insensitive.
Speaker identification was done manually.
Unusually frequent or infrequent words are based on log-likelihood of lemmas (lemmatization by spaCy).