#39 Iℓλ
Chapter 07 Facts
Number of paragraphs: 156
Number of sentences: 504
Number of tokens: 4,849
Number of unique tokens: 1,252
Number of speakers: 1
Grace: 41 tokens
Direct speech: 0.85% of tokens
Space: 2 sections; 100.00% of tokens
Words unusually frequent for Space sections:
cylinder, suit, alien, panel, ship.
Words unusually infrequent or lacking for Space sections:
he, Rocky, his, Taumoeba, question.
For the sentence count, segmentation was performed using spaCy. Tokenization is based only on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are counted case-insensitively.
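The token counting described above might look roughly like the following. This is a minimal sketch, not the script actually used: the function names `tokenize` and `count_tokens` are hypothetical, and treating three ASCII dots as an ellipsis is an assumption.

```python
import re

# Assumed delimiter set: whitespace, em-dash (U+2014), en-dash (U+2013),
# and ellipsis (the single character U+2026 or three ASCII dots).
DELIMITERS = re.compile(r"(?:\s|\u2014|\u2013|\u2026|\.\.\.)+")


def tokenize(text):
    """Split text on the delimiters and drop empty strings."""
    return [tok for tok in DELIMITERS.split(text) if tok]


def count_tokens(text):
    """Return (number of tokens, number of case-insensitive unique tokens)."""
    tokens = tokenize(text)
    unique = {tok.lower() for tok in tokens}
    return len(tokens), len(unique)
```

Because only these delimiters are used, other punctuation stays attached to its word, so "ship," and "ship" would count as different tokens under this sketch.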
Speaker identification was done manually.
Unusually frequent or infrequent words are identified by the log-likelihood of their lemmas (lemmatization by spaCy).
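As an illustration of the log-likelihood measure, here is a sketch of one common formulation for comparing a lemma's frequency in a target set of sections against a reference corpus. The exact variant and the reference corpus are not stated here, so the two-term formula, the function names, and the sign convention are all assumptions.

```python
import math


def log_likelihood(count_target, total_target, count_ref, total_ref):
    """Log-likelihood (G2) of a lemma's frequency in target vs. reference.

    count_* are occurrences of the lemma; total_* are corpus sizes in tokens.
    """
    combined = count_target + count_ref
    total = total_target + total_ref
    # Expected counts if the lemma were equally frequent in both corpora.
    expected_target = total_target * combined / total
    expected_ref = total_ref * combined / total

    ll = 0.0
    for observed, expected in ((count_target, expected_target),
                               (count_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            ll += observed * math.log(observed / expected)
    return 2.0 * ll


def signed_log_likelihood(count_target, total_target, count_ref, total_ref):
    """Positive when the lemma is relatively more frequent in the target."""
    ll = log_likelihood(count_target, total_target, count_ref, total_ref)
    overused = count_target / total_target >= count_ref / total_ref
    return ll if overused else -ll
```

Ranking lemmas by the signed score would put unusually frequent words at the top and unusually infrequent or lacking words at the bottom.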