#186 V̶Iℓ
Chapter 30 Facts
Number of paragraphs: 77
Number of sentences: 227
Number of tokens: 1,989
Number of unique tokens: 717
Number of speakers: 2
Rocky: 269 tokens
Grace: 188 tokens
Direct speech: 22.98% of tokens
Space: 1 section; 100.00% of tokens
Words unusually frequent for Space sections:
thrum, dome, vitamin, Sol, meeting.
Words unusually infrequent or lacking for Space sections:
ship, the, look, question, hand.
For the sentence count, segmentation was performed using spaCy. Tokenization splits only on whitespace, em dashes, en dashes, and ellipses. Unique tokens are counted case-insensitively.
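As a rough illustration of the tokenization rule described above, here is a minimal Python sketch. It is not the script used to produce these figures. The function names `tokenize` and `count_unique` are hypothetical, and two details are assumptions: that both the single-character ellipsis and a run of three or more periods count as an ellipsis, and that other punctuation stays attached to its token.

```python
import re

# Delimiters named in the notes: whitespace, em dash, en dash, ellipsis.
# Assumption: "…" (U+2026) and runs of three or more periods both count
# as an ellipsis. Other punctuation is left attached to the token.
_DELIMITERS = re.compile(r"\s+|\u2014|\u2013|\u2026|\.{3,}")


def tokenize(text):
    """Split text on the delimiters above and drop empty strings."""
    return [tok for tok in _DELIMITERS.split(text) if tok]


def count_unique(tokens):
    """Count distinct tokens, ignoring case."""
    return len({tok.casefold() for tok in tokens})


if __name__ == "__main__":
    sample = "Good\u2014good. Very good\u2026 yes"
    tokens = tokenize(sample)
    print(tokens)
    print(len(tokens), "tokens,", count_unique(tokens), "unique")
```

Under these assumptions "good." and "good" count as different tokens, because the trailing period is not a delimiter. A tokenizer that strips punctuation would give slightly different totals.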
Speaker identification was done manually.
Unusually frequent or infrequent words are identified by the log-likelihood of their lemmas (lemmatization by spaCy).
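The notes do not say which log-likelihood statistic was used. One common choice in corpus linguistics is the two-corpus log-likelihood ratio (often written G2), sketched below on the assumption that this is the kind of measure meant. The function names and the example counts are hypothetical and are not taken from this chapter's data.

```python
import math


def log_likelihood(freq_a, freq_b, size_a, size_b):
    """Two-corpus log-likelihood (G2) for one lemma.

    freq_a, freq_b: occurrences of the lemma in corpus A and corpus B.
    size_a, size_b: total lemma counts of corpus A and corpus B.
    """
    total_freq = freq_a + freq_b
    total_size = size_a + size_b
    # Expected counts if the lemma were spread evenly across both corpora.
    expected_a = size_a * total_freq / total_size
    expected_b = size_b * total_freq / total_size
    score = 0.0
    for observed, expected in ((freq_a, expected_a), (freq_b, expected_b)):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2 * score


def is_overused(freq_a, freq_b, size_a, size_b):
    """True if the lemma is relatively more frequent in corpus A than in B."""
    return freq_a / size_a > freq_b / size_b


if __name__ == "__main__":
    # Hypothetical counts: a lemma seen 20 times in 1,000 lemmas of one
    # section type versus 5 times in 1,000 lemmas of the rest of the text.
    print(log_likelihood(20, 5, 1000, 1000))
    print(is_overused(20, 5, 1000, 1000))
```

The score alone only says how surprising the difference is. The direction check is what separates "unusually frequent" from "unusually infrequent" lemmas.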