#117 λIλ
Chapter 20 Facts
Number of paragraphs: 181
Number of sentences: 569
Number of tokens: 5,567
Number of unique tokens: 1,357
Number of speakers: 2
Grace : 87 tokens
Computer : 45 tokens
Direct speech: 2.37% of tokens
Space: 4 sections; 100.00% of tokens
Words unusually frequent for Space sections:
box, drill, pressure, upside, painkiller.
Words unusually infrequent or lacking for Space sections:
say, question, you, Taumoeba, tunnel.
For the sentences count, segmentation was performed using spaCy. Tokenization is just based on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are case-insensitive.
Speaker identification was done manually.
Unusually frequent or infrequent words are based on log-likelihood of lemmas (lemmatization by spaCy).