WARNING!

This website contains spoilers for Andy Weir’s Project Hail Mary.
It is recommended you read the book before exploring this site.

#62 I+V

Chapter 11 Facts

Number of paragraphs: 143
Number of sentences: 445
Number of tokens: 4,093
Number of unique tokens: 1,156

Number of speakers: 6
    Stratt : 289 tokens
    Grace : 193 tokens
    Theodore Canton : 154 tokens
    Justice Spencer : 112 tokens
    Bailiff : 47 tokens
    Rocky : 2 tokens
Direct speech: 19.47% of tokens

Space: 2 sections; 80.80% of tokens
Earth: 1 section; 19.20% of tokens

Words unusually frequent for Earth sections:
    justice, Spencer, bailiff, defense, Canton.
Words unusually infrequent or lacking for Earth sections:
    Astrophage, there, know, look, about.

Words unusually frequent for Space sections:
    tape, measure, he, clock, sound.
Words unusually infrequent or lacking for Space sections:
    Astrophage, take, Taumoeba, they, good.

Sentence counts use spaCy's sentence segmentation. Tokens are split on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are counted case-insensitively.
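
The tokenization rule above can be sketched in a few lines of Python. This is only an illustration, not the site's actual script: the names `tokenize` and `count_tokens` are hypothetical, and treating three consecutive periods as an ellipsis (in addition to the single "…" character) is an assumption. Punctuation other than the listed delimiters stays attached to its token, which is what a purely delimiter-based split implies.

```python
import re

# Delimiters named in the note: whitespace, em-dash (U+2014), en-dash (U+2013),
# and ellipsis. Matching "..." as well as U+2026 is an assumption.
_DELIMITERS = re.compile(r"\s+|\u2014|\u2013|\u2026|\.{3}")


def tokenize(text):
    """Split text on the delimiters and drop empty pieces."""
    return [piece for piece in _DELIMITERS.split(text) if piece]


def count_tokens(text):
    """Return (token count, unique token count), unique being case-insensitive."""
    tokens = tokenize(text)
    unique = {token.lower() for token in tokens}
    return len(tokens), len(unique)
```

With this sketch, "The" and "the" count as two tokens but one unique token.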

Speaker identification was done manually.

Unusually frequent or infrequent words are identified by the log-likelihood of their lemmas (lemmatization by spaCy).
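
The note does not say which log-likelihood formulation is used. A common choice for this kind of keyword comparison is the two-corpus G² statistic, sketched below under that assumption. The function name `signed_log_likelihood` and the sign convention (positive for unusually frequent, negative for unusually infrequent) are illustrative and not taken from the site.

```python
import math


def signed_log_likelihood(count_target, count_reference, size_target, size_reference):
    """Two-corpus G2 score for one lemma (assumed formulation).

    count_target / count_reference: occurrences of the lemma in each corpus.
    size_target / size_reference: total lemma counts of each corpus.
    The result is positive when the lemma is relatively more frequent in the
    target corpus and negative when it is relatively less frequent.
    """
    total = count_target + count_reference
    # Expected counts if the lemma were spread evenly across both corpora.
    expected_target = size_target * total / (size_target + size_reference)
    expected_reference = size_reference * total / (size_target + size_reference)

    g2 = 0.0
    if count_target > 0:  # a zero count contributes nothing to the sum
        g2 += count_target * math.log(count_target / expected_target)
    if count_reference > 0:
        g2 += count_reference * math.log(count_reference / expected_reference)
    g2 *= 2

    overused = count_target / size_target > count_reference / size_reference
    return g2 if overused else -g2
```

Here the target would be, for example, the chapter's Earth sections, and the reference whatever text they are compared against. Ranking lemmas by this score and taking the top and bottom few would yield lists shaped like the ones above.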