WARNING!

This website contains spoilers for Andy Weir’s Project Hail Mary.
It is recommended you read the book before exploring this site.

#5

Chapter 1 Facts

Number of paragraphs: 247
Number of sentences: 710
Number of tokens: 5,340
Number of unique tokens: 1,346

Number of speakers: 2
    Computer : 142 tokens
    Grace : 114 tokens
Direct speech: 4.81% of tokens

Space: 5 sections; 88.63% of tokens
Earth: 1 section; 11.37% of tokens

Words unusually frequent for Earth sections:
    arc, waitress, nebula, infrared, Irina.
Words unusually infrequent or lacking for Earth sections:
    say, we, she, you, this.

Words unusually frequent for Space sections:
    tube, bed, incorrect, I, ladder.
Words unusually infrequent or lacking for Space sections:
    he, Rocky, ship, Astrophage, his.

For the sentences count, segmentation was performed using spaCy. Tokenization is just based on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are case-insensitive.

Speaker identification was done manually.

Unusually frequent or infrequent words are based on log-likelihood of lemmas (lemmatization by spaCy).