WARNING!

This website contains spoilers for Andy Weir’s Project Hail Mary.
It is recommended you read the book before exploring this site.

#98 V+V

Chapter 17 Facts

Number of paragraphs: 219
Number of sentences: 569
Number of tokens: 4,653
Number of unique tokens: 1,295

Number of speakers: 5
    Grace : 532 tokens
    Rocky : 338 tokens
    DuBois : 217 tokens
    Forrester : 120 tokens
    Shapiro : 65 tokens
Direct speech: 27.29% of tokens

Space: 2 sections; 70.96% of tokens
Earth: 1 section; 29.04% of tokens

Words unusually frequent for Earth sections:
    tool, Annie, Forrester, pool, Shapiro.
Words unusually infrequent or lacking for Earth sections:
    light, do, a, then, energy.

Words unusually frequent for Space sections:
    Adrian, methane, green, sampler, population.
Words unusually infrequent or lacking for Space sections:
    Taumoeba, fuel, xenonite, away.

For the sentence count, segmentation was performed using spaCy. Tokenization simply splits on whitespace, em-dash, en-dash, and ellipsis delimiters. Unique tokens are counted case-insensitively.
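The tokenization rule described above can be sketched in a few lines of Python. This is only an illustration, not the site's actual code: the names `DELIMITERS`, `tokenize`, and `count_tokens` are hypothetical, and treating a run of three or more periods as an ellipsis (alongside the single "…" character) is an assumption.

```python
import re

# Delimiters named in the text: whitespace, em-dash, en-dash, ellipsis.
# Assumption: a run of three or more periods also counts as an ellipsis.
DELIMITERS = re.compile(r"\s+|\u2014|\u2013|\u2026|\.{3,}")


def tokenize(text):
    """Split text on the delimiters and drop the empty strings left between them."""
    return [tok for tok in DELIMITERS.split(text) if tok]


def count_tokens(text):
    """Return (number of tokens, number of unique tokens ignoring case)."""
    tokens = tokenize(text)
    unique = {tok.lower() for tok in tokens}
    return len(tokens), len(unique)
```

Because only these delimiters are used, punctuation such as a trailing period stays attached to its word, so "here." and "here" would count as two different unique tokens under this sketch.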

Speaker identification was done manually.

Unusually frequent or infrequent words are identified by the log-likelihood of their lemmas (lemmatization by spaCy).
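The page does not say which log-likelihood formula was used. As a rough sketch, the commonly used two-corpus G2 statistic for comparing a lemma's frequency in one set of sections against the rest could look like the following. The function names and the sign convention (positive for overuse in the first corpus, negative for underuse) are assumptions made for illustration.

```python
import math


def log_likelihood(a, b, c, d):
    """G2 for a lemma seen a times in c tokens versus b times in d tokens.

    Assumes c and d are positive. A zero count contributes nothing,
    following the usual convention that 0 * ln(0) is 0.
    """
    total = a + b
    expected_1 = c * total / (c + d)
    expected_2 = d * total / (c + d)
    g2 = 0.0
    for observed, expected in ((a, expected_1), (b, expected_2)):
        if observed > 0:
            g2 += observed * math.log(observed / expected)
    return 2 * g2


def signed_log_likelihood(a, b, c, d):
    """Hypothetical helper: positive if the lemma is relatively more frequent
    in the first corpus, negative if it is relatively less frequent."""
    g2 = log_likelihood(a, b, c, d)
    return g2 if a / c >= b / d else -g2
```

Under this sketch, sorting lemmas by the signed score would put candidates for "unusually frequent" at the top and candidates for "unusually infrequent or lacking" at the bottom.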