#21 λλ
The second of six major features of the site has now been launched, the Section Map and Section Pages.
The Section Map visualizes the different sections in the novel as rectangles, organized by chapter. The length of each rectangle indicates the size of the section in paragraphs.
You can choose whether the color of the rectangles indicates which timeline the section is on (Earth or Space) or the proportion of direct speech.
If you click on a rectangle, you are taken to the page for that specific section with information about the number of paragraphs, tokens, types, and lemmas as well as the proportion of speech and how many tokens are spoken by each character in that section.
We will likely add more data about each section over time.
A section break is marked by an extra blank line or a horizontal rule. It indicates a jump in time or a switch to another timeline. By “Space” sections we mean sections not on Earth.
These counts rely on the tokenization and lemmatization performed by the NLP library, spaCy and punctuation is included. Some manual corrections have been made.
By types we mean the unique tokens. So if the word “tube” occurs 34 times and the word “tubes” 10 times, that’s 44 tokens, 2 types (“tube” and “tubes”), and one lemma (“tube”) because “tubes” is just the plural form of “tube”.