Use Stylometry to Identify Authors

Computational text analysis with Python and NLTK

Lee Vaughan
Towards AI
Published in
22 min readFeb 21, 2024

--

A mashup poster of the novels The Hound of the Baskervilles, The War of the Worlds, and The Lost World.
Mashup poster of three novels by Dall-e-3 and the author

Stylometry is the quantitative study of literary style through computational text analysis. It’s based on the idea that we all have a unique, consistent, and recognizable style in our writing. This includes our vocabulary, our use of punctuation, the average length of our sentences and words, and so on.

--

--

Author of “Python Tools for Scientists,” “Impractical Python Projects,” and “Real World Python.” Former Senior Principal Scientist for ExxonMobil.