After AMR's critique, I felt compelled to try again and limit each word to appearing only once. I updated the code to handle basic contractions and better deal with embedded HTML. What follows are the most used five words for each person that are also not the among anyone else's most used five words. For most people, the five words rank among their top 15-25 words.
Tag Archives: word fingerprint
WGOM Fingerprint Words
After seeing that fingerprint word article, it got me curious about the fingerprint words of the Citizens here. The long list below is not quite that, but it is close. What follows are the five most used words for each Citizen after removing the 250 most common words of the site. Due to the simple word extraction I used, contractions lose their apostrophe. That means "wouldn't" will show up as "wouldn".