Most common words in a document


Improved version of “Find out which words you use often in a text“.

The older version didn’t recognize full words correctly, so go with this:

unzip -p Seminararbeit.odt content.xml|
sed 's/<[^>]*>/ /g'| sed 's/[^a-zA-Z]/ /g'|grep -Eo "[^ ]{3,}" |
sort -n|uniq -c| grep -viwf ~/worte.txt |grep -v "^[ ]*1" |sort -n

I changed the grep parameters from -vf to -viwf.

  1. No comments yet.
(will not be published)

  1. No trackbacks yet.