Improved version of “Find out which words you use often in a text“.
The older version didn’t recognize full words correctly, so go with this:
unzip -p Seminararbeit.odt content.xml|
sed 's/<[^>]*>/ /g'| sed 's/[^a-zA-Z]/ /g'|grep -Eo "[^ ]{3,}" |
sort -n|uniq -c| grep -viwf ~/worte.txt |grep -v "^[ ]*1" |sort -n
I changed the grep parameters from -vf to -viwf.
Recent Comments