Although the tag cloud seems a somewhat outdated and often criticized visualization format, I have no doubt it can be useful sometimes. And if you can create one with only a few keystrokes, that is pretty sweet. Below I’ll show a technique for extracting Twitter #hashtags, but you can apply it to virtually any text source.
Running the above command on your Twitter data extracts the top 100 most frequent hashtags. Go ahead and edit the file manually to remove irrelevant or overly frequent hashtags.
You may receive error messages like "jq: error: Cannot iterate over null". This is because some tweets don’t contain any hashtags, and jq throws an error when it tries to extract the text field. More about jq in this post.
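If you would rather skip those errors than ignore them, the same counting step can be sketched in Python. This is only a rough equivalent, not the jq command itself. It assumes your tweets are stored one JSON object per line, with hashtags under entities.hashtags[].text as in the classic Twitter API payload. The function name top_hashtags and the choice to lowercase tags before counting are mine.

```python
import json
from collections import Counter


def top_hashtags(lines, limit=100):
    """Return the `limit` most frequent hashtags as (tag, count) pairs.

    `lines` is any iterable of strings, each holding one tweet as JSON
    (assumed layout: entities.hashtags is a list of {"text": ...} objects).
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        tweet = json.loads(line)
        # Tweets without hashtags may have a missing or null field;
        # treat both cases as "no hashtags" instead of raising an error.
        entities = tweet.get("entities") or {}
        for tag in entities.get("hashtags") or []:
            text = tag.get("text")
            if text:
                # Assumption: #Python and #python count as the same tag.
                counts[text.lower()] += 1
    return counts.most_common(limit)
```

Passing an open file object as lines works, since iterating over a file yields one line at a time. The resulting pairs can then be written out and trimmed by hand, just like the output of the command-line version.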
The hashtags.txt file will look like:
Now go to Wordle Advanced and paste the contents of this file. Save as PNG and you’re done!
If you prefer a more Pythonic way, I found an excellent tutorial: A Wordcloud in Python