Saturday, January 20, 2018

#Wikipedia - entering the rabbit hole

When you start reading Wikipedia, when you continue with a next article and the next, you become part of a click stream identifying what people read and how they get there. It is hugely interesting and dumps for this click stream are available for the English, Russian, German, Spanish, and Japanese Wikipedias.

Just consider; all articles on the same subject have a Wikidata identifier. This makes it possible to aggregate these click streams. When a particular link between articles is popular in multiple Wikipedias, there is a good chance that adding a missing article will be popular as well.

It is always a question if suggestions like this will be taken up, if they indeed prove to be read more than just an average new article in a domain. That is however the subject of follow up research. In the mean time it provides an argument to collect the click streams for any and all Wikipedias. Providing educated guesses of what will be popular stimulates people to write what will be read.
Thanks,
     GerardM
Post a Comment