Over the last couple of weeks a new data extract for the wikipedia web services was implemented and deployed. The major change is certainly the dramatically increased number of geo located wikipedia articles.
A new attribute ‘rank‘ has been added to the xml and json responses. It gives an indication of the popularity or relevancy of an article. The rank is an integer number from ‘1‘ for the least popular articles to ‘100‘ for the most popular articles. It is calculated from the number of links pointing to an article and the article length. The articles are more or less evenly distributed over the 100 ranks.
The ‘elevation‘ field is now filled for nearly all articles, where no elevation could be parsed from the article itself it was enhanced with a reverse geocoded value from srtm3 or aster. The ‘countryCode‘ coverage has also been improved. The attributes ‘population‘ and ‘elevation‘ are no longer set to ‘0’ for unknown values, they are left empty instead.
3 thoughts on “Wikipedia Web Services”
[…] обновила веб-сервис доступа к статьям с […]
I noticed their is still a lot of geo located wikipedia articles missing, has the data been loaded yet or still in testing?
the data has been loaded weeks ago. articles with weird geo templates are not included as the parser will miss them. It is a pity that people continue to invent new templates instead of using the exiting ones.