wikidata2df

https://img.shields.io/pypi/v/wikidata2df.svg https://img.shields.io/travis/jvfe/wikidata2df.svg Documentation Status

Utility package for easily turning a SPARQL query into a dataframe

Ever wished you could easily and programatically get data from Wikidata into a nice and analysable Pandas DataFrame? Well, this package solves that problem: With a single function you can turn your SPARQL query into a pandas DataFrame, without having to deal with the messy JSON intermediate.

Basic Usage

To install:

$ pip install wikidata2df
from wikidata2df import wikidata2df

# A SPARQL query to return all cats in Wikidata!

cat_query = """
#Cats
SELECT ?item ?itemLabel
WHERE
{
?item wdt:P31 wd:Q146.
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
"""

cats_dataframe = wikidata2df(cat_query) # Returns a Pandas DataFrame

You can also use it in the command line, if you have text file with a SPARQL query:

$ wikidata2csv -q query.rq -o query_results.csv

Alternatives

  • Maybe you want more sofisticated functions? Or a way to edit Wikidata programatically? Awesome!

  • Would you rather use R? That’s cool too!

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.