The gh-impact score data are available from this website, and this post explains how to obtain them as a CSV file. I used this procedure to create a CSV file that is current as of 2016-08-24, which can be downloaded here. To obtain fresh data, keep reading.
The data stored on the website are optimized for online search, so they use the JSON file format. Most researchers would probably prefer a CSV, which is much easier to load into R or Excel.
The following Python script will download the entire gh-impact database, which is stored in 256 separate JSON files, and assemble a single CSV file called
This script can be downloaded to run on your own machine. Be sure to install the requests library by running `pip install requests`. Finally, run `python download-gh-impact.py` and wait about 9 minutes for the CSV to be created.
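For readers who want a feel for what a script like this does, here is a minimal sketch of the download-and-assemble idea. It is not the actual download-gh-impact.py. The base URL, the shard naming (two-digit hex prefixes `00` to `ff`), the JSON layout (a mapping of account name to score), and the output filename are all assumptions made for illustration, so adjust them to match the real data.

```python
import csv

# Hypothetical placeholder: the real location of the JSON files is not given here.
BASE_URL = "https://example.org/gh-impact/data"

# Hypothetical output filename, chosen only for this sketch.
OUTPUT_PATH = "gh-impact-sketch.csv"


def shard_names():
    """Return 256 shard names, assuming two-digit hex prefixes 00..ff."""
    return [f"{i:02x}" for i in range(256)]


def fetch_shard(name):
    """Download one shard and return its parsed JSON.

    Assumes each shard is a JSON object mapping account name -> score.
    """
    import requests  # imported here so the helpers below work without it

    response = requests.get(f"{BASE_URL}/{name}.json", timeout=30)
    response.raise_for_status()  # stop on HTTP errors instead of writing bad rows
    return response.json()


def write_csv(shards, out_file):
    """Write one (login, gh_impact) row per account; return the row count."""
    writer = csv.writer(out_file)
    writer.writerow(["login", "gh_impact"])
    count = 0
    for shard in shards:
        # Sort within each shard so the output is deterministic.
        for login, score in sorted(shard.items()):
            writer.writerow([login, score])
            count += 1
    return count


def main():
    """Fetch every shard in turn and stream the rows into a single CSV."""
    with open(OUTPUT_PATH, "w", newline="", encoding="utf-8") as f:
        total = write_csv((fetch_shard(n) for n in shard_names()), f)
    print(f"wrote {total} rows to {OUTPUT_PATH}")
```

Calling `main()` would run the full download. Keeping `write_csv` separate from `fetch_shard` means the CSV logic can be tried on a couple of hand-made dictionaries before making 256 requests.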