Download Latest Version wikidata_person_bio-2024-01-combined.7z (93.6 MB)
Email in envelope

Get an email when there's a new version of entity-metadata

Home / open_academic_graph
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2019-11-20 1.8 kB
open_academic_graph_v2.zip 2019-03-30 116.3 MB
Totals: 2 Items   116.3 MB 0

Open Academic Graph

This folder contains the names of authors and organizations from the Open Academic Graph version 2, which is a combination of two academic graphs: Microsoft Academic Graph and AMiner. These are authors of academic articles. See their site about more information, including citation and license.

Quality

There are casing errors. For example, Derek O'hagan is listed at GSK. He has 11 publications and 86 citations, but his LinkedIn profile has has named capitalized O'Hagan.

Here is a more complex error. There is an entry for Iii Chester O. Baxter of Ethicon with 87 publications and 771 citations. However, his patent lists his name as Chester O. Baxter III, so the order and case are both wrong.

However, the casing is not all algorithmic. For example, Joanne is capitalized both Joanne and JoAnne. The persons JoAnne Yates of MIT and Joanne L. Slavin of the University of Minnesota are both correctly capitalized. Each has at least 6000 citations, so that might explain it.

Data

The extract here is a subset created using this script. Modify the script to get all names or another subset.

Columns

  • Name of author
  • Name of last organization (usually a university)
  • Number of publications
  • Number of citations

Row counts

Row count Filename
306512 aminer_authors_0.csv
125290 aminer_authors_1.csv
1496715 aminer_authors_2.csv
1119899 aminer_authors_3.csv
3930507 mag_authors_0.csv
403232 mag_authors_1.csv
536051 mag_authors_2.csv
Source: README.md, updated 2019-11-20