Name | Modified | Size | Downloads / Week |
---|---|---|---|
README.md | 2019-11-20 | 1.8 kB | |
open_academic_graph_v2.zip | 2019-03-30 | 116.3 MB | |
Totals: 2 Items | | 116.3 MB | 0 |
Open Academic Graph
This folder contains the names of authors of academic articles and their organizations, taken from Open Academic Graph version 2, which combines two academic graphs: Microsoft Academic Graph (MAG) and AMiner. See their site for more information, including citation and license details.
Quality
There are casing errors. For example, Derek O'hagan is listed at GSK with 11 publications and 86 citations, but his LinkedIn profile capitalizes his name as O'Hagan.
Here is a more complex error. There is an entry for Iii Chester O. Baxter of Ethicon with 87 publications and 771 citations. However, his patent lists his name as Chester O. Baxter III, so the order and case are both wrong.
However, the casing is not purely algorithmic. For example, the name Joanne appears as both Joanne and JoAnne. JoAnne Yates of MIT and Joanne L. Slavin of the University of Minnesota are both capitalized correctly. Each has at least 6000 citations, which might explain why their names are better curated.
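The two error patterns described above can be flagged with a simple heuristic. The following is a minimal sketch, not part of the original extract script: the function name `flag_casing_issues`, the suffix list, and the issue labels are all assumptions made for illustration, and the check will miss other kinds of casing errors.

```python
import re
from typing import List

# Generational suffixes that should trail a name, not lead it (assumed list).
SUFFIXES = {"jr", "sr", "ii", "iii", "iv"}

# One capital letter, an apostrophe, then a lowercase letter, e.g. "O'h".
APOSTROPHE_LOWER = re.compile(r"\b[A-Z]'[a-z]")


def flag_casing_issues(name: str) -> List[str]:
    """Return labels for suspicious casing or ordering in a name (heuristic)."""
    issues = []
    if APOSTROPHE_LOWER.search(name):
        issues.append("lowercase after apostrophe")
    tokens = name.split()
    if tokens and tokens[0].rstrip(".").lower() in SUFFIXES:
        issues.append("suffix at start")
    return issues


# Names taken from the examples in this README.
for example in ["Derek O'hagan", "Iii Chester O. Baxter", "JoAnne Yates"]:
    print(example, flag_casing_issues(example))
```

Names such as JoAnne pass through unflagged, which is intended: mixed casing inside a given name cannot be judged right or wrong without an external reference.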
Data
The extract here is a subset created using this script. Modify the script to get all names or another subset.
Columns
- Name of author
- Name of last organization (usually a university)
- Number of publications
- Number of citations
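A minimal sketch of reading rows with these four columns follows. It assumes the files are comma-separated with the columns in the order listed above. Whether the files carry a header row is not stated here, so the sketch skips any row whose counts are not integers. The names `AuthorRow` and `read_authors` are hypothetical.

```python
import csv
import io
from typing import Iterator, NamedTuple, TextIO


class AuthorRow(NamedTuple):
    name: str
    organization: str
    publications: int
    citations: int


def read_authors(stream: TextIO) -> Iterator[AuthorRow]:
    """Yield one AuthorRow per well-formed CSV record in the stream."""
    for record in csv.reader(stream):
        if len(record) != 4:
            continue  # skip rows that do not have exactly four columns
        name, organization, publications, citations = record
        try:
            yield AuthorRow(name, organization, int(publications), int(citations))
        except ValueError:
            continue  # skip a header row or a row with non-numeric counts


# Sample rows built from the examples quoted in this README.
sample = io.StringIO(
    "Derek O'hagan,GSK,11,86\n"
    "Iii Chester O. Baxter,Ethicon,87,771\n"
)
for row in read_authors(sample):
    print(row.name, row.organization, row.publications, row.citations)
```

For a real file, open it with `open(path, encoding="utf-8", newline="")` and pass the handle to `read_authors`. The UTF-8 encoding is also an assumption.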
Row counts
Row count | Filename |
---|---|
306512 | aminer_authors_0.csv |
125290 | aminer_authors_1.csv |
1496715 | aminer_authors_2.csv |
1119899 | aminer_authors_3.csv |
3930507 | mag_authors_0.csv |
403232 | mag_authors_1.csv |
536051 | mag_authors_2.csv |
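After downloading, the row counts above can be used as an integrity check. The sketch below assumes the CSV files listed in the table are stored inside `open_academic_graph_v2.zip` (possibly in a subfolder), that they are UTF-8, and that each count refers to CSV records. None of this is confirmed here. The function names are hypothetical.

```python
import csv
import io
import zipfile

# Row counts copied from the table above.
EXPECTED_ROWS = {
    "aminer_authors_0.csv": 306512,
    "aminer_authors_1.csv": 125290,
    "aminer_authors_2.csv": 1496715,
    "aminer_authors_3.csv": 1119899,
    "mag_authors_0.csv": 3930507,
    "mag_authors_1.csv": 403232,
    "mag_authors_2.csv": 536051,
}


def count_csv_rows(archive, member):
    """Count CSV records in one member of an open ZipFile."""
    with archive.open(member) as raw:
        text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
        return sum(1 for _ in csv.reader(text))


def check_row_counts(zip_source, expected=EXPECTED_ROWS):
    """Return {filename: (expected, actual)} for each mismatch or missing file."""
    mismatches = {}
    with zipfile.ZipFile(zip_source) as archive:
        # Map base names to member paths in case the files sit in a subfolder.
        members = {name.rsplit("/", 1)[-1]: name for name in archive.namelist()}
        for filename, want in expected.items():
            got = None
            if filename in members:
                got = count_csv_rows(archive, members[filename])
            if got != want:
                mismatches[filename] = (want, got)
    return mismatches
```

Calling `check_row_counts("open_academic_graph_v2.zip")` would return an empty dictionary if every file matches. A missing file is reported with `None` as its actual count.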