The database relies on the following software:
- SPARQL for queries
- Oxigraph for webpage serving
- OpenAlex snapshots for data
Database
Structure
The database is synced from the OpenAlex AWS snapshot.
The data is downloaded as JSONL files and then converted into the graph database.
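The conversion step can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual converter: the predicate namespace `https://example.org/openalex/`, the helper names `iter_records`, `literal` and `author_to_ntriples`, and the sample author ID are all hypothetical, and only a few scalar author fields are mapped.

```python
import json
from typing import Iterable, Iterator

# Hypothetical predicate namespace; the real converter may use another vocabulary.
PREDICATE_BASE = "https://example.org/openalex/"


def iter_records(lines: Iterable[str]) -> Iterator[dict]:
    """Parse JSONL input: one JSON object per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def literal(value) -> str:
    """Render a Python scalar as an N-Triples literal."""
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return f'"{str(value).lower()}"^^<http://www.w3.org/2001/XMLSchema#boolean>'
    if isinstance(value, int):
        return f'"{value}"^^<http://www.w3.org/2001/XMLSchema#integer>'
    # json.dumps escapes quotes, backslashes and control characters.
    return json.dumps(str(value), ensure_ascii=False)


def author_to_ntriples(record: dict) -> list[str]:
    """Turn a few scalar fields of one author record into N-Triples lines."""
    subject = f"<{record['id']}>"
    triples = []
    for key in ("display_name", "works_count", "cited_by_count"):
        value = record.get(key)
        if value is not None:  # skip missing or null fields
            triples.append(f"{subject} <{PREDICATE_BASE}{key}> {literal(value)} .")
    return triples


# Made-up record for illustration; real lines come from the snapshot files.
sample = '{"id": "https://openalex.org/A0000000000", "display_name": "Jane Doe", "works_count": 3}'
for record in iter_records([sample]):
    for triple in author_to_ntriples(record):
        print(triple)
```

The resulting N-Triples lines could then be bulk-loaded into the graph store. Nested fields such as `summary_stats` or `counts_by_year` would need their own mapping, which this sketch leaves out.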
Authors
The snapshot JSON has the following structure:
- id: OpenAlex author accession link
- orcid: todo
- display_name: the name of the author
- display_name_alternatives: list of alternative names
- works_count: the number of works
- cited_by_count: number of times the author is cited by other works
- most_cited_work: the name of the most cited work
- summary_stats: summary statistics
  - 2yr_mean_citedness: average citedness over two years
  - h_index: todo
  - i10_index: todo
  - oa_percent: todo
  - works_count: number of works
  - cited_by_count: number of times the author is cited by other works
  - 2yr_works_count: todo
  - 2yr_cited_by_count: todo
  - 2yr_i10_index: todo
  - 2yr_h_index: todo
- ids: TODO
- last_known_institution: the last known institution
  - id: OpenAlex institution accession link
  - ror: todo
  - display_name: name of the institution
  - country_code: country code of the institution
  - type: type of the institution
- counts_by_year: list of work counts by year
  - year: publication year of the works
  - works_count: number of works published
  - oa_works_count: todo
  - cited_by_count: todo
- x_concepts: todo
- works_api_url: todo
- updated_date: todo
- created_date: todo
- updated: todo
Autogenerated
- id: string
- orcid: nothing
- display_name: string
- display_name_alternatives: list
- works_count: int
- cited_by_count: int
- most_cited_work: string
- summary_stats:
  - 2yr_mean_citedness: int
  - h_index: int
  - i10_index: int
  - oa_percent: int
  - works_count: int
  - cited_by_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- ids:
  - openalex: string
- last_known_institution:
  - id: string
  - ror: string
  - display_name: string
  - country_code: string
  - type: string
- counts_by_year:
  - year: int
  - works_count: int
  - oa_works_count: int
  - cited_by_count: int
- x_concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
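The type names in the listing above (`string`, `int`, `float`, `list`, `bool`, `nothing`) suggest that it was derived from a sample record. How it was actually generated is not stated here; the sketch below shows one assumed way to produce such a listing from a parsed JSON record. The function names `describe` and `render` and the sample record are hypothetical.

```python
import json


def describe(value):
    """Map a parsed JSON value to a type name, recursing into objects."""
    if value is None:
        return "nothing"
    if isinstance(value, bool):  # check bool before int: bool is a subclass of int
        return "bool"
    if isinstance(value, int):
        return "int"
    if isinstance(value, float):
        return "float"
    if isinstance(value, str):
        return "string"
    if isinstance(value, dict):
        return {key: describe(item) for key, item in value.items()}
    if isinstance(value, list):
        # Assumption: a list of objects is summarised by its first element;
        # any other list (including an empty one) is reported as "list".
        if value and isinstance(value[0], dict):
            return describe(value[0])
        return "list"
    raise TypeError(f"unexpected JSON value: {value!r}")


def render(schema, indent=0):
    """Yield one 'key: type' line per field, indenting nested fields."""
    for key, kind in schema.items():
        if isinstance(kind, dict):
            yield f"{' ' * indent}{key}:"
            yield from render(kind, indent + 2)
        else:
            yield f"{' ' * indent}{key}: {kind}"


# Made-up record for illustration only.
record = json.loads(
    '{"id": "https://openalex.org/A0000000000", "orcid": null, "works_count": 3,'
    ' "counts_by_year": [{"year": 2023, "works_count": 1}]}'
)
print("\n".join(render(describe(record))))
```

Types inferred from a single record only reflect that record. This may be why a field such as `2yr_mean_citedness` is listed as `int` for some entity types and as `float` for others, and why fields that were null in the sample appear as `nothing`.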
Concepts
Autogenerated
- id: string
- wikidata: string
- display_name: string
- level: int
- description: string
- works_count: int
- cited_by_count: int
- summary_stats:
  - 2yr_mean_citedness: int
  - h_index: int
  - i10_index: int
  - oa_percent: int
  - works_count: int
  - cited_by_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- ids:
  - openalex: string
  - wikidata: string
  - wikipedia: string
  - mag: int
- image_url: nothing
- image_thumbnail_url: nothing
- international:
  - display_name (one string per language code): ar, be, bn, ca, cs, de, en, eo, es, et, fa, fi, fr, he, hu, hy, it, ja, kk, kk-arab, kk-cyrl, kk-latn, ky, nb, nl, oc, pl, ru, sl, sr, sv, ta, uk, uz, vi, zh, zh-hans, zh-hant, zh-hk
  - description (one string per language code): bn, ca, de, en, fr, ru, sr
- ancestors:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
- related_concepts:
  - id: string
  - wikidata: nothing
  - display_name: string
  - level: int
  - score: float
- counts_by_year: list
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Domains
Autogenerated
- id: string
- display_name: string
- description: string
- display_name_alternatives: list
- ids:
  - wikidata: string
  - wikipedia: string
- fields:
  - id: string
  - display_name: string
- siblings:
  - id: string
  - display_name: string
- works_count: int
- cited_by_count: int
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Fields
Autogenerated
- id: string
- display_name: string
- description: string
- display_name_alternatives: list
- ids:
  - wikidata: string
  - wikipedia: string
- domain:
  - id: string
  - display_name: string
- subfields:
  - id: string
  - display_name: string
- siblings:
  - id: string
  - display_name: string
- works_count: int
- cited_by_count: int
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Funders
Autogenerated
- id: string
- display_name: string
- ids:
  - openalex: string
  - wikidata: string
  - ror: string
  - crossref: int
  - doi: string
- alternate_titles: list
- country_code: string
- description: string
- homepage_url: string
- image_url: string
- image_thumbnail_url: string
- roles:
  - role: string
  - id: string
  - works_count: int
- grants_count: int
- works_count: int
- cited_by_count: int
- summary_stats:
  - 2yr_mean_citedness: float
  - h_index: int
  - i10_index: int
  - oa_percent: float
  - works_count: int
  - cited_by_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- counts_by_year:
  - year: int
  - works_count: int
  - oa_works_count: int
  - cited_by_count: int
- x_concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- updated_date: string
- created_date: string
- updated: string
Institutions
Autogenerated
- id: string
- ror: string
- display_name: string
- country_code: string
- type: string
- homepage_url: string
- image_url: string
- image_thumbnail_url: string
- display_name_acronyms: list
- display_name_alternatives: list
- works_count: int
- cited_by_count: int
- summary_stats:
  - 2yr_mean_citedness: float
  - h_index: int
  - i10_index: int
  - oa_percent: float
  - works_count: int
  - cited_by_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- ids:
  - openalex: string
  - ror: string
  - grid: string
  - wikipedia: string
  - wikidata: string
  - mag: int
- roles:
  - role: string
  - id: string
  - works_count: int
- repositories: list
- geo:
  - city: nothing
  - geonames_city_id: nothing
  - region: nothing
  - country_code: string
  - country: nothing
  - latitude: float
  - longitude: float
- international:
  - display_name (one string per language code): ar, azb, be, bg, br, ca, cs, cy, de, el, en, en-gb, eo, es, et, eu, fa, fr, gl, he, hu, hy, id, it, ja, ko, lb, nb, nl, nn, pl, pt, ro, ru, sk, sv, tl, uk, vi, zh, zh-hant, zh-hk
- associated_institutions:
  - id: string
  - ror: string
  - display_name: string
  - country_code: string
  - type: string
  - relationship: string
- counts_by_year:
  - year: int
  - works_count: int
  - oa_works_count: int
  - cited_by_count: int
- x_concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Merged_ids
Publishers
Autogenerated
- id: string
- display_name: string
- ids:
  - openalex: string
- alternate_titles: list
- parent_publisher: nothing
- lineage: list
- hierarchy_level: int
- country_codes: list
- homepage_url: nothing
- image_url: nothing
- image_thumbnail_url: nothing
- roles:
  - role: string
  - id: string
  - works_count: int
- works_count: int
- cited_by_count: int
- sources_count: int
- summary_stats:
  - 2yr_mean_citedness: int
  - h_index: int
  - i10_index: int
  - oa_percent: int
  - works_count: int
  - cited_by_count: int
  - sources_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- counts_by_year:
  - year: int
  - works_count: int
  - oa_works_count: int
  - cited_by_count: int
- x_concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- sources_api_url: string
- updated_date: string
- created_date: string
- updated: string
Sources
Autogenerated
- id: string
- issn_l: string
- issn: list
- display_name: string
- publisher: string
- host_organization: string
- host_organization_name: string
- host_organization_lineage: list
- host_organization_lineage_names: list
- is_oa: bool
- is_in_doaj: bool
- host_institution_lineage: list
- host_institution_lineage_names: list
- publisher_lineage: list
- publisher_lineage_names: list
- publisher_id: string
- type: string
- works_count: int
- cited_by_count: int
- summary_stats:
  - 2yr_mean_citedness: float
  - h_index: int
  - i10_index: int
  - oa_percent: float
  - works_count: int
  - cited_by_count: int
  - 2yr_works_count: int
  - 2yr_cited_by_count: int
  - 2yr_i10_index: int
  - 2yr_h_index: int
- alternate_titles: list
- abbreviated_title: string
- homepage_url: string
- country_code: string
- ids:
  - openalex: string
  - issn_l: string
  - issn: list
  - fatcat: string
  - wikidata: string
- apc_prices:
  - price: int
  - currency: string
- apc_usd: int
- societies: list
- counts_by_year:
  - year: int
  - works_count: int
  - oa_works_count: int
  - cited_by_count: int
- x_concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Subfields
Autogenerated
- id: string
- display_name: string
- description: string
- display_name_alternatives: list
- ids:
  - wikidata: string
  - wikipedia: string
- field:
  - id: string
  - display_name: string
- domain:
  - id: string
  - display_name: string
- topics:
  - id: string
  - display_name: string
- siblings:
  - id: string
  - display_name: string
- works_count: int
- cited_by_count: int
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Topics
Autogenerated
- id: string
- display_name: string
- subfield:
  - id: string
  - display_name: string
- field:
  - id: string
  - display_name: string
- domain:
  - id: string
  - display_name: string
- description: string
- keywords: list
- ids:
  - openalex: string
  - wikipedia: string
- siblings:
  - id: string
  - display_name: string
- works_count: int
- cited_by_count: int
- works_api_url: string
- updated_date: string
- created_date: string
- updated: string
Works
Autogenerated
- id: string
- doi: string
- doi_registration_agency: string
- display_name: nothing
- title: nothing
- publication_year: int
- publication_date: string
- language: nothing
- ids:
  - openalex: string
  - doi: string
- primary_location:
  - source: nothing
  - pdf_url: nothing
  - landing_page_url: string
  - is_oa: bool
  - version: nothing
  - license: nothing
  - doi: string
- best_oa_location: nothing
- type: string
- open_access:
  - is_oa: bool
  - oa_status: string
  - oa_url: nothing
  - any_repository_has_fulltext: bool
- authorships: list
- corresponding_author_ids: list
- corresponding_institution_ids: list
- cited_by_count: int
- summary_stats:
  - cited_by_count: int
  - 2yr_cited_by_count: int
- biblio:
  - volume: nothing
  - issue: nothing
  - first_page: nothing
  - last_page: nothing
- is_retracted: bool
- is_paratext: bool
- concepts:
  - id: string
  - wikidata: string
  - display_name: string
  - level: int
  - score: float
- mesh: list
- locations_count: int
- locations:
  - source: nothing
  - pdf_url: nothing
  - landing_page_url: string
  - is_oa: bool
  - version: nothing
  - license: nothing
  - doi: string
- referenced_works: list
- referenced_works_count: int
- sustainable_development_goals: list
- grants: list
- apc_list: nothing
- apc_paid: nothing
- related_works: list
- abstract_inverted_index: nothing
- counts_by_year: list
- cited_by_api_url: string
- updated_date: string
- created_date: string
- updated: string
- authors_count: int
- concepts_count: int
Usage
You can query the database using the SPARQL query language here.
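As a rough illustration, a query could also be sent from a script through the SPARQL 1.1 Protocol. The endpoint URL below is an assumption (Oxigraph's server listens on `http://localhost:7878/query` by default, and the deployed address may differ). The predicate IRI is a hypothetical placeholder, since the vocabulary used by this database is not described here. The helper names are likewise illustrative.

```python
import json
import urllib.request

# Assumed endpoint: replace with the address of the deployed database.
ENDPOINT = "http://localhost:7878/query"

# Hypothetical predicate IRI; substitute the vocabulary the database actually uses.
QUERY = """
SELECT ?author ?name WHERE {
  ?author <https://example.org/openalex/display_name> ?name .
}
LIMIT 10
"""


def build_request(endpoint: str, query: str) -> urllib.request.Request:
    """Build a SPARQL 1.1 Protocol POST request carrying the query in the body."""
    return urllib.request.Request(
        endpoint,
        data=query.encode("utf-8"),
        headers={
            "Content-Type": "application/sparql-query",
            "Accept": "application/sparql-results+json",
        },
        method="POST",
    )


def parse_bindings(body: str) -> list[dict]:
    """Flatten SPARQL JSON results into one {variable: value} dict per row."""
    rows = json.loads(body)["results"]["bindings"]
    return [{var: cell["value"] for var, cell in row.items()} for row in rows]


def run_query(endpoint: str = ENDPOINT, query: str = QUERY) -> list[dict]:
    """Send the query and return the result rows (needs a reachable endpoint)."""
    with urllib.request.urlopen(build_request(endpoint, query)) as response:
        return parse_bindings(response.read().decode("utf-8"))
```

Calling `run_query()` against a running endpoint would return one dictionary per result row, keyed by the variable names in the `SELECT` clause.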