The number of scholarly documents on the public web
- PMID: 24817403
- PMCID: PMC4015892
- DOI: 10.1371/journal.pone.0093949
The number of scholarly documents on the public web
Abstract
The number of scholarly documents available on the web is estimated using capture/recapture methods by studying the coverage of two major academic search engines: Google Scholar and Microsoft Academic Search. Our estimates show that at least 114 million English-language scholarly documents are accessible on the web, of which Google Scholar has nearly 100 million. Of these, we estimate that at least 27 million (24%) are freely available since they do not require a subscription or payment of any kind. In addition, at a finer scale, we also estimate the number of scholarly documents on the web for fifteen fields: Agricultural Science, Arts and Humanities, Biology, Chemistry, Computer Science, Economics and Business, Engineering, Environmental Sciences, Geosciences, Material Science, Mathematics, Medicine, Physics, Social Sciences, and Multidisciplinary, as defined by Microsoft Academic Search. In addition, we show that among these fields the percentage of documents defined as freely available varies significantly, i.e., from 12 to 50%.
Conflict of interest statement
Figures
is an estimate of
,the fraction of documents indexed by MAS. The total number of documents N would be
where
is the size of MAS.
Comment in
-
Need a paper? Get a plug-in.Nature. 2017 Nov 16;551(7680):399-400. doi: 10.1038/d41586-017-05922-9. Nature. 2017. PMID: 29144489 No abstract available.
Similar articles
-
Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations' COCI: a multidisciplinary comparison of coverage via citations.Scientometrics. 2021;126(1):871-906. doi: 10.1007/s11192-020-03690-4. Epub 2020 Sep 21. Scientometrics. 2021. PMID: 32981987 Free PMC article.
-
"Publish or Perish" as citation metrics used to analyze scientific output in the humanities: International case studies in economics, geography, social sciences, philosophy, and history.Arch Immunol Ther Exp (Warsz). 2008 Nov-Dec;56(6):363-71. doi: 10.1007/s00005-008-0043-0. Epub 2008 Dec 1. Arch Immunol Ther Exp (Warsz). 2008. PMID: 19043670
-
An Improved Forensic Science Information Search.Forensic Sci Rev. 2015 Jan;27(1):41-52. Forensic Sci Rev. 2015. PMID: 26227137 Review.
-
Completeness and overlap in open access systems: Search engines, aggregate institutional repositories and physics-related open sources.PLoS One. 2017 Dec 21;12(12):e0189751. doi: 10.1371/journal.pone.0189751. eCollection 2017. PLoS One. 2017. PMID: 29267327 Free PMC article.
-
Quality of nutrition related information on the internet for osteoporosis patients: a critical review.Technol Health Care. 2011;19(6):391-400. doi: 10.3233/THC-2011-0643. Technol Health Care. 2011. PMID: 22129940 Review.
Cited by
-
Open access publishing in gastroenterology: good for the researcher and good for the public!Frontline Gastroenterol. 2019 Feb 18;11(2):170-171. doi: 10.1136/flgastro-2018-101166. eCollection 2020 Mar. Frontline Gastroenterol. 2019. PMID: 32133118 Free PMC article. No abstract available.
-
MRI Robots for Needle-Based Interventions: Systems and Technology.Ann Biomed Eng. 2018 Oct;46(10):1479-1497. doi: 10.1007/s10439-018-2075-x. Epub 2018 Jun 19. Ann Biomed Eng. 2018. PMID: 29922958 Free PMC article.
-
The Question of Data Integrity in Article-Level Metrics.PLoS Biol. 2015 Aug 21;13(8):e1002161. doi: 10.1371/journal.pbio.1002161. eCollection 2015 Aug. PLoS Biol. 2015. PMID: 26296237 Free PMC article.
-
Digital Presence of Norwegian Scholars on Academic Network Sites--Where and Who Are They?PLoS One. 2015 Nov 13;10(11):e0142709. doi: 10.1371/journal.pone.0142709. eCollection 2015. PLoS One. 2015. PMID: 26565408 Free PMC article.
-
Looking into Pandora's Box: The Content of Sci-Hub and its Usage.F1000Res. 2017 Apr 21;6:541. doi: 10.12688/f1000research.11366.1. eCollection 2017. F1000Res. 2017. PMID: 28529712 Free PMC article.
References
-
- Web of Science fact page. Available: http://wokinfo.com/realfacts/qualityandquantity/.
-
- Based on the statistics reported at the homepage of Microsoft Academic Search as of January 10, 2013. Available: http://academic.research.microsoft.com.
-
- Bar-Ilan J (2008) Which h-index? a comparison of WoS, Scopus and Google Scholar. Scientometrics 74: 257–271.
-
- Bar-Ilan J (2010) Citations to the introduction to informetrics indexed byWOS, Scopus and Google Scholar. Scientometrics 82: 495–506.
-
- Björk BC, Roos A, Lauri M (2009) Scientific journal publishing—yearly volume and open access availability. Information Research 14: 391.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
