EDITORIAL
11 September 2019

Set citation data free

Respondents to a Nature poll want to make their own decisions about how to interpret citation metrics. That requires data to be freely accessible.

You have full access to this article via your institution.

Download PDF

Abstract image of people icons with interlinking connections — For many papers, citation data are locked inside proprietary databases.Credit: iLexx/Getty

Whenever scientists are ranked and rewarded by metrics, such as citations, some are tempted to grab a little extra credit where they can. As we report this week, the publisher Elsevier has been investigating cases in which reviewers have repeatedly asked authors of papers to cite the reviewers’ own work.

This is not an isolated incident. Last month, we reported that some 250 highly cited scientists had amassed more than half of their citations from their own work or that of co-authors — much more than the usual proportion for their field or career stage (see Nature 572, 578–579; 2019).

Such examples should not come as a surprise, because the gaming of measurement systems is well known. In economics it is called Goodhart’s law, named after the economist Charles Goodhart, who described the concept. It was refined by the anthropologist Marilyn Strathern, and states that when a measure becomes a target, it ceases to be a good measure.

One obvious answer is for institutions and funders to just stop using citation-based metrics as a proxy for importance or quality when assessing researchers. “Stop the damn bean-counting!” one reader exclaimed in response to an online poll in Nature last month, in which we asked what — if anything — needed to be done to curb excessive self-citation. Metrics-based analysis can certainly reveal useful insights about research. But any assessment procedure that rewards scientists according to citation-based metrics alone seems designed to invite game-playing.

It can also be argued that, all things considered, excessive self-citation is a minor problem and therefore doesn’t need a particular response. Of the more than 5,000 readers who answered Nature’s poll, 10% said nothing needed to be done. “Let active researchers draw their own conclusions about self-citing researchers, and allow reputation to build naturally,” one respondent wrote.

However, most poll respondents felt that citation-based indicators are useful, but that they should be deployed in more nuanced and open ways. The most popular responses to the poll were that citation-based indicators should be tweaked to exclude self-citations, or that self-citation rates should be reported alongside other metrics (see ‘The numbers game’). On the whole, respondents wanted to be able to judge for themselves when self-citations might be appropriate, and when not; to be able to compare self-citation across fields; and more.

But this is where there is a real problem, because for many papers citation data are locked inside proprietary databases. Since 2000, more and more publishers have been depositing information about research-paper references with an organization called Crossref, the non-profit agency that registers digital object identifiers (DOIs), the strings of characters that identify papers on the web. But not all publishers allow their reference lists to be made open for anyone to download and analyse — only 59% of the almost 48 million articles deposited with Crossref currently have open references.

There is, however, a solution. Two years ago, the Initiative for Open Citations (I4OC) was established for the purpose of promoting open scholarly citation data. As of 1 September, more than 1,000 publishers were members, including Sage Publishing, Taylor and Francis, Wiley and Springer Nature — which joined last year. Publishers still to join I4OC include the American Chemical Society, Elsevier — the largest not to do so — and the IEEE.

Last January, I4OC co-founder David Shotton at the Oxford e-Research Centre, University of Oxford, UK, urged all research publishers to join the initiative (see Nature 553, 129; 2018). They should. Excessive self-citation cannot be eliminated, but free access to citation data for everyone — researchers and non-researchers — will help to illuminate some darker corners. Without more journals coming on board, these necessary efforts to analyse self-citation data will remain incomplete.

Nature 573, 163-164 (2019)

doi: https://doi.org/10.1038/d41586-019-02669-3

Reprints and permissions

Subjects

Latest on:

Researchers want a ‘nutrition label’ for academic-paper facts

Nature Index 17 APR 24

Rwanda 30 years on: understanding the horror of genocide

Editorial 09 APR 24

Three ways ChatGPT helps me in my academic writing

Career Column 08 APR 24

Researchers want a ‘nutrition label’ for academic-paper facts

Nature Index 17 APR 24

Structure peer review to make it more robust

World View 16 APR 24

US COVID-origins hearing puts scientific journals in the hot seat

News 16 APR 24

Researchers want a ‘nutrition label’ for academic-paper facts

Nature Index 17 APR 24

Adopt universal standards for study adaptation to boost health, education and social-science research

Correspondence 02 APR 24

How AI is being used to accelerate clinical trials

Nature Index 13 MAR 24

Jobs

FACULTY POSITION IN PATHOLOGY RESEARCH

Dallas, Texas (US)

The University of Texas Southwestern Medical Center (UT Southwestern Medical Center)
Postdoc Fellow / Senior Scientist

The Yakoub and Sulzer labs at Harvard Medical School-Brigham and Women’s Hospital and Columbia University

Boston, Massachusetts (US)

Harvard Medical School and Brigham and Women's Hospital
Postdoc in Computational Genomics – Machine Learning for Multi-Omics Profiling of Cancer Evolution

Computational Postdoc - Artificial Intelligence in Oncology and Regulatory Genomics and Cancer Evolution at the DKFZ - limited to 2 years

Heidelberg, Baden-Württemberg (DE)

German Cancer Research Center in the Helmholtz Association (DKFZ)
Computational Postdoc

The German Cancer Research Center is the largest biomedical research institution in Germany.

Heidelberg, Baden-Württemberg (DE)

German Cancer Research Center in the Helmholtz Association (DKFZ)
PhD / PostDoc Medical bioinformatics (m/f/d)

The Institute of Medical Bioinformatics and Systems Medicine / University of Freiburg is looking for a PhD/PostDoc Medical bioinformatics (m/w/d)

Freiburg im Breisgau, Baden-Württemberg (DE)

University of Freiburg

Set citation data free

Subjects

Latest on:

Jobs

FACULTY POSITION IN PATHOLOGY RESEARCH

Postdoc Fellow / Senior Scientist

Postdoc in Computational Genomics – Machine Learning for Multi-Omics Profiling of Cancer Evolution

Computational Postdoc

PhD / PostDoc Medical bioinformatics (m/f/d)

Search

Quick links

Related Articles

Subjects

Latest on:

Jobs

FACULTY POSITION IN PATHOLOGY RESEARCH

Postdoc Fellow / Senior Scientist

Postdoc in Computational Genomics – Machine Learning for Multi-Omics Profiling of Cancer Evolution

Computational Postdoc

PhD / PostDoc Medical bioinformatics (m/f/d)

Search

Quick links