Author-level metrics
   HOME

TheInfoList



OR:

Author-level metrics are
citation metrics Citation impact is a measure of how many times an academic journal article or book or author is cited by other articles, books or authors. Citation counts are interpreted as measures of the impact or influence of academic work and have given ris ...
that measure the bibliometric impact of individual authors, researchers, academics, and scholars. Many metrics have been developed that take into account varying numbers of factors (from only considering the total number of citations, to looking at their distribution across papers or journals using statistical or
graph-theoretic In mathematics, graph theory is the study of ''graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conne ...
principles). The main motivation for these quantitative comparisons between researchers is to allocate resources (e.g. funding, academic appointments). However, there remains controversy in the academic community as to how well author-level metrics achieve this goal. Author-level metrics differ from
journal-level metrics Journal ranking is widely used in academic circles in the evaluation of an academic journal's impact and quality. Journal rankings are intended to reflect the place of a journal within its field, the relative difficulty of being published in that ...
which attempt to measure the bibliometric impact of academic journals rather than individuals. However, metrics originally developed for academic journals can be reported at researcher level, such as the author-level eigenfactor and the author impact factor.


List of metrics


''h''-index

Formally, if ''f'' is the function that corresponds to the number of citations for each publication, we compute the ''h''-index as follows. First, we order the values of ''f'' from the largest to the lowest value. Then, we look for the last position in which ''f'' is greater than or equal to the position (we call ''h'' this position). For example, if we have a researcher with 5 publications A, B, C, D, and E with 10, 8, 5, 4, and 3 citations, respectively, the ''h''-index is equal to 4 because the 4th publication has 4 citations and the 5th has only 3. In contrast, if the same publications have 25, 8, 5, 3, and 3 citations, then the index is 3 because the fourth paper has only 3 citations.


Author-level Eigenfactor

Author-level Eigenfactor is a version of
Eigenfactor The Eigenfactor score, developed by Jevin West and Carl Bergstrom at the University of Washington, is a rating of the total importance of a scientific journal. Journals are rated according to the number of incoming citations, with citations from ...
for single authors. Eigenfactor regards authors as nodes in a network of citations. The score of an author according to this metric is his or her
eigenvector centrality In graph theory, eigenvector centrality (also called eigencentrality or prestige score) is a measure of the influence of a node in a network. Relative scores are assigned to all nodes in the network based on the concept that connections to high-sco ...
in the network.


Erdős number

It has been argued that "For an individual researcher, a measure such as
Erdős number The Erdős number () describes the "collaborative distance" between mathematician Paul Erdős and another person, as measured by authorship of mathematical papers. The same principle has been applied in other fields where a particular individual ...
captures the structural properties of the network whereas the ''h''-index captures the citation impact of the publications. One can be easily convinced that ranking in coauthorship networks should take into account both measures to generate a realistic and acceptable ranking." Several author ranking systems have been proposed already, for instance the Phys Author Rank Algorithm.


i-10-index

The i-10 index indicates the number of academic publications an author has written that have been cited by at least 10 sources. It was introduced in July 2011 by
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
as part of their work on
Google Scholar Google Scholar is a freely accessible web search engine that indexes the full text or metadata of scholarly literature across an array of publishing formats and disciplines. Released in beta in November 2004, the Google Scholar index includes ...
.


RG Score

''ResearchGate Score'' or ''RG Score'' is an author-level metric introduced by
ResearchGate ResearchGate is a European commercial social networking site for scientists and researchers to share papers, ask and answer questions, and find collaborators. According to a 2014 study by ''Nature'' and a 2016 article in ''Times Higher Education'' ...
in 2012. According to ResearchGate's CEO Dr.
Ijad Madisch Ijad Madisch (born 7 October 1980 in Wolfsburg, Germany) is a German virologist, founder and CEO of the research network ResearchGate and member of the Digital Council (''Digitalrat'') of the Cabinet of Germany (''Bundesregierung''). Personal l ...
, “ e RG Score allows real-time feedback from the people who matter: the scientists themselves.” RG Score has been reported to be correlated with existing author-level metrics and has an undisclosed calculation methodology. Two studies reported that RG Score seems to incorporate the journal impact factors into the calculation. The RG Score was found to be negatively correlated with network
centrality In graph theory and network analysis, indicators of centrality assign numbers or rankings to nodes within a graph corresponding to their network position. Applications include identifying the most influential person(s) in a social network, key ...
– users that are the most active on ResearchGate usually do not have high RG scores. It was also found to be strongly positively correlated with
Quacquarelli Symonds Quacquarelli Symonds (QS) is a British company specialising in the analysis of higher education institutions around the world. The company was founded in 1990 by Nunzio Quacquarelli. History On 5 October 2017, QS Quacquarelli Symonds acquired Ho ...
university rankings at the institutional level, but only weakly with
Elsevier Elsevier () is a Dutch academic publishing company specializing in scientific, technical, and medical content. Its products include journals such as '' The Lancet'', '' Cell'', the ScienceDirect collection of electronic journals, '' Trends'', ...
SciVal rankings of individual authors. While it was found to be correlated with different university rankings, the correlation in between these rankings themselves was higher.


Field-weighted Citation Impact

Field-weighted Citation Impact (FWCI) is an author-level metric introduced and applied by
Scopus Scopus is Elsevier's abstract and citation database launched in 2004. Scopus covers nearly 36,377 titles (22,794 active titles and 13,583 inactive titles) from approximately 11,678 publishers, of which 34,346 are peer-reviewed journals in top- ...
SciVal. FWCI equals to the total citations actually received divided by the total citations that would be expected based on the average of the considered field. FWCI of 1 means that the output performs just as expected for the global average. More than 1 means that the author outperforms the average, and less than 1 means that the author underperforms. For instance, 1.55 means 55% more likely to be cited.


''m''-index

The ''m''-index is defined as ''h''/''n'', where ''h'' is the ''h''-index and ''n'' is the number of years since the first published paper of the scientist; also called ''m''-quotient.


Individual ''h''-index

An individual ''h''-index normalized by the number of authors has been proposed: h_I = h^2/N_a^, with N_a^ being the number of authors considered in the h papers. It was found that the distribution of the ''h''-index, although it depends on the field, can be normalized by a simple rescaling factor. For example, assuming as standard the ''h''s for biology, the distribution of ''h'' for mathematics collapse with it if this ''h'' is multiplied by three, that is, a mathematician with ''h'' = 3 is equivalent to a biologist with ''h'' = 9. This method has not been readily adopted, perhaps because of its complexity. It might be simpler to divide citation counts by the number of authors before ordering the papers and obtaining the ''h''-index, as originally suggested by Hirsch.


''h''2

Three additional metrics have been proposed: ''h''2 lower, ''h''2 center, and ''h''2 upper, to give a more accurate representation of the distribution shape. The three ''h''2 metrics measure the relative area within a scientist's citation distribution in the low impact area, ''h''2 lower, the area captured by the ''h''-index, ''h''2 center, and the area from publications with the highest visibility, ''h''2 upper. Scientists with high ''h''2 upper percentages are perfectionists, whereas scientists with high ''h''2 lower percentages are mass producers. As these metrics are percentages, they are intended to give a qualitative description to supplement the quantitative ''h''-index.


''g''-index

For ''g''-index is introduced in 2006 as largest number of top g articles, which have received together at least g^2 citations.


''e''-index

The ''e''-index, the square root of surplus citations for the ''h''-set beyond ''h''2, complements the ''h''-index for ignored citations, and therefore is especially useful for highly cited scientists and for comparing those with the same ''h''-index (iso-''h''-index group).


''c''-index

The ''c''-index accounts not only for the citations but for the quality of the citations in terms of the collaboration distance between citing and cited authors. A scientist has ''c''-index ''n'' if ''n'' of is/her''N'' citations are from authors which are at collaboration distance at least ''n'', and the other (''N'' − ''n'') citations are from authors which are at collaboration distance at most ''n''.


''o''-index

The ''o''-index corresponds to the
geometric mean In mathematics, the geometric mean is a mean or average which indicates a central tendency of a set of numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometric mean is defined as the ...
of the ''h''-index and the most cited paper of a researcher.


Normalized ''h''-index

The ''h''-index has been shown to have a strong discipline bias. However, a simple normalization h/\langle h \rangle_d by the average ''h'' of scholars in a discipline ''d'' is an effective way to mitigate this bias, obtaining a universal impact metric that allows comparison of scholars across different disciplines.


RA-index

The RA-index accommodates improving the sensitivity of the ''h''-index on the number of highly cited papers and has many cited paper and uncited paper under the ''h''-core. This improvement can enhance the measurement sensitivity of the ''h''-index.


''L''-index

''L''-index combines the number of citations, the number of coauthors, the age of publications into a single value, which is independent of the number of publications and conveniently ranges from 0.0 to 9.9. With ''c'' as number of citations, ''a'' as number of authors and ''y'' as number of years, ''L''-index is defined by the formula: :L=\ln\left(\right)+1 ''L''-index is automatically calculated by the Exaly database.


''s''-index

An ''s''-index, accounting for the non-entropic distribution of citations, has been proposed and it has been shown to be in a very good correlation with ''h''.


''w''-index

''w''-index is defined as follow: if ''w'' of a researcher's papers have at least 10w citations each and the other papers have fewer than 10(w+1) citations, that researcher's ''w''‐index is ''w''.


Author Impact Factor

Author Impact Factor (AIF) is the
Impact Factor The impact factor (IF) or journal impact factor (JIF) of an academic journal is a scientometric index calculated by Clarivate that reflects the yearly mean number of citations of articles published in the last two years in a given journal, as ...
applied to authors. The AIF of an author X in year y is the mean number of citations given by papers published in year y to papers published by X in a period of \Delta y years before year y. Unlike the h-index, AIF is able to capture trends and variations of the impact of the scientific output of scientists over time, which is a growing measure taking into account the whole career path.


Additional variations of ''h''-index

There are a number of models proposed to incorporate the relative contribution of each author to a paper, for instance by accounting for the rank in the sequence of authors. A generalization of the ''h''-index and some other indices that gives additional information about the shape of the author's citation function (heavy-tailed, flat/peaked, etc.) has been proposed. Because the ''h''-index was never meant to measure future publication success, recently, a group of researchers has investigated the features that are most predictive of future ''h''-index. It is possible to try the predictions using an online tool. However, later work has shown that since ''h''-index is a cumulative measure, it contains intrinsic auto-correlation that led to significant overestimation of its predictability. Thus, the true predictability of future ''h''-index is much lower compared to what has been claimed before. The ''h''-index can be timed to analyze its evolution during one's career, employing different time windows.


Criticism

Some academics, such as physicist Jorge E. Hirsch, have praised author-level metrics as a ''"useful yardstick with which to compare, in an unbiased way, different individuals competing for the same resource when an important evaluation criterion is scientific achievement."'' However, other members of the scientific community, and even Hirsch himself have criticized them as particularly susceptible to
gaming the system Gaming the system (also rigging, abusing, cheating, milking, playing, working, or breaking the system, or gaming or bending the rules) can be defined as using the rules and procedures meant to protect a system to, instead, manipulate the system ...
. Work in bibliometrics has demonstrated multiple techniques for the manipulation of popular author-level metrics. The most used metric ''h''-index can be manipulated through self-citations, and even computer-generated nonsense documents can be used for that purpose, for example using
SCIgen SCIgen is a paper generator that uses context-free grammar to randomly generate nonsense in the form of computer science research papers. Its original data source was a collection of computer science papers downloaded from CiteSeer. All elem ...
. Metrics can also be manipulated by
coercive citation Coercive citation is an academic publishing practice in which an editor of a scientific or academic journal forces an author to add spurious citations to an article before the journal will agree to publish it. This is done to inflate the journal' ...
, a practice in which an editor of a journal forces authors to add spurious citations to their own articles before the journal will agree to publish it. Additionally, if the ''h''-index is considered as a decision criterion for research funding agencies, the game-theoretic solution to this competition implies increasing the average length of coauthors' lists.
Leo Szilard Leo Szilard (; hu, Szilárd Leó, pronounced ; born Leó Spitz; February 11, 1898 – May 30, 1964) was a Hungarian-German-American physicist and inventor. He conceived the nuclear chain reaction in 1933, patented the idea of a nuclear ...
, the inventor of the
nuclear chain reaction In nuclear physics, a nuclear chain reaction occurs when one single nuclear reaction causes an average of one or more subsequent nuclear reactions, thus leading to the possibility of a self-propagating series of these reactions. The specific nu ...
, also expressed criticism of the decision-making system for scientific funding in his book "The Voice of the Dolphins and Other Stories". Senator
J. Lister Hill Joseph Lister Hill (December 29, 1894 – December 20, 1984) was an American politician. A member of the Democratic Party, he represented Alabama in the U.S. Congress for more than forty-five years, as both a U.S. Representative (1923–1938) ...
read excerpts of this criticism in a 1962 senate hearing on the slowing of government-funded
cancer research Cancer research is research into cancer to identify causes and develop strategies for prevention, diagnosis, treatment, and cure. Cancer research ranges from epidemiology, molecular bioscience to the performance of clinical trials to evaluate an ...
. Szilard's work focuses on metrics slowing scientific progress, rather than on specific methods of gaming:
''"As a matter of fact, I think it would be quite easy. You could set up a foundation, with an annual endowment of thirty million dollars. Research workers in need of funds could apply for grants, if they could mail out a convincing case. Have ten committees, each committee, each composed of twelve scientists, appointed to pass on these applications. Take the most active scientists out of the laboratory and make them members of these committees. And the very best men in the field should be appointed as chairman at salaries of fifty thousand dollars each. Also have about twenty prizes of one hundred thousand dollars each for the best scientific papers of the year. This is just about all you would have to do. Your lawyers could easily prepare a charter for the foundation. As a matter of fact, any of the National Science Foundation bills which were introduced in the Seventy-ninth and Eightieth Congress could perfectly well serve as a model."'' ''"First of all, the best scientists would be removed from their laboratories and kept busy on committees passing on applications for funds. Secondly the scientific workers in need of funds would concentrate on problems which were considered promising and were pretty certain to lead to publishable results. For a few years there might be a great increase in scientific output; but by going after the obvious, pretty soon science would dry out. Science would become something like a parlor game. Somethings would be considered interesting, others not. There would be fashions. Those who followed the fashions would get grants. Those who wouldn’t would not, and pretty soon they would learn to follow the fashion, too."''


See also

*
Academic age The academic age is the time that a scientist has been in the research field and performed active research. The academic age of a scientist may be computed as the span of years from their first published work up until the present. Another definiti ...
*
Scientometrics Scientometrics is the field of study which concerns itself with measuring and analysing scholarly literature. Scientometrics is a sub-field of informetrics. Major research issues include the measurement of the impact of research papers and academ ...


References


Further reading

* * * * {{Academic publishing Academic publishing Citation metrics Game theory Cheating in science