ArnetMiner

AMiner (formerly ArnetMiner) is a free online service used to index, search, and mine big scientific data.


Overview

AMiner (ArnetMiner) is designed to search and perform data mining operations against academic publications on the Internet, using social network analysis to identify connections between researchers, conferences, and publications. This allows it to provide services such as expert finding, geographic search, trend analysis, reviewer recommendation, association search, course search, academic performance evaluation, and topic modeling.

AMiner was created as a research project in social influence analysis, social network ranking, and social network extraction. A number of peer-reviewed papers arising from the development of the system have been published. It has been in operation for more than three years, and has indexed 130 million researchers and more than 265 million publications. The research was funded by the Chinese National High-tech R&D Program and the National Science Foundation of China.

AMiner is commonly used in academia to identify relationships between, and draw statistical correlations about, research and researchers. It has attracted more than 10 million independent IP accesses from 220 countries and regions. The product has been used in Elsevier's SciVerse platform and by academic conferences such as SIGKDD, ICDM, PKDD, and WSDM.


Operation

AMiner automatically extracts researcher profiles from the web. It collects and identifies the relevant pages, then uses a unified approach to extract data from the identified documents. It also extracts publications from online digital libraries using heuristic rules. It then integrates the extracted researcher profiles with the extracted publications, using the researcher name as the identifier. A probabilistic framework has been proposed to deal with the name ambiguity problem that arises in this integration. The integrated data is stored in a researcher network knowledge base (RNKB). The other principal products in this area are Google Scholar, Elsevier's Scirus, and the open source project CiteSeer.
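
The integration step can be illustrated with a minimal Python sketch. It is not AMiner's implementation: the record fields (`name`, `affiliation`, `authors`, `title`) and the functions `normalize`, `integrate`, and `ambiguous_names` are hypothetical names chosen for illustration. The sketch joins profiles and publications on the researcher name and flags names shared by more than one profile. That is the case a name disambiguation method has to resolve, and the probabilistic framework itself is not reproduced here.

```python
from collections import defaultdict


def normalize(name):
    """Lower-case a name and collapse whitespace so it can serve as a join key."""
    return " ".join(name.lower().split())


def integrate(profiles, publications):
    """Join extracted profiles with extracted publications on the researcher name.

    Hypothetical record shapes (assumptions, not AMiner's schema):
      profile:     {"name": str, "affiliation": str}
      publication: {"title": str, "authors": [str, ...]}
    """
    # Index publication titles by each normalized author name.
    titles_by_name = defaultdict(list)
    for pub in publications:
        for author in pub["authors"]:
            titles_by_name[normalize(author)].append(pub["title"])

    # Attach profiles and publications to the same name key.
    knowledge_base = {}
    for profile in profiles:
        key = normalize(profile["name"])
        entry = knowledge_base.setdefault(
            key,
            {"profiles": [], "publications": list(titles_by_name.get(key, []))},
        )
        entry["profiles"].append(profile)
    return knowledge_base


def ambiguous_names(knowledge_base):
    """Return name keys shared by more than one profile (unresolved ambiguity)."""
    return sorted(
        name
        for name, entry in knowledge_base.items()
        if len(entry["profiles"]) > 1
    )


if __name__ == "__main__":
    # Made-up example data, for illustration only.
    profiles = [
        {"name": "A. Researcher", "affiliation": "University X"},
        {"name": "A. Researcher", "affiliation": "Institute Y"},
        {"name": "B. Scholar", "affiliation": "University X"},
    ]
    publications = [
        {"title": "Paper One", "authors": ["A. Researcher", "B. Scholar"]},
        {"title": "Paper Two", "authors": ["A. Researcher"]},
    ]
    kb = integrate(profiles, publications)
    print(ambiguous_names(kb))
```

With the name as the only key, both "A. Researcher" profiles end up sharing both papers, which shows why name-based integration needs a separate disambiguation step.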


History

AMiner was initiated and created by Professor Jie Tang of Tsinghua University, China. It was first launched in March 2006. The following is a list of updates over the years:
* March 2006, Version 0.1: functions include researcher profiling, expert search, conference search, and publication search. The system was developed in Perl.
* August 2006, Version 1.0: the system was re-implemented in Java.
* July 2007, Version 2.0: new functions include researcher interest mining, association search, and survey paper finding (unavailable now).
* April 2008, Version 3.0: new functions include query understanding, a new GUI, and search log analysis.
* November 2008, Version 4.0: new functions include graph search, topic modeling, and NSF/NSFC funding information extraction.
* April 2009, Version 5.0: new functions include profile editing, an open API service, Bole search, and course search (unavailable now).
* December 2009, Version 6.0: new functions include academic performance evaluation, user feedback, and conference analysis.
* May 2010, Version 7.0: new functions include name disambiguation, paper-reviewer recommendation, and ArnetPage creation.
* March 2012, Version II: renamed as AMiner; all the code was rewritten and the GUI was redesigned. New functions include geographic search and the ArnetAPP platform.
* June 2014, Version II: renamed as AMiner; all the code was rewritten and the GUI was redesigned. New functions include geographic search and the ArnetAPP platform.
* December 2015: a completely new version went online.
* May 2017: a professional version went online.
* April 2018: new functions include trend analysis and deep-learning-based name disambiguation.


Resources

AMiner has published several datasets for academic research purposes, including Open Academic Graph, DBLP+citation (a data set that augments the DBLP data from the Digital Bibliography & Library Project with citations), Name Disambiguation, and Social Tie Analysis. For more datasets and source code available for research, see "Open Data and Codes by ArnetMiner" (http://arnetminer.org/download, accessed 24 April 2012).


See also

* List of academic databases and search engines
* CiteSeerX
* Digital Bibliography & Library Project
* Google Scholar
* Microsoft Academic Search
* Scirus
* Scopus


External links


* AMiner.org (Arnetminer.org is now archived)
* AMiner.cn


Further reading

* Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, Zhong Su. Arnetminer: extraction and mining of academic social networks. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (SIGKDD'2008).
* Chi Wang, Jiawei Han, Yuntao Jia, Jie Tang, Duo Zhang, Yintao Yu, and Jingyi Guo. Mining Advisor-Advisee Relationships from Research Publication Networks. In Proceedings of the Sixteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD'2010).
* Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social Influence Analysis in Large-scale Networks. In Proceedings of the Fifteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD'2009). pp. 807–816.
* Jie Tang, Ruoming Jin, and Jing Zhang. A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search. In Proceedings of 2008 IEEE International Conference on Data Mining (ICDM'2008). pp. 1055–1060.
* Jie Tang, Limin Yao, Duo Zhang, and Jing Zhang. A Combination Approach to Web User Profiling. ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 5, no. 1, Article 2 (December 2010), 44 pages.
* Yutao Zhang, Fanjin Zhang, Peiran Yao, and Jie Tang. Name Disambiguation in AMiner: Clustering, Maintenance, and Human in the Loop. In Proceedings of the Twenty-Fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'18). pp. 1002-1011.