[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Subscribe]
No SubjectFrom: S t e v e _ L a w r e n c e (Steve Lawrence) Date: Tue, 15 Feb 2000 19:08:24 -0500 <lawrence@research.nj.nec.com> To: datamine-l@nautilus-sys.com Subject: DM: Paper available: Indexing and Retrieval of Scientific Literature Message-ID: <20000215190824.A28119@research.nj.nec.com> Sender: owner-datamine-l@nautilus-sys.com Precedence: bulk The following paper discusses ResearchIndex (CiteSeer). ResearchIndex is the world's largest free full-text index of scientific literature. http://www.neci.nec.com/~lawrence/papers.html http://citeseer.nj.nec.com/details/lawrence99indexing.html Indexing and Retrieval of Scientific Literature Steve Lawrence, Kurt Bollacker, C. Lee Giles NEC Research Institute The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher homepages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use. -- Steve Lawrence - http://www.neci.nec.com/~lawrence/ http://csindex.com/ - 250,000+ computer science papers
|
MHonArc
2.2.0