Universität Bielefeld Electronic Collections

Self-Organized Ordering of Terms and Documents in NSF Awards Data

Klami, Mikaela ; Honkela, Timo

We present the results of an analysis of a text corpus of 129,000 abstracts of NSF-sponsored basic research projects from 1990 to 2003. The methods used in the analysis include term extraction based on a reference corpus and an entropy measure, and the Self-Organizing Map algorithm for the formation of a term map and a document map. Methodologically, the basic approach builds on earlier developments, such as word category maps and the WEBSOM method, but at the level of detail, we report several new aspects and quantitative comparisons between methodological variants in this article. The data covers a fairly large proportion of US-based scientific research in recent years. The analysis results indicate the basic patterns discernible in the data, both at the level of the awards and at the level of the terminology used in them.

Keywords: text mining, term extraction, self-organizing map
Institution: Faculty of Technology, Research Groups in Informatics
DDC classification: Data processing, computer science, computer systems

Suggested Citation:
Klami, Mikaela ; Honkela, Timo (2007) Self-Organized Ordering of Terms and Documents in NSF Awards Data.

URL: http://biecoll.ub.uni-bielefeld.de/volltexte/2007/144

Questions or comments: publikationsdienste.ub@uni-bielefeld.de
Latest update: 15 Feb 2011