Universität Bielefeld Electronic Collections animiertes Foto Universität Bielefeld

Zugang zum Dokument

Self-Organized Ordering of Terms and Documents in NSF Awards Data

Klami, Mikaela ; Honkela, Timo

We present the results of an analysis of a text corpus of 129,000 abstracts of NSF-sponsored basic research projects between years 1990 and 2003. The methods used in the analysis include term extraction based on a reference corpus and an entropy measure, and the Self-Organizing Map algorithm for the formation of a term map and a document map. Methodologically, the basic approach is based on earlier developments, such as word category maps and the WEBSOM method, but in the level of details, we report several new aspects and quantitative comparison results between methodological variants in this article. The data covers a quite large proportion of US-based scientific research during recent years. The analysis results indicate the basic patterns discernable in the data, both at the level of the awards and at the terminology used in them.

Schlagwörter: text mining, term extraction, self-organizing map
Beteiligte Einrichtung: Technische Fakultät, Arbeitsgruppen der Informatik
DDC-Sachgruppe: Datenverarbeitung, Informatik

Klami, Mikaela ; Honkela, Timo  (2007)  Self-Organized Ordering of Terms and Documents in NSF Awards Data.

URL: http://biecoll.ub.uni-bielefeld.de/volltexte/2007/144

 Fragen und Anregungen an: publikationsdienste.ub@uni-bielefeld.de
 Letzte Änderung: 15.2.2011
OPUS-Logo     OAI-zertifiziert      Universitätsbibliothek Bielefeld