Universität Bielefeld Electronic Collections animiertes Foto Universität Bielefeld

Access to the full text



Dimensionality Reduction of very large document collections by Semantic Mapping

Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda



Download the full text

Abstract:
This paper describes improving in Semantic Mapping, a feature extraction method useful to dimensionality reduction of vectors representing documents of large text collections. This method may be viewed as a specialization of the Random Mapping, method proposed in WEBSOM project. Semantic Mapping, Random Mapping and Principal Component Analysis (PCA) are applied to categorization of document collections using Self-Organizing Maps (SOM). Semantic Mapping generated document representation as good as PCA and much better than Random Mapping.


Keywords: Document Clustering, Dimensionality Reduction, Semantic Mapping
Institution: Faculty of Technology, Research Groups in Informatics
DDC classification: Data processing, computer science, computer systems

Suggested Citation:
Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda  (2007)  Dimensionality Reduction of very large document collections by Semantic Mapping.
URL: http://biecoll.ub.uni-bielefeld.de/volltexte/2007/133



 Questions or comments: ecollections.ub@uni-bielefeld.de
 Latest update: 07 Jul 2009
 Imprint
OPUS-Logo     OAI compliant      UB-Logo
OAI-Logo