Access to the Document
Building a DDC Annotated Corpus from OAI Metadata
Lösch, Mathias ; Waltinger, Ulli ; Horstmann, Wolfram ; Mehler, Alexander
The 5th International Conference on Open Repositories (OR2010)
Madrid, Spain, 6-9 July 2010
A frequently overlooked benefit of open access publications is that they are an easy accessible and cost-effective data source for research disciplines like text mining, natural language processing or computational linguistics. In those fields, linguistic data is usually managed in the form of corpora, i.e. machine readable bodies of texts that represent a particular variety of language.
||OR2010 / Posters Sessions , Dewey Decimal Classication , OAI metadata , corpus construction
||Faculty of Technology, Research Groups in Informatics
||University Library (UB)
||Library and information sciences
Building a DDC Annotated Corpus from OAI Metadata.
The 5th International Conference on Open Repositories (OR2010), Madrid, Spain, 6-9 July 2010