Universität Bielefeld Electronic Collections animiertes Foto Universität Bielefeld

Access to the Document



Building a DDC Annotated Corpus from OAI Metadata

Lösch, Mathias ; Waltinger, Ulli ; Horstmann, Wolfram ; Mehler, Alexander

The 5th International Conference on Open Repositories (OR2010)
Madrid, Spain, 6-9 July 2010

Download files:
File 1; File 2;

Abstract:
A frequently overlooked benefit of open access publications is that they are an easy accessible and cost-effective data source for research disciplines like text mining, natural language processing or computational linguistics. In those fields, linguistic data is usually managed in the form of corpora, i.e. machine readable bodies of texts that represent a particular variety of language.


Keywords: OR2010 / Posters Sessions , Dewey Decimal Classi cation , OAI metadata , corpus construction
Institution: Faculty of Technology, Research Groups in Informatics
Institution: University Library (UB)
DDC classification: Library and information sciences

Suggested Citation:
Building a DDC Annotated Corpus from OAI Metadata. The 5th International Conference on Open Repositories (OR2010), Madrid, Spain, 6-9 July 2010


URL: http://biecoll.ub.uni-bielefeld.de/volltexte/2011/5151



 Questions or comments: publikationsdienste.ub@uni-bielefeld.de
 Latest update: 15 Feb 2011
 Legal Notice
OPUS-Logo     OAI compliant      BU Logo
OAI-Logo