Corpus2Wiki: A Tool for Automatically Generating Wikiditions in Digital Humanities
SMWCon Fall 2020 | |
---|---|
Corpus2Wiki: A Tool for Automatically Generating Wikiditions in Digital Humanities | |
Talk details | |
Description: | Corpus2Wiki is a tool for generating so-called Wikiditions out of text corpora. It provides text analyses, annotations and their visualizations without requiring programming or advanced computer skills. |
Speaker(s): | Alexander Mehler, Wahed Hemati |
Type: | Talk |
Audience: | |
Event start: | 2020/11/26 13:30:00 |
Event finish: | 2020/11/26 13:50:00 |
Length: | 10 minutes |
Video: | click here |
Keywords: | natural language processing |
Give feedback |
Corpus2Wiki is a tool for generating so-calledWikiditions out of text corpora. It provides text analyses, annotations and their visualizations without requiring programming or advanced computer skills. By using TextImager as a back-end, Corpus2Wiki can automatically analyze input documents at different linguistic levels. Currently, it automatically annotates information regarding lemmatization, parts of speech, morphological information, named entities, geolocations and topic labels based on the Dewey Decimal Classification (DDC). Any results are stored and displayed by means of a modified and extended MediaWiki which makes it easy to further process texts and their annotations. The aim of this paper is to present the capabilities of Corpus2wiki, to point out the improvements made and to make suggestions for further development.