Instructors: Dr. Christoph Rühlemann
Event type:
Seminar
Org-unit: Anglistik/Amerikanistik
Displayed in timetable as:
Hours per week:
2
Language of instruction:
Englisch
Min. | Max. participants:
- | 30
Requirements and recommendations:
This seminar, which is closely related to the Hauptseminar on "Analyzing
conversation" and decidedly practical in its approach, aims to provide
participants with hands-on experience of analyzing conversation and related
speech genres. The focus is on computer-based methods for analyzing spoken
data. You will learn, for example, how to identify the most frequent words
and the most typical words in spoken texts and you will be introduced to
free online collections of spoken texts and their search functionalities.
You will also acquire basic skills in encoding texts in XML and exploiting
this annotion by using XPath and XQuery. The seminar is Schein-free.
Comment:
This seminar, which is tightly linked to my seminar on Corpus linguistics, aims to provide students with hands-on experience in constructing and exploiting small corpora. Participants will learn how to encode texts in XML, how to annotate them for linguistically relevant features, and how to exploit that annotation using XPath and XQuery, two related programming languages for XML texts. While students are free to choose their own texts, it is recommended to work on political speeches because both an XML framework and various XQuery resources are available for that particular text type. Recommended reading: Rühlemann, Christoph, Andrej Bagoutdinov and Matthew B. O'Donnell. 2015. Modest XPath and XQuery: Exploiting deep XML annotation. ICAME Journal 39.
|