bulletin4corpus: Parallel corpus created from the Credit Suisse Bulletins
A manually corrected corpus with part-of-speech tagsof approx. 62.000 tokens (Language: German; Domain: Reports about the University of Zurich; PoS-Tagset: STTS)
4561 German test cases for PP-attachment from the computer magazine used in the habilitation: Martin Volk: The automatic resolution of prepositional phrase attachment ambiguities in German. University of Zurich. 2001.
3000 sentences annotated in the NEGRA format (computer magazine). Please contact Martin Volk.
The German-language thesaurus UniNet, which comprises approx. 20'000 nouns in the WordNet format belonging to the domain of (Swiss) university terminology. For information, please contact Simon Clematide.
The Gold Standard corpus of temporal annotations comprising approx. 34,000 tokens. The corpus contains 50 historical legal texts in Early New High German from the Collection of the Swiss Law Sources Foundation.