Navigation auf uzh.ch

Suche

Department of Computational Linguistics Text Technologies

Rico Sennrich

Rico Sennrich, Prof. Dr.

SNSF Professor
Tel.
+41 44 63 57131
Raumbezeichnung
AND-2.40
This is a selection of software, data and systems that I created hands-on, listed in roughly reverse chronological order.
My research team and collaborators release code and/or data for the majority of research published. The list of publications has further links to code and datasets.

Software

Nematus - an attention-based encoder-decoder model for neural machine translation

subword-nmt subword segmentation scripts for neural machine translation, including byte-pair encoding (BPE).

Zmorge - Zurich Morphological Lexicon for German

clevertagger - morphologically informed POS-tagging

Bleualign - an MT-based sentence alignment tool

ParZu - The Zurich Dependency Parser for German online demo

Data

LingEval97, a test set of contrastive translation pairs for NMT evaluation.

WMT 2016 systems Pre-trained neural models for WMT 2016 shared translation task.

WMT 2016 backtranslations Synthetic parallel data (back-translated monolingual data), used at WMT 2016.

WMT 2016 factors Linguistically annotated data sets (for factored neural MT).

WMT 2015 German treebank Dependency parses (with ParZu) of WMT 2015 training data.