5 - GeMTeX

General Information about the Project

Find official information about GeMTeX on the official website!

GeMTeX is a project funded by the German Medical Informatics Initiative (MII) and aims at releasing the largest real-text clinical text corpus of the German language, along with SNOMED CT annotations. Among 18 technical and methodological partners, six German University Hospitals compile, de-identify, and annotate real discharge letters, anamnesis reports, or counselings:

Charité Berlin
University Hospital Dresden
University Hospital Erlangen
University Hospital Essen
University Hospital Leipzig
TUM University Hospital Munich

The whole de-identification and annotation process is managed with INCEpTION, an open-source annotation platform. We use Averbis Health Discovery for automatic pre-annotations and ID Logic for refinement of the SNOMED CT concept annotations.

If you're interested in the project, particularly in the semantic annotation guidelines, please contact me.