Welcome!

Core Information/photo.jpg|200

About Me

I'm a doctoral student at the Institute for AI and Informatics in Medicine (AIIM) at the Technical University of Munich, with a formal and computational linguistics background.

My PhD Project

My PhD project in clinical natural language processing (NLP) focuses on the differences between general and clinical German as well as the specialties of clinical subdomains and clinical document types. Moreover, I'm interested in how well large language models (LLMs) process, analyze, or generate accurate clinical texts. I'm thankful that Prof. Dr. Martin Boeker and Dr. Diego Frassinelli supervise me, and Luise Modersohn and Dr. Jacqueline Lammert advise me on my project.

Learn more about my PhD project.

GeMTeX

In the German Medical Text Corpus (GeMTeX) project, we aim to create the largest shareable collection of de-identified and semantically annotated clinical documents in German. Within that project, I'm responsible for the guidelines of semantic annotation, where relevant clinical terms are grounded with SNOMED CT concepts. Furthermore, I supervise the local annotation team and curate their results.

Learn more about GeMTeX.

Academic Background

I received my B.A. in Linguistics and my M.A. in Speech and Language Processing at the University of Konstanz. During my Bachelor's, my thesis, Influences of the Prosody on the Presuppositional Content of Wh-Questions in German, focused on the phonetics-pragmatics interface in spoken language and was supervised by Prof. Dr. Maribel Romero and Prof. Dr. Bettina Braun.
In the Speech and Language Processing Master's, I was able to dive deeper into the field of computational linguistics. I wrote my thesis,’Form2Bot’: A pipeline to automate the generation of conversational assistants from medical questionnaires, in collaboration with the Fraunhofer Institute for Manufacturing Engineering and Automation (IPA). The thesis aimed at automating the generation of anamnesis chatbots from FHIR resources. Prof. Dr. Miriam Butt and Jun.-Prof. Dr. Diego Frassinelli supervised me, Sebastian Schöning advised me at Fraunhofer IPA.

Learn more about my background in my CV.

You want to know more?

Most recent news

Date News
06/25 Our paper GerMedIQ: At the Gap Between Human and Synthetic Clinical Text has been accepted at the ACL Student Research Workshop as a poster!
06/25 Our poster presentation Introducing Medical Semantic Annotation Guidelines for German Clinical Documentation with SNOMED CT has been accepted at GMDS 2025!
06/25 Our paper When the Devil is in the Details: GeMTeX’s De-Identification Annotation moves from Sandbox to productive Routine has been accepted at GMDS 2025!
06/25 The latest GeMTeX Semantic Annotation Guidelines have been released. Reach out to me if you're interested.
05/25 Our MIE 2025 Paper German Medical NER with BERT and LLMs: The Impact of Training Data Size has been released!

Feel free to reach out to me any time, I look forward to hearing from you.

Get in Touch

Justin Hofenbitzer, M.A. Speech and Language Processing
Researcher in GeMTeX & PhD Candidate

Technical University of Munich
School of Medicine and Health 
Institute for AI and Informatics in Medicine
TUM University Hospital

Grillparzerstr. 18 / 3. OG
81675 Munich
Room 3.15

+49 89 4140 4322
justin.hofenbitzer@tum.de

www.kiinformatik.mri.tum.de
https://justinhofenbitzer.vercel.app