JobMesh

ASR Linguistics Intern

Cerence · Aachen, North Rhine-Westphalia, DE

A Moving Experience. Motivation Statistical and neural network ASR systems normalize text before training models with it. It works in two directions: our ‘to...

Job description

A Moving Experience. Motivation: Statistical and neural network ASR systems normalize text before training models with it. It works in two directions: our ‘tokenizer’ not only splits on whitespace, but also normalizes text as it is spoken. Numbers in digits, dates, time expressions etc. will be written out in words, spelling will be normalized, and some other minor rewrites will be done in order to bring all training material into our token space. On the other hand, the end user does not want to see ‘first of July nineteen ninety two’, so will need the inverse normalization to ‘July 1, 1992’ as a step we call ‘formatting’. Currently both the tokenizer and the formatter are largely driven by grammar rules. Both require the input to be in a fixed format, normal text for the tokenizer and in ‘token space’ for the formatter. However, nowadays we are dealing with hybrid systems more and more and the output of these might not necessarily be the expected input. On top of this, we are increasingly dealing with multilingual systems where the input can be in several languages. Target Languages & Phenomena: - Monolingual systems: English (EN) and a few other (European, non-agglutinative) lang...