Design and Implementation of a Luganda Text Normalization Module for a Speech Synthesis Software Program

Kagumire, Sulaiman

View/Open

Undergraduate dissertation (909.9Kb)

Date

2019-06

Author

Kagumire, Sulaiman

Metadata

Show full item record

Abstract

This report investigates the problem of text normalization; specifically, the normalization of nonstandard words (NSWs) in Luganda. Non-standard words can be defined as those word tokens which do not have a dictionary entry, and cannot be pronounced using the usual letter tophoneme conversion rules. NSWs pose a challenge to the proper functioning of text to speech technology, and the solution is to spell them out in such a way that they can be pronounced appropriately. In addition to ordinary words and names, real text contains non-standard “words” (NSW), including numbers, abbreviations, dates, currency amounts and acronyms. Typically, one cannot find NSW in a dictionary, nor can one find their pronunciation by an application of ordinary “letter-to-sound” rules. Non-standard words also have a greater propensity than ordinary words to be ambiguous with respect to their interpretation or pronunciation. In many applications, it is desirable to “normalize” text by replacing the NSWs with the contextually appropriate ordinary word or sequence of words. Typical technology for text normalization involves sets of ad hoc rules tuned to handle one or two genres of text (often newspaper-style text) with the expected result that the techniques do not usually generalize well to new domains. Text normalization means converting non-standard words into standard words. Such words can be in the format of numbers, dates, time, measurements, currencies and abbreviations. Text Normalization ensures that these non-standard words are pronounced easily by a TTS system. Itis therefore an important part of any text-to-speech system because unintelligible speech is produced, especially for languages like Luganda, if text normalization is not implemented. In this report, a rule-based Luganda text normalization module that detects, classifies and verbalizes numbers, dates, time, measurements, currencies and abbreviations into Luganda words was designed and implemented using python programming language. Its implementation will enable production of intelligible speech by Luganda text-to-speech systems.

URI

http://hdl.handle.net/20.500.12281/8014

Collections

Academic submissions (CEDAT)