The development of accented English synthetic voices

dc.contributor.advisor: Manamela, M. J. D.
dc.contributor.advisor: Modipa, T. I.
dc.contributor.author: Malatji, Promise Tshepiso
dc.date.accessioned: 2019-11-27T05:35:41Z
dc.date.available: 2019-11-27T05:35:41Z
dc.date.issued: 2019
dc.description: Thesis (M. Sc. (Computer Science)) -- University of Limpopo, 2019 [en_US]
dc.description.abstract: A text-to-speech (TTS) synthesis system is a software system that receives text as input and produces speech as output. Native and non-native speakers of the target language can use a TTS synthesis system for, among other things, language learning and reading text aloud for people living with disabilities, e.g., physical or visual impairments. Most people relate easily to a second language spoken by a non-native speaker with whom they share a native language. Most online English TTS synthesis systems are developed using native speakers of English. This research study focuses on developing accented English synthetic voices as spoken by non-native speakers in the Limpopo province of South Africa. The Modular Architecture for Research on speech sYnthesis (MARY) TTS engine is used to develop the synthetic voices. The Hidden Markov Model (HMM) method is used to train the synthetic voices. A secondary text corpus is used to develop the training speech corpus by recording six speakers reading it. The quality of the developed synthetic voices is measured in terms of their intelligibility, similarity and naturalness using a listening test. The results are reported by evaluators’ occupation and gender, and overall. The subjective listening test indicates that the developed synthetic voices have a high level of acceptance in terms of similarity and intelligibility. Speech analysis software is used to compare the recorded synthesised speech with the human recordings. There is no significant difference in voice pitch between the speakers and the synthetic voices, except for one synthetic voice. [en_US]
dc.format.extent: xv, 107 leaves [en_US]
dc.identifier.uri: http://hdl.handle.net/10386/2917
dc.language.iso: en [en_US]
dc.publisher: University of Limpopo [en_US]
dc.relation.requires: PDF [en_US]
dc.subject: Text-to-speech synthesis system [en_US]
dc.subject: Language learning [en_US]
dc.subject.lcsh: Text data mining [en_US]
dc.subject.lcsh: Data compression (computer science) [en_US]
dc.subject.lcsh: Informal language learning [en_US]
dc.title: The development of accented English synthetic voices [en_US]
dc.type: Thesis [en_US]

Files

Original bundle

Name: malatji_pt_2019.pdf
Size: 1.72 MB
Format: Adobe Portable Document Format
Description: Thesis

License bundle

Name: license.txt
Size: 1.61 KB
Format: Item-specific license agreed upon to submission