The development of an automatic pronunciation assistant

dc.contributor.advisorManamela, M. J. D.
dc.contributor.advisorModipa, T. I.
dc.contributor.authorSefara, Tshephisho Joseph
dc.date.accessioned2019-11-20T08:05:36Z
dc.date.available2019-11-20T08:05:36Z
dc.date.issued2019
dc.descriptionThesis (M. Sc. (Computer Science)) -- University of Limpopo, 2019en_US
dc.description.abstractThe pronunciation of words and phrases in any language involves careful manipulation of linguistic features. Factors such as age, motivation, accent, phonetics, stress and intonation sometimes cause a problem of inappropriate or incorrect pronunciation of words from non-native languages. Pronunciation of words using different phonological rules has a tendency of changing the meaning of those words. This study presents the development of an automatic pronunciation assistant system for under-resourced languages of Limpopo Province, namely, Sepedi, Xitsonga, Tshivenda and isiNdebele. The aim of the proposed system is to help non-native speakers to learn appropriate and correct pronunciation of words/phrases in these under-resourced languages. The system is composed of a language identification module on the front-end side and a speech synthesis module on the back-end side. A support vector machine was compared to the baseline multinomial naive Bayes to build the language identification module. The language identification phase performs supervised multiclass text classification to predict a person’s first language based on input text before the speech synthesis phase continues with pronunciation issues using the identified language. The speech synthesis on the back-end phase is composed of four baseline text-to-speech synthesis systems in selected target languages. These text-to-speech synthesis systems were based on the hidden Markov model method of development. Subjective listening tests were conducted to evaluate the performance of the quality of the synthesised speech using a mean opinion score test. The mean opinion score test obtained good performance results on all targeted languages for naturalness, pronunciation, pleasantness, understandability, intelligibility, overall quality of the system and user acceptance. The developed system has been implemented on a “real-live” production web-server for performance evaluation and stability testing using live data.en_US
dc.format.extentxxii, 173 leavesen_US
dc.identifier.urihttp://hdl.handle.net/10386/2906
dc.language.isoenen_US
dc.relation.requiresPDFen_US
dc.subjectPronunciationen_US
dc.subjectLinguisticen_US
dc.subjectLanguage identificationen_US
dc.subject.lcshSpeech synthesisen_US
dc.subject.lcshComputational linguisticsen_US
dc.subject.lcshMachine translatingen_US
dc.titleThe development of an automatic pronunciation assistanten_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
sefara_tj_2019.pdf
Size:
3.1 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: