Development of a Sepedi-English code-switching automatic speech recognition system using connectionist temporal classification

dc.contributor.advisor: Modipa, T. I.
dc.contributor.author: Phaladi, Amanda
dc.date.accessioned: 2026-03-12T12:14:59Z
dc.date.available: 2026-03-12T12:14:59Z
dc.date.issued: 2025
dc.description: Thesis (M.Sc. (Computer Science)) -- University of Limpopo, 2025 [en_US]
dc.description.abstract: Speech technology includes several approaches and technologies that allow machines to engage with spoken language, including spoken dialog systems and automatic speech recognition. End-to-end (E2E) techniques, such as Connectionist Temporal Classification (CTC) and attention-based methods, dominate Automatic Speech Recognition (ASR) system development. However, these methodologies have primarily advanced in research for high-resourced languages with extensive speech datasets, leaving low-resource languages relatively underserved. The efficacy of the CTC method specifically for Sepedi, a low-resource language, remains uncertain. This study addresses this gap by developing and evaluating an ASR system for Sepedi-English code-switched speech. Using the Sepedi Prompted Code Switching (SPCS) corpus and applying the CTC approach, we implemented an E2E ASR system. We rigorously evaluated the system’s performance across various parameters using both the National Centre for Human Language Technology (NCHLT) Sepedi test corpus and the SPCS corpus. Our findings demonstrate promising results overall. However, the system faced challenges in accurately recognizing speech from the NCHLT Sepedi test corpus. This study shows the importance of adapting advanced ASR techniques to suit the linguistic characteristics and data limitations of low-resource languages. Addressing these challenges is crucial for expanding the applicability of speech technology to diverse linguistic contexts, ultimately facilitating broader accessibility and usability of ASR systems worldwide. [en_US]
dc.format.extent: viii, 66 leaves [en_US]
dc.identifier.uri: http://hdl.handle.net/10386/5380
dc.language.iso: en [en_US]
dc.relation.requires: PDF [en_US]
dc.subject: ASR systems [en_US]
dc.subject: Code switching [en_US]
dc.subject: Temporal classification [en_US]
dc.subject.lcsh: Code switching (Linguistics) [en_US]
dc.subject.lcsh: Automatic speech recognition [en_US]
dc.title: Development of a Sepedi-English code-switching automatic speech recognition system using connectionist temporal classification [en_US]
dc.type: Thesis [en_US]

Files

Original bundle

Name:
phaladi_a_2025.pdf
Size:
2.36 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: