Show simple item record

dc.contributor.advisor Modipa, T. I.
dc.contributor.author Phaladi, Amanda
dc.date.accessioned 2026-03-12T12:14:59Z
dc.date.available 2026-03-12T12:14:59Z
dc.date.issued 2025
dc.identifier.uri http://hdl.handle.net/10386/5380
dc.description Thesis (M.Sc. (Computer Science)) -- University of Limpopo, 2025 en_US
dc.description.abstract Speech technology includes several approaches and technologies that allow ma- chines to engage with spoken language, which include spoken dialog systems and automatic speech recognition. The end-to-end (E2E) techniques, such as Connec- tionist Temporal Classification (CTC) and attention-based methods, dominate Auto- matic Spdeech Recognition (ASR) system development. However, these methodolo- gies have primarily advanced in research for high-resourced languages with exten- sive speech datasets, leaving low-resource languages relatively underserved. The efficacy of the CTC method specifically for Sepedi, a low-resource language, remains uncertain. This study addresses this gap by developing and evaluating an automatic speech recognition (ASR) system for Sepedi-English code-switched speech. Utilizing the Se- pedi Prompted Code Switching (SPCS) corpus and applying the CTC approach, we implemented an E2E ASR system. We rigorously evaluated the system’s performance across various parameters using both the National Centre for Human Language Tech- nology (NCHLT) Sepedi test corpus and the Sepedi Prompted Code Switching corpus. Our findings demonstrate promising results overall. However, the system faced challenges in accurately recognizing speech from the Sepedi NCHLT test corpus. This study shows the importance of adapting advanced ASR techniques to suit the linguistic characteristics and data limitations of low-resource languages. Addressing these challenges is crucial for expanding the applicability of speech technology to diverse linguistic contexts, ultimately facilitating broader accessibility and usability of ASR systems worldwide. en_US
dc.format.extent viii, 66 leaves en_US
dc.language.iso en en_US
dc.relation.requires PDF en_US
dc.subject ASR systems en_US
dc.subject Code switching en_US
dc.subject Temporal classification en_US
dc.subject.lcsh Code switching (Linguistics) en_US
dc.subject.lcsh Automatic speech recognition en_US
dc.title Development of a Sepedi-English code-switching automatic speech recognition system using connectionist temporal classification en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search ULSpace


Browse

My Account