Show simple item record

dc.contributor.advisor Mokwena, S. N.
dc.contributor.author Tlouyamma, Joseph
dc.date.accessioned 2024-09-17T10:23:53Z
dc.date.available 2024-09-17T10:23:53Z
dc.date.issued 2024
dc.identifier.uri http://hdl.handle.net/10386/4625
dc.description Thesis (Ph.D. (Computer Science)) -- University of Limpopo,2024 en_US
dc.description.abstract The Health and Demographic Surveillance System (HDSS) is a data collection system that can track crucial events such as births, deaths, and migrations in well-defined geographic areas, particularly in low- and middle-income households. HDSS tracks the life events of approximately three million people in 18 low- and middle-income African, Asian, and Oceanian nations. Having HDSSs strategically located within a country can provide a more complete picture of health-related and other social problems affecting the public. The HDSS keeps tabs on vital demographic and health indicators as well as other metrics to help shape national policies and programmes for departments of basic education, home affairs, social development, and health. However, their establishment was plagued by several difficulties, including the difficulty of obtaining high-quality data because of the use of antiquated methods or systems. The cornerstone of a wellfunctioning HDSS is high-quality, and timely health data, which is often lacking in lowand middle-income countries. There is a paucity of high-quality, disaggregated data to monitor health inequities and promote the equitable delivery of health services. HDSSs are confronted with data quality-related problems due to how data is acquired and managed. This study addresses these problems by building a data system that integrates a novel framework known as the 3-Tier Total Data Quality Management Framework (3TTDQMF). The framework manages the quality of data from the point of collection through to the storage in the database. At the core of the framework, is an automated data quality control methodology to autonomously validate and control the quality of data. Open source technologies such as Pentaho data integration (PDI), R application programming interface (R-API), Windows task scheduler, Bash and Python programming languages were used to automate and quality control the data. The experiment was set up in Hyper-converged IT infrastructure running the Windows 2016 server operating system. The results have shown that the proposed approach greatly improved the overall efficiency of the system and the quality of data. The efficiency in dealing with data quality issues was ensured through the implementation of an automated system. The research evaluated the system’s capacity to generate high-quality data using measures such as data accuracy, completeness, consistency, timeliness, and validity. All quality metrics exhibited an increasing trend, indicating that the proposed approach led to a substantial improvement in data quality. The results further demonstrated that the use of Pareto analysis and Process control techniques in data quality management can greatly improve the quality of data by identifying and monitoring the causes of data quality issues. en_US
dc.description.sponsorship South African Population Research Institute Network (SAPRIN) en_US
dc.format.extent xiii,179 leaves en_US
dc.language.iso en en_US
dc.relation.requires PDF en_US
dc.subject Application programming interface en_US
dc.subject Automated data quality management en_US
dc.subject Data collection en_US
dc.subject Data collection platforms en_US
dc.subject Data quality en_US
dc.subject Data quality metrics en_US
dc.subject Data quality management framework en_US
dc.subject Electronic data collection en_US
dc.subject Robotic process automation en_US
dc.subject Total data quality management framework en_US
dc.subject Survey Solutions en_US
dc.subject Pentaho data integration en_US
dc.subject Windows task scheduler en_US
dc.subject.lcsh Machine-readable bibliographic data -- Quality en_US
dc.subject.lcsh Electronic surveillance -- Social aspects en_US
dc.subject.lcsh Public health surveillance en_US
dc.subject.lcsh Data collection platforms en_US
dc.title Integrated and automated demographic surveillance data quality systems for rural areas en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search ULSpace


Browse

My Account