Learning to clean heritage point clouds
| dc.contributor.advisor | Marais, Patrick | |
| dc.contributor.author | Hayward, Luc | |
| dc.date.accessioned | 2026-04-22T11:24:21Z | |
| dc.date.available | 2026-04-22T11:24:21Z | |
| dc.date.issued | 2023 | |
| dc.date.updated | 2026-04-22T08:39:17Z | |
| dc.description.abstract | Laser scanning technology enables accurate distance measurements between a scanner and the environment producing a grid of points in 3D space. Many laser scans can be registered together to produce a “point cloud”. This is commonly used in the Cultural Heritage domain to create highly detailed 3D models. However, the raw point cloud often contains unwanted details and artefacts which require manual removal through a time-consuming process called point cloud cleaning. Whilst previous works have shown promising results, the application of deep learning in the cultural heritage domain has not been fully explored. In this study, we investigate the use of deep learning models for the binary segmentation of semantically varied cultural heritage sites using a few-shot fine-tuning approach. By relying only on learned geomet- ric information without additional point attributes such as colour, we demonstrate the e↵ective application of modern deep learning approaches in this domain. We present a method in which the user is presented with an unlabelled scan and asked to label a percentage between 2.5-50% of the scene. This labelled data is used to train the model and predict labels for the remainder of the scene. We compare the performance of three di↵erent deep learning models (Pointnet++, KPConv, and Point Transformer) against a baseline Random Forest model. We consider factors such as speed, pretraining, and the amount of hand labelling required. Our results show that minimal human labelling is needed to provide su�cient data for modern deep learning approaches and simple scenes are e�ciently labelled using Random Forests. We achieve up to 90% mean Intersection over Union on the remaining points, with as little as 5% of the scene labelled for training, in under two hours on consumer hardware. This translates to a reduction in manual labelling e↵ort of up to 90%. Among the models tested, KPConv reliably produces good results with minimal need for human labelling. The remaining models are slower to train and required more human labelling, while the Random Forest is e↵ective only on simple scans, where a machine learning model would be ine�cient to train. Our work demonstrates the potential of deep learning methods for label-e�cient and accurate point cloud cleaning in the cultural heritage domain, reducing the time spent on manual cleaning. | |
| dc.identifier.apacitation | Hayward, L. (2023). <i>Learning to clean heritage point clouds</i>. (). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/43116 | en_ZA |
| dc.identifier.chicagocitation | Hayward, Luc. <i>"Learning to clean heritage point clouds."</i> ., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2023. http://hdl.handle.net/11427/43116 | en_ZA |
| dc.identifier.citation | Hayward, L. 2023. Learning to clean heritage point clouds. . University of Cape Town ,Faculty of Science ,Department of Computer Science. http://hdl.handle.net/11427/43116 | en_ZA |
| dc.identifier.ris | TY - Thesis / Dissertation AU - Hayward, Luc AB - Laser scanning technology enables accurate distance measurements between a scanner and the environment producing a grid of points in 3D space. Many laser scans can be registered together to produce a “point cloud”. This is commonly used in the Cultural Heritage domain to create highly detailed 3D models. However, the raw point cloud often contains unwanted details and artefacts which require manual removal through a time-consuming process called point cloud cleaning. Whilst previous works have shown promising results, the application of deep learning in the cultural heritage domain has not been fully explored. In this study, we investigate the use of deep learning models for the binary segmentation of semantically varied cultural heritage sites using a few-shot fine-tuning approach. By relying only on learned geomet- ric information without additional point attributes such as colour, we demonstrate the e↵ective application of modern deep learning approaches in this domain. We present a method in which the user is presented with an unlabelled scan and asked to label a percentage between 2.5-50% of the scene. This labelled data is used to train the model and predict labels for the remainder of the scene. We compare the performance of three di↵erent deep learning models (Pointnet++, KPConv, and Point Transformer) against a baseline Random Forest model. We consider factors such as speed, pretraining, and the amount of hand labelling required. Our results show that minimal human labelling is needed to provide su�cient data for modern deep learning approaches and simple scenes are e�ciently labelled using Random Forests. We achieve up to 90% mean Intersection over Union on the remaining points, with as little as 5% of the scene labelled for training, in under two hours on consumer hardware. This translates to a reduction in manual labelling e↵ort of up to 90%. Among the models tested, KPConv reliably produces good results with minimal need for human labelling. The remaining models are slower to train and required more human labelling, while the Random Forest is e↵ective only on simple scans, where a machine learning model would be ine�cient to train. Our work demonstrates the potential of deep learning methods for label-e�cient and accurate point cloud cleaning in the cultural heritage domain, reducing the time spent on manual cleaning. DA - 2023 DB - OpenUCT DP - University of Cape Town KW - Computer Science KW - Cultural Heritage KW - 3D space LK - https://open.uct.ac.za PB - University of Cape Town PY - 2023 T1 - Learning to clean heritage point clouds TI - Learning to clean heritage point clouds UR - http://hdl.handle.net/11427/43116 ER - | en_ZA |
| dc.identifier.uri | http://hdl.handle.net/11427/43116 | |
| dc.identifier.vancouvercitation | Hayward L. Learning to clean heritage point clouds. []. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2023 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/43116 | en_ZA |
| dc.language.iso | en | |
| dc.language.rfc3066 | eng | |
| dc.publisher.department | Department of Computer Science | |
| dc.publisher.faculty | Faculty of Science | |
| dc.publisher.institution | University of Cape Town | |
| dc.subject | Computer Science | |
| dc.subject | Cultural Heritage | |
| dc.subject | 3D space | |
| dc.title | Learning to clean heritage point clouds | |
| dc.type | Thesis / Dissertation | |
| dc.type.qualificationlevel | Masters | |
| dc.type.qualificationlevel | Masters |