Learning to clean heritage point clouds

Hayward, Luc

Learning to clean heritage point clouds

dc.contributor.advisor	Marais, Patrick
dc.contributor.author	Hayward, Luc
dc.date.accessioned	2026-04-22T11:24:21Z
dc.date.available	2026-04-22T11:24:21Z
dc.date.issued	2023
dc.date.updated	2026-04-22T08:39:17Z
dc.description.abstract	Laser scanning technology enables accurate distance measurements between a scanner and the environment producing a grid of points in 3D space. Many laser scans can be registered together to produce a “point cloud”. This is commonly used in the Cultural Heritage domain to create highly detailed 3D models. However, the raw point cloud often contains unwanted details and artefacts which require manual removal through a time-consuming process called point cloud cleaning. Whilst previous works have shown promising results, the application of deep learning in the cultural heritage domain has not been fully explored. In this study, we investigate the use of deep learning models for the binary segmentation of semantically varied cultural heritage sites using a few-shot fine-tuning approach. By relying only on learned geomet- ric information without additional point attributes such as colour, we demonstrate the e↵ective application of modern deep learning approaches in this domain. We present a method in which the user is presented with an unlabelled scan and asked to label a percentage between 2.5-50% of the scene. This labelled data is used to train the model and predict labels for the remainder of the scene. We compare the performance of three di↵erent deep learning models (Pointnet++, KPConv, and Point Transformer) against a baseline Random Forest model. We consider factors such as speed, pretraining, and the amount of hand labelling required. Our results show that minimal human labelling is needed to provide su�cient data for modern deep learning approaches and simple scenes are e�ciently labelled using Random Forests. We achieve up to 90% mean Intersection over Union on the remaining points, with as little as 5% of the scene labelled for training, in under two hours on consumer hardware. This translates to a reduction in manual labelling e↵ort of up to 90%. Among the models tested, KPConv reliably produces good results with minimal need for human labelling. The remaining models are slower to train and required more human labelling, while the Random Forest is e↵ective only on simple scans, where a machine learning model would be ine�cient to train. Our work demonstrates the potential of deep learning methods for label-e�cient and accurate point cloud cleaning in the cultural heritage domain, reducing the time spent on manual cleaning.
dc.identifier.apacitation	Hayward, L. (2023). <i>Learning to clean heritage point clouds</i>. (). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/43116	en_ZA
dc.identifier.chicagocitation	Hayward, Luc. <i>"Learning to clean heritage point clouds."</i> ., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2023. http://hdl.handle.net/11427/43116	en_ZA
dc.identifier.citation	Hayward, L. 2023. Learning to clean heritage point clouds. . University of Cape Town ,Faculty of Science ,Department of Computer Science. http://hdl.handle.net/11427/43116	en_ZA
dc.identifier.ris	TY - Thesis / Dissertation AU - Hayward, Luc AB - Laser scanning technology enables accurate distance measurements between a scanner and the environment producing a grid of points in 3D space. Many laser scans can be registered together to produce a “point cloud”. This is commonly used in the Cultural Heritage domain to create highly detailed 3D models. However, the raw point cloud often contains unwanted details and artefacts which require manual removal through a time-consuming process called point cloud cleaning. Whilst previous works have shown promising results, the application of deep learning in the cultural heritage domain has not been fully explored. In this study, we investigate the use of deep learning models for the binary segmentation of semantically varied cultural heritage sites using a few-shot fine-tuning approach. By relying only on learned geomet- ric information without additional point attributes such as colour, we demonstrate the e↵ective application of modern deep learning approaches in this domain. We present a method in which the user is presented with an unlabelled scan and asked to label a percentage between 2.5-50% of the scene. This labelled data is used to train the model and predict labels for the remainder of the scene. We compare the performance of three di↵erent deep learning models (Pointnet++, KPConv, and Point Transformer) against a baseline Random Forest model. We consider factors such as speed, pretraining, and the amount of hand labelling required. Our results show that minimal human labelling is needed to provide su�cient data for modern deep learning approaches and simple scenes are e�ciently labelled using Random Forests. We achieve up to 90% mean Intersection over Union on the remaining points, with as little as 5% of the scene labelled for training, in under two hours on consumer hardware. This translates to a reduction in manual labelling e↵ort of up to 90%. Among the models tested, KPConv reliably produces good results with minimal need for human labelling. The remaining models are slower to train and required more human labelling, while the Random Forest is e↵ective only on simple scans, where a machine learning model would be ine�cient to train. Our work demonstrates the potential of deep learning methods for label-e�cient and accurate point cloud cleaning in the cultural heritage domain, reducing the time spent on manual cleaning. DA - 2023 DB - OpenUCT DP - University of Cape Town KW - Computer Science KW - Cultural Heritage KW - 3D space LK - https://open.uct.ac.za PB - University of Cape Town PY - 2023 T1 - Learning to clean heritage point clouds TI - Learning to clean heritage point clouds UR - http://hdl.handle.net/11427/43116 ER -	en_ZA
dc.identifier.uri	http://hdl.handle.net/11427/43116
dc.identifier.vancouvercitation	Hayward L. Learning to clean heritage point clouds. []. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2023 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/43116	en_ZA
dc.language.iso	en
dc.language.rfc3066	eng
dc.publisher.department	Department of Computer Science
dc.publisher.faculty	Faculty of Science
dc.publisher.institution	University of Cape Town
dc.subject	Computer Science
dc.subject	Cultural Heritage
dc.subject	3D space
dc.title	Learning to clean heritage point clouds
dc.type	Thesis / Dissertation
dc.type.qualificationlevel	Masters
dc.type.qualificationlevel	Masters

Files

Original bundle

Now showing 1 - 1 of 1

Name:: thesis_sci_2023_hayward luc.pdf
Size:: 126.36 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.72 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Masters