Ebrahimi, Ali (2024) Perspective-independent point cloud processing: towards streamlining 3D computer vision workflows and enhancing 3D indoor scene perception. Doctoral (PhD) thesis, Memorial University of Newfoundland.
[English]
PDF
- Accepted Version
Available under License - The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission. Download (22MB) |
Abstract
The advent of commercially accessible depth sensors, augmented reality headsets, and smartphones equipped with depth sensing technologies has revolutionized the acquisition of 3D data, enabling comprehensive spatial understanding in real-world environments. This advancement has led to the widespread adoption of 3D data, offering significant benefits across a range of applications, from augmented reality to autonomous navigation. However, the complexity of indoor scenes poses significant challenges for 3D computer vision systems, particularly in cluttered environments where background surfaces hinder the detection and analysis of relevant foreground objects. This PhD thesis presents a comprehensive study of perspective-independent point cloud processing techniques tailored to address the challenges posed by cluttered and complex indoor environments. The primary objectives focus on streamlining 3D computer vision workflows by contextually segmenting and subtracting 3D background surfaces while enhancing 3D scene perception through accurate identification of these surfaces and spatial relationships within indoor scenes. Alongside the primary objectives, this thesis also addresses two sub-objectives: size reduction of indoor point clouds and labeling of various elements within complex indoor scenes. To achieve these objectives, four research endeavors are presented. Initially, two techniques were implemented for bounding surface segmentation and removal: Iterative Region-based RANdom SAmple Consensus (IR-RANSAC) and orientation-based M-estimator SAmple Consensus (MSAC). They considerably reduce the size of 3D datasets and the search space of various 3D computer vision applications, resulting in enhanced performance and faster processing times. IR-RANSAC demonstrates robust performance with a mean F₁ score above 94%, while Orientation-based MSAC achieves a mean F1 score exceeding 98%, showcasing its superior performance and notable computational efficiency. In the subsequent work, PiGPDS, a perspective-independent ground plane detection and segmentation method, was introduced as a method for detecting and segmenting ground planes in 3D complex indoor environments, where the position and orientation of the sensor are unrestricted and unknown. PiGPDS demonstrated exceptional performance, achieving an average F1 score of 96.01%, accurately segmenting ground surfaces of complex 3D indoor scenes acquired from diverse locations with varying pitches and yaws. Finally, in the concluding endeavor, PiPCS, a Perspective-Independent Point Cloud Simplifier, stands as a significant advancement, building upon the foundational research laid out in earlier studies. PiPCS redefines conventional 3D background subtraction techniques by contextually segmenting and eliminating 3D background components, yielding precisely segmented 3D foreground objects without relying on colour or historical data. PiPCS demonstrates outstanding performance, achieving an average F1 score of 91.27% and substantial size reductions averaging 74.11% across all dataset's point clouds. PiPCS optimizes 3D computer vision systems by streamlining their workflows, enhancing indoor scene perception, reducing point cloud size, and enabling precise labeling within complex indoor environments.
Item Type: | Thesis (Doctoral (PhD)) |
---|---|
URI: | http://research.library.mun.ca/id/eprint/16680 |
Item ID: | 16680 |
Additional Information: | Includes bibliographical references |
Keywords: | point cloud simplification, 3D Background subtraction, 3D foreground segmentation, 3D indoor scene perception, point cloud size reduction |
Department(s): | Engineering and Applied Science, Faculty of |
Date: | October 2024 |
Date Type: | Submission |
Digital Object Identifier (DOI): | https://doi.org/10.48336/PB2K-P285 |
Library of Congress Subject Heading: | Computer graphics; Three-dimensional modeling; Cloud computing; Three-dimensional imaging; Mathematical models |
Actions (login required)
View Item |