Recovering missing information when projecting 3D points or unprojecting image pixels

dc.contributor.author: Chen, Wayne
dc.date.accessioned: 2024-03-05T01:00:54Z
dc.date.available: 2024-03-05T01:00:54Z
dc.date.issued: 2024
dc.description.abstract: Computer vision aims to bridge the divide between 2D and 3D spaces. With significant advancements in computational resources and deep learning techniques, neural networks have become the cornerstone for solving computer vision tasks. As the training of a neural network is data-driven, both input and ground-truth data play pivotal roles in the training process. However, 2D data is usually dense but involves a projection operation that loses 3D information, while 3D data is often sparse due to sensor limitations.

Addressing this challenge, our research focuses on the recovery of missing information when projecting 3D points or unprojecting image pixels, exploring this problem across three tasks: novel view synthesis, uncertainty-aware monocular depth estimation, and latent space analyses for the deepSDF model.

Novel view synthesis from sparse coloured point clouds aims to generate dense RGB images from a sparse XYZRGB input.

Uncertainty-aware Monocular Depth Estimation (MDE) targets the generation of dense depth estimates given a dense RGB input and sparse depth ground truth. We propose a novel network with an encoder-decoder structure and a novel loss function that enables joint training of depth and uncertainty estimation. This model competes closely with state-of-the-art solutions on depth estimation evaluation metrics and outperforms them on uncertainty estimation.

The latent space analysis for the deepSDF model explores the connections among latent representations of different 3D models. Our experiments reveal that these latent codes are not independent; latent codes generated by linear interpolation between each pair of latent codes represent the transformation from one model to another.

Our findings confirm the existence and impact of sparsity within input data. However, our proposed methods demonstrate not only how to overcome these challenges but also how to evaluate their impact on the accuracy of the generated results. This work contributes to enhancing the accuracy and reliability of models tackling data sparsity in the field of computer vision.
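The core problem the abstract describes can be illustrated with a standard pinhole camera model (a minimal sketch, not taken from the thesis; the intrinsics fx, fy, cx, cy are arbitrary assumed values): projection divides out the depth, so a pixel alone cannot be unprojected back to 3D without recovering that missing information.

```python
# Illustrative pinhole-camera sketch: projection discards depth Z,
# so unprojection needs it supplied from elsewhere (e.g. an MDE network).
# fx, fy, cx, cy are assumed example intrinsics, not values from the thesis.

def project(point3d, fx=500.0, fy=500.0, cx=320.0, cy=240.0):
    """Project a camera-frame 3D point (X, Y, Z) to a pixel (u, v).
    Z is divided out, so it cannot be recovered from (u, v) alone."""
    X, Y, Z = point3d
    u = fx * X / Z + cx
    v = fy * Y / Z + cy
    return (u, v)

def unproject(pixel, depth, fx=500.0, fy=500.0, cx=320.0, cy=240.0):
    """Invert the projection: a pixel (u, v) plus a known depth gives the
    3D point back. Without the depth, the pixel only constrains a ray."""
    u, v = pixel
    X = (u - cx) * depth / fx
    Y = (v - cy) * depth / fy
    return (X, Y, depth)

# Round trip: projecting, then unprojecting with the true depth,
# recovers the original point.
p = (0.2, -0.1, 2.0)
uv = project(p)
restored = unproject(uv, 2.0)
assert all(abs(a - b) < 1e-9 for a, b in zip(restored, p))
```

The round trip only closes because the true depth is handed back to `unproject`; estimating that depth (densely, with calibrated uncertainty) is exactly the gap the thesis's MDE work targets.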
dc.identifier.uri: http://hdl.handle.net/1885/315712
dc.language.iso: en_AU
dc.title: Recovering missing information when projecting 3D points or unprojecting image pixels
dc.type: Thesis (MPhil)
local.contributor.authoremail: u5152653@anu.edu.au
local.contributor.supervisor: Zhang, Jing
local.contributor.supervisorcontact: u1031665@anu.edu.au
local.identifier.doi: 10.25911/4B7N-7X21
local.mintdoi: mint
local.thesisANUonly.author: dcd9c621-5c3d-462a-b3c5-bf249d9297db
local.thesisANUonly.key: 5b2a98c7-353b-7e06-ff1b-617d309e5528
local.thesisANUonly.title: 000000029187_TC_1

Downloads

Original bundle

Name: Chen_Thesis_Recovering missing information when projecting 3D points or unprojecting image pixels_2024.pdf
Size: 10.3 MB
Format: Adobe Portable Document Format
Description: Thesis Material