Deep Geometric Prior for Surface Reconstruction

Francis Williams1, Teseo Schneider1, Claudio Silva 1, Denis Zorin1, Joan Bruna1, and Daniele Panozzo1

1New York University [email protected], [email protected], [email protected], [email protected], [email protected], [email protected]

Abstract

The reconstruction of a discrete surface from a point cloud is a fundamental geometry processing problem that has been studied for decades, with many methods developed. We propose the use of a deep neural network as a geometric prior for surface reconstruction. Specifically, we overfit a neural network representing a local chart parameterization to part of an input point cloud using the Wasserstein distance as a measure of approximation. By jointly fitting many such networks to overlapping parts of the point cloud, while enforcing a consistency condition, we compute a manifold atlas. By sampling this atlas, we can produce a dense reconstruction of the surface approximating the input cloud. The entire procedure does not require any training data or explicit regularization, yet we show that it is able to perform remarkably well: not introducing typical overfitting artifacts, and approximating sharp features closely at the same time. We experimentally show that this geometric prior produces good results for both man-made objects containing sharp features and smoother organic objects, as well as noisy inputs. We compare our method with a number of well-known reconstruction methods on a standard surface reconstruction benchmark.

1. Introduction

3D geometry is commonly acquired in the form of collections of (possibly incomplete) range images (laser scanning, structured light, etc.) or measurements of more complex structure (LIDAR). An unordered set of points (a point cloud) is a commonly used representation of the combined registered results of scanning objects or scenes. Point clouds can be obtained in other ways, e.g., by sampling an implicit surface using ray casting. Computing a continuous representation of a surface from the discrete point cloud (e.g., a polygonal mesh, an implicit surface, or a set of parametric patches) in a way that is robust to noise, and yet retains critical surface features and approximates the sampled surface well, is a pervasive and challenging problem.

Different approaches have been proposed, mostly grouped into several categories: (1) using the points to define a volumetric scalar function whose 0 level-set corresponds to the desired surface, (2) attempting to "connect the dots" in a globally consistent way to create a mesh, (3) fitting a set of primitive shapes so that the boundary of their union is close to the point cloud, and (4) fitting a set of patches to the point cloud approximating the surface.

We propose a novel method, based, on the one hand, on constructing a manifold atlas commonly used in differential geometry to define a surface, and, on the other hand, on the observed remarkable properties of deep image priors [37], using an overfitted neural network for interpolation. We define a set of 2D parametrizations, each one mapping a square in parametric space to a region of a surface, ensuring consistency between neighbouring patches. This problem is inherently ambiguous: there are many possible valid parametrizations, and only a small subset will correspond to a faithful representation of the underlying surface. We compute each parametrization by overfitting a network to a part of the point cloud, while enforcing consistency conditions between different patches. We observe that the result is a reconstruction which is superior quantitatively and qualitatively to commonly used surface reconstruction methods.

We use the Wasserstein distance as a training loss, which is naturally robust to outliers, and has the added advantage of providing explicit correspondences between the parametric domain coordinates and the fitted points, allowing us to explicitly measure, and thus minimize, the disagreement between neighbouring patches covering the surface.

We use a standard shape reconstruction benchmark to compare our method with 12 competing methods, showing that, despite the conceptual simplicity of our algorithm, our reconstructions are superior in terms of quantitative and visual quality.

2. Related work

Geometric Deep Learning. A variety of architectures were proposed for geometric applications. A few work with point clouds as input; in most cases, however, these methods are designed for classification or segmentation tasks. Among the first examples are PointNet [33] and PointNet++ [34], originally designed for classification and segmentation, using a set-based neural architecture [39, 36]. PCPNet [14] is a version of the PointNet architecture for estimation of local shape properties. A number of learning architectures for 3D geometry work with voxel data, converting input point clouds to this form (e.g., [38]). The closest problems these types of networks solve are shape completion and point cloud upsampling.

Shape completion is considered, e.g., in [9], where a volumetric CNN is used to predict a very coarse shape completion, which is then refined using data-driven local shape synthesis on small volumetric patches. [16] follows a somewhat similar approach, combining multiview and volumetric low-resolution global data at a first stage, and using a volumetric network to synthesize local patches to obtain higher resolution. Neither of these methods aims to achieve high-quality surface reconstruction.

PU-Net, described in [41], is, to the best of our knowledge, the only learning-based work addressing point cloud upsampling directly. The method proceeds by splitting input shapes into patches and learning hierarchical features using a PointNet++ type architecture. Then feature aggregation and expansion is used to perform point set expansion in feature space, followed by the (upsampled) point set reconstruction.

In contrast to other methods, the untrained networks in our method take parametric coordinates in square parametric domains as inputs and produce surface points as output. An important exception is the recent work [12] defining an architecture, AtlasNet, in which the decoder part is similar to ours, but with some important distinctions discussed in Section 3. Finally, [4] studied the ability of neural networks to approximate low-dimensional manifolds, showing that even two-layer ReLU networks have a remarkable ability to encode smooth structures with a near-optimal number of parameters. In our setting, we rely on overparametrisation and leverage the implicit optimization bias of gradient descent.

Surface Reconstruction is an established research area dating back at least to the early 1990s (e.g., [17]); [6] is a recent comprehensive survey of the area. We focus our discussion on the techniques that we use for comparison, which are a superset of those included in the surface reconstruction benchmark of Berger et al. [5]. Berger tested 10 different techniques in their paper; we will follow their nomenclature. They separate techniques into four main categories: indicator function, point-set surfaces, multi-level partition of unity, and scattered point meshing.

Indicator function techniques define a scalar function in space that can be used for testing if a given point is inside or outside the surface. There are multiple ways to define such a function, from which a surface is generated by isocontouring. Poisson surface reconstruction (Poisson) [21] inverts the gradient operator by solving a Poisson equation to define the indicator function. Fourier surface reconstruction (Fourier) [20] represents the indicator function in the Fourier basis, while Wavelet surface reconstruction (Wavelet) [27] employs a Haar or a Daubechies (4-tap) basis. Screened Poisson surface reconstruction (Screened) [22] is an extension of [21] that incorporates point constraints to avoid the over-smoothing of the earlier technique. This technique is not considered in [5], but is considered by us.

Point-set surfaces [2] define a projection operator that moves points in space to a point on the surface, where the surface is defined to be the collection of stationary points of the projection operator. Definitions of the projection operators are beyond the scope of our paper (see [5]). In our experiments, we have used simple point set surfaces (SPSS) [1], implicit moving least squares (IMLS) [24], and algebraic point set surfaces (APSS) [13].

Edge-Aware Point Set Resampling (EAR) [18] (also not considered in [5], but considered by us) works by first computing reliable normals away from potential singularities, followed by a resampling step with a novel bilateral filter towards surface singularities. Reconstruction can be done using different techniques on the resulting augmented point set with normals.

Multi-level Partition of Unity defines an implicit function by integrating weight functions of a set of input points. The original approach of [29] (MPU) uses linear functions as low-order implicits, while [28] (MPUSm) defines differential operators directly on the MPU function. The method of [31] (RBF) uses compactly-supported radial basis functions.

Scattered Point Meshing [30] (Scattered) grows weighted spheres around points in order to determine the connectivity in the output triangle mesh.

The work in [32] uses a manifold-based approach to a direct construction of a global parametrization from a set of range images (a point cloud, or any other surface representation, can be converted to such a set by projection to multiple planes). It uses range images as charts with projections as chart maps; our method computes chart maps by fitting. [40] jointly optimizes for connectivity and geometry to produce a single mesh for an entire input point cloud. In contrast, our method produces a global chart map using only a local optimization procedure.

Deep Image Prior. Our approach is inspired, in part, by the deep image prior. [37] demonstrated that an untrained deep network can be overfitted to input data producing a remarkably high-quality upsampling and even hole filling without training, with the convolutional neural network structure acting as a regularizer. Our approach to surface reconstruction is similar, in that we use untrained networks to represent individual chart embedding functions. However, an important difference is that our loss function measures geometric similarity.

Figure 1. Overview of our proposed surface reconstruction architecture. An input point cloud X is split into overlapping patches X_p, where a local chart ϕ_p(v) = φ(v; θ_p) is obtained via the minimisation of a Wasserstein loss L̃(θ) (2). The local charts are made globally consistent thanks to the correspondences arising during the Wasserstein loss minimization (Section 3.2).

3. Method

Our method for surface reconstruction uses a set of deep ReLU networks to obtain local charts or parametrizations (Section 3.1). These parameterizations are then made consistent with each other on the parts where they overlap (Section 3.2). The networks are trained using the 2-Wasserstein distance as a loss function. The overall architecture of our technique is illustrated in Figure 1.

In the following, we denote by S a smooth surface (possibly with a boundary) in R^3. The goal of surface reconstruction is to estimate S from a possibly noisy point cloud X = {x_i = y_i + w_i; y_i ∈ S; i ≤ N}, where w_i models the acquisition noise.

3.1. Local Parametrization Model

Let us first consider the reconstruction of a given neighborhood of S around a point p, denoted as U_p, from the corresponding point cloud X_p = X ∩ U_p. If U_p is sufficiently small, from the implicit function theorem, one can characterize U_p ∩ S as the image of the open square V = (0, 1)^2 by a differentiable mapping ϕ: V → U_p.

We propose to approximate ϕ using a neural network, φ(·; θ_p): V → R^3, where θ_p is a vector of parameters, that we fit so that the image of φ approximates U_p ∩ S. For that purpose, we first consider a sample {v_1, ..., v_n} of n = |X_p| points in V using a Poisson disk distribution, and the associated Earth Mover's or 2-Wasserstein Distance (EMD) between {φ(v_i; θ_p): i = 1 ... n} and X_p = {x_1, ..., x_n}:

$$\mathcal{L}(\theta) = \inf_{\pi \in \Pi_n} \sum_{i \le n} \|\phi(v_i; \theta) - x_{\pi(i)}\|^2, \qquad (1)$$

where Π_n is the set of permutations of n points.

The computation of the EMD in (1) requires solving a linear assignment problem, which can be done in O(n^3) using, for instance, the Hungarian algorithm [25]. Since this is prohibitive for typical values of n, we rely instead on the Sinkhorn regularized distance [8]:

$$\tilde{\mathcal{L}}(\theta) = \min_{P \in \mathcal{P}_n} \sum_{i,j \le n} P_{i,j}\,\|\phi(v_j; \theta) - x_i\|^2 - \lambda^{-1} H(P), \qquad (2)$$

where P_n is the set of n × n bi-stochastic matrices and H(P) is the entropy, H(P) = −Σ_{i,j} P_{i,j} log P_{i,j}. This distance provably approximates the Wasserstein metric as λ → ∞ and can be computed in near-linear time [3]. Figure 14 in the supplemental material shows the effect of varying the regularization parameter λ on the results.
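To make the per-chart objective concrete, the following is a minimal sketch of the Sinkhorn-regularized loss of Eq. (2), assuming PyTorch. The function names (`sinkhorn_plan`, `chart_loss`) and the choice of holding the transport plan fixed when differentiating are illustrative simplifications, not the authors' reference implementation (which is linked in Section 4).

```python
import torch

def sinkhorn_plan(cost, lam=1000.0, iters=100):
    """Approximate bi-stochastic transport plan P for a squared-distance cost matrix."""
    # Log-domain Sinkhorn iterations for numerical stability.
    log_K = -lam * cost                      # kernel exp(-lambda * cost), in log space
    log_u = torch.zeros(cost.shape[0], device=cost.device)
    log_v = torch.zeros(cost.shape[1], device=cost.device)
    for _ in range(iters):
        log_u = -torch.logsumexp(log_K + log_v[None, :], dim=1)   # rescale rows
        log_v = -torch.logsumexp(log_K + log_u[:, None], dim=0)   # rescale columns
    return torch.exp(log_u[:, None] + log_K + log_v[None, :])     # approximately doubly stochastic

def chart_loss(phi, v, x, lam=1000.0):
    """Data term of Eq. (2): sum_ij P_ij ||phi(v_j) - x_i||^2, with P held fixed.

    The entropic term -H(P)/lambda does not depend on theta once P is frozen, so it is omitted.
    """
    y = phi(v)                                # n x 3 predicted surface points phi(v_j; theta)
    cost = torch.cdist(x, y) ** 2             # n x n squared distances ||x_i - phi(v_j)||^2
    with torch.no_grad():
        P = sinkhorn_plan(cost, lam)
    return (P * cost).sum()
```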

We choose φ to be an MLP with the half-rectified activation function:

$$\phi(v; \theta) = \theta_K\, \mathrm{ReLU}(\theta_{K-1}\, \mathrm{ReLU}(\cdots \mathrm{ReLU}(\theta_1 v))),$$

where θ_i, i = 1 ... K, are per-layer weight matrices. This choice of activation function implies that we are fitting a piece-wise linear approximation to X_p. We choose to overparametrize the network such that the total number of trainable parameters T = dim(θ_1) + ... + dim(θ_K), where dim refers to the total number of entries in the matrix, satisfies T ≫ 3n, which is the number of constraints.

Under such overparametrized conditions, one verifies that gradient descent converges to zero loss in polynomial time for least-squares regression tasks [10]. By observing that min_θ L̃(θ) = min_{P ∈ P_n} min_θ L̃(θ; P), with

$$\tilde{\mathcal{L}}(\theta; P) = \sum_{i,j \le n} P_{i,j}\,\|\phi(v_j; \theta) - x_i\|^2 - \lambda^{-1} H(P),$$

and that L̃(θ; P) is convex with respect to P, it follows that gradient descent can find global minimisers of L̃ in polynomial time.

As λ → ∞, the entropic constraint disappears, which implies that by setting P to any arbitrary permutation matrix Π, we can still obtain zero loss (min_θ L̃(θ, Π) = 0). In other words, the model has enough capacity to produce any arbitrary correspondence between the points {v_i}_i and the targets {x_i}_i in the limit λ → ∞. A priori, this is an undesirable property of our model, since it would allow highly oscillatory and irregular surface reconstructions. However, our experiments (Section 4) reveal that the gradient-descent optimization of L̃ remarkably biases the model towards solutions with no apparent oscillation. This implicit regularisation property is reminiscent of similar phenomena enjoyed by gradient descent in logistic regression [35, 15] or matrix factorization [26]. In our context, gradient descent appears to be favoring solutions with small complexity, measured for instance in terms of the Lipschitz constant of φ_θ, without the need of explicit regularization. We leave the theoretical analysis of such inductive bias for future work (Section 5). Note that, in practice, we set λ to a large value, which may have an extra regularizing effect.

3.2. Building a Global Atlas

Section 3.1 described a procedure to obtain a local chart around a point p ∈ S, with parametric domain V_p and its associated fitted parametrization ϕ_p = φ(·; θ_p*): V_p → R^3. In this section, we describe how to construct an atlas {(V_q, ϕ_q); q ∈ Q} by appropriately selecting a set of anchor points Q and by ensuring consistency between charts.

Consistency. To define atlas consistency more precisely, we need to separate the notions of parametric coordinate-assignment and surface approximation, since the local chart functions ϕ_p define both. We say that two charts p and q overlap if X_{p,q} = X_p ∩ X_q ≠ ∅. Each discrete chart ϕ_α is equipped with a permutation π_α, assigning indices of points in X_p to indices of parametric positions in V_p. Two overlapping charts p and q are consistent on the surface samples if

$$\varphi_q(v_{\pi_q^{-1}(i)}; \theta_q) = \varphi_p(v_{\pi_p^{-1}(i)}; \theta_p), \quad \forall\, x_i \in \mathcal{X}_p \cap \mathcal{X}_q,$$

i.e., for any point in the patch overlap, the values of the two chart maps at corresponding parametric values coincide. If all chart maps are interpolating, then consistency is guaranteed by construction, but this is in general not the case. We enforce consistency explicitly by minimizing a consistency loss (4).

Constructing the Atlas. We construct the set of patch centers Q using Poisson disk sampling [7] of X, with a specified radius r. For each q ∈ Q, we first extract a neighborhood X_q by intersecting a ball of radius cr centered at q with X (X ∩ B(q; cr)), where c > 1 is another hyperparameter. To reduce boundary effects, we consider a larger radius c̃ > c and use X̃_q := X ∩ B(q; c̃r) to fit the local chart for X_q. In general, the intersection of S with the ball B(q; c̃r) consists of multiple connected components, possibly of nontrivial genus. We use the heuristic described below to filter out points we expect to be on a different sheet from the ball center q.

Filtering Sample Sets X_p. In our experiments, we choose the ball radius to be sufficiently small to avoid most of the artifacts related to fitting patches to separate sheets of the surface intersection with B(q; cr). The radius can be easily chosen adaptively, although at a significant computational cost, by replacing a ball by several smaller balls whenever the quality of the fit is bad. Instead, we use a cheaper heuristic to eliminate points from each set that are likely to be on a different sheet: we assume that the input point cloud X also contains normals. If normals are unavailable, they can be easily estimated using standard local fitting techniques. We then discard all vertices in each X_p whose normals form an angle greater than a fixed threshold α with respect to the normal at the center. In all our experiments (Section 4) we used r = 2.5% of the bounding box diagonal enclosing the point cloud, c = 1.5, c̃ = 1.5, and α = 100 degrees.

To ensure consistency as defined above, we fit the charts in two phases. In the first phase, we locally fit each chart to its associated points. In the second phase, we compute a joint fitting of all pairs of overlapping charts. Let (θ_p, π_p) and (θ_q, π_q) denote the parameters and permutations of the patches p and q respectively at some iteration of the optimization. We compute the first local fitting phase as

$$\min_{\theta_p}\, \inf_{\pi_p} \sum_{i \le |\mathcal{X}_p|} \|\phi(v_i; \theta_p) - x_{\pi_p(i)}\|^2. \qquad (3)$$

We define the set of indices of parametric points in chart p of the intersection X_{p,q} as

$$\mathcal{T}_{pq} = \{\, i \mid x_{\pi_p(i)} \in \mathcal{X}_{p,q} \,\},$$

where T_{qp} is the corresponding set in chart q. The map between indices of corresponding parametric points in two patches is given by π_{p→q} := π_q^{-1} ∘ π_p : T_{pq} → T_{qp}. Equipped with this correspondence, we compute the second joint fitting phase between all patch pairs as

$$\min_{\theta_p, \theta_q}\, \inf_{\pi_p, \pi_q} \sum_{i \in \mathcal{T}_{pq}} \|\phi(v_i; \theta_p) - \phi(v_{\pi_{p \to q}(i)}; \theta_q)\|^2. \qquad (4)$$

Observe that by the triangle inequality,

$$\|\phi(v_i; \theta_p) - \phi(v_{\pi_{p \to q}(i)}; \theta_q)\| \le \|\phi(v_i; \theta_p) - x_{\pi_p(i)}\| + \|\phi(v_{\pi_{p \to q}(i)}; \theta_q) - x_{\pi_p(i)}\|.$$

Therefore, the joint fitting term (4) is bounded by the sum of two separate fitting terms (3) for each patch (note that x_{π_p(i)} = x_{π_q(π_{p→q}(i))}). Consistent transitions are thus enforced by the original Wasserstein loss if the charts are interpolating, i.e. φ(v_i; θ_p) = x_{π_p(i)} for all i, p. However, in the presence of noisy point clouds, the joint fitting phase enables a smooth transition between local charts without requiring an exact fit through the noisy samples.

If the Sinkhorn distance is used instead of the EMD, then we project the stochastic matrices P_q, P_q' to the nearest permutation matrix by setting to one the maximum entry in each row.
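Once each chart carries such an assignment, the transition map π_{p→q} is simply a composition of the two per-chart assignments restricted to the overlap. The sketch below (PyTorch-style; `transition_indices`, `joint_fitting_loss`, and the chart containers are hypothetical names, not the released code) illustrates how Eqs. (3) and (4) combine in the joint phase for one pair of overlapping charts, assuming each chart stores its parametric samples `v`, its MLP `phi`, and an assignment `pi` from parametric indices to indices into the full point cloud (e.g., obtained by projecting the Sinkhorn plan to a permutation as described above).

```python
import torch

def transition_indices(pi_p, pi_q):
    """Return index pairs (i, j) with pi_p[i] == pi_q[j], i.e. j = (pi_q^{-1} o pi_p)(i)."""
    inv_q = {int(x): j for j, x in enumerate(pi_q)}            # pi_q^{-1} as a dictionary
    return [(i, inv_q[int(x)]) for i, x in enumerate(pi_p) if int(x) in inv_q]

def joint_fitting_loss(chart_p, chart_q, points):
    """Eq. (3) for each chart plus the consistency term of Eq. (4) on the overlap X_{p,q}."""
    loss = 0.0
    for chart in (chart_p, chart_q):                           # local fitting terms, Eq. (3)
        pred = chart.phi(chart.v)
        loss = loss + ((pred - points[chart.pi]) ** 2).sum()
    yp = chart_p.phi(chart_p.v)                                # consistency term, Eq. (4)
    yq = chart_q.phi(chart_q.v)
    for i, j in transition_indices(chart_p.pi, chart_q.pi):
        loss = loss + ((yp[i] - yq[j]) ** 2).sum()
    return loss
```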

4. Experiments

Experimental Setup. We run our experiments on a computing node with an Intel(R) Xeon(R) CPU E5-2690 v4, 256 GB of memory, and 4 NVIDIA Tesla P40 GPUs. The runtime of our algorithm is considerably higher than that of competing methods, requiring around 0.45 minutes per patch, for a total of up to 6.5 hours to optimize an entire model. We optimize the losses (3) and (4) using the ADAM [23] implementation in PyTorch with default parameters. Specifically for ADAM, we use a learning rate of 10^-3, β = (0.9, 0.999), ε = 10^-8, and no weight decay. For the Sinkhorn loss, we use a regularization parameter λ = 1000. For the networks φ, we use an MLP with fully connected layer sizes (2, 128, 256, 512, 512, 3) and ReLU activations. Our reference implementation is available at https://github.com/fwilliams/deep-geometric-prior.
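For concreteness, a minimal sketch of the network and optimizer configuration just described (assuming PyTorch; `make_chart_mlp` is an illustrative helper, not part of the released code):

```python
import torch
import torch.nn as nn

def make_chart_mlp(widths=(2, 128, 256, 512, 512, 3)):
    """Fully connected ReLU MLP mapping 2D parametric coordinates to 3D points."""
    layers = []
    for w_in, w_out in zip(widths[:-1], widths[1:]):
        layers.append(nn.Linear(w_in, w_out))
        layers.append(nn.ReLU())
    layers.pop()                       # no activation after the last (output) layer
    return nn.Sequential(*layers)

phi = make_chart_mlp()
optimizer = torch.optim.Adam(phi.parameters(), lr=1e-3,
                             betas=(0.9, 0.999), eps=1e-8, weight_decay=0)
```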

Single Patch Fitting. Our first experiment shows the behaviour of a single-patch network overfitted on a complex surface point cloud (Figure 2, left). Our result is a tight fit to the point cloud. An important side effect of our construction is an explicit local surface parametrization, which can be used to compute surface curvature, normals, or to apply an image onto the surface as a texture map (Figure 2, right).

Figure 2. Single patch fitting (input and fitted surface) with uv-mapping illustrated with a checkerboard texture.

Figure 3 shows the evolution of the fitting and of the parameterisation ϕ_θ as the optimization of L̃(θ) progresses. We observe that the optimization path follows a trajectory where φ_θ does not exhibit oscillations, supporting the hypothesis that gradient descent biases towards solutions with low complexity.

Figure 3. Evolution of single-patch reconstruction (shown at iterations 25, 75, 100, 150, 500, and at convergence) as the loss L̃(θ) is minimised. The jumps in the loss are a side-effect of the adaptive gradient descent scheme ADAM.

Global Consistency. As described in Section 3.2, reconstructing an entire surface from local charts requires consistent transitions, leading to the formulation in (3) and (4), which reinforces consistency across overlapping patches. Figure 5 illustrates the effect of adding the extra consistency terms. We verify that these terms significantly improve the consistency.

Surface Reconstruction Benchmark. To evaluate quantitatively the performance of our complete reconstruction pipeline, we use the setup proposed by [5], using the first set of range scans for each of the 5 benchmark models. Figure 4 shows the results (and percentage error) of our method on the five models of the benchmark. We compare our results against the baseline methods described in Section 2, and use the following metrics to evaluate the quality of the reconstruction: let X denote the input point cloud. From the ground-truth surface S and the reconstruction Ŝ, we obtain two dense point clouds that we denote respectively by Y and Ŷ. We consider

$$d_{\mathrm{inp}\to\mathrm{rec}}(i) = \min_{\hat{y}_j \in \hat{\mathcal{Y}}} \|x_i - \hat{y}_j\|, \;\; i \le |\mathcal{X}|, \qquad d_{\mathrm{rec}\to\mathrm{GT}}(j) = \min_{y_k \in \mathcal{Y}} \|\hat{y}_j - y_k\|, \;\; j \le |\hat{\mathcal{Y}}|. \qquad (5)$$

That is, d_rec→GT measures a notion of precision of the reconstruction, while d_inp→rec measures a notion of recall. Whereas d_rec→GT is an indication of overall quality, it does not penalize the methods for not covering undersampled regions of the input. Figure 6 illustrates these one-sided correspondence measures. A naive reconstruction that copies the input satisfies d_inp→rec ≡ 0, but since in general the input point cloud consists of noisy measurements, we will have d_inp→GT > 0.
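A minimal sketch of these two one-sided measures of Eq. (5), assuming PyTorch and dense point clouds stored as N×3 tensors (`one_sided_distances` is an illustrative name):

```python
import torch

def one_sided_distances(X, Y_hat, Y):
    """X: input cloud, Y_hat: dense sampling of the reconstruction, Y: dense sampling of the ground truth."""
    d_inp_to_rec = torch.cdist(X, Y_hat).min(dim=1).values     # recall-like: input -> reconstruction
    d_rec_to_gt = torch.cdist(Y_hat, Y).min(dim=1).values      # precision-like: reconstruction -> ground truth
    return d_inp_to_rec, d_rec_to_gt

# The cumulative histograms of Figures 7 and 8 then report, for each error threshold t,
# the fraction of points with distance below t, e.g. (d_rec_to_gt < t).float().mean().
```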

1.5%

1%

0.5%

0% Anchor Dancing Children Daratech Gargoyle Lord Quas

Figure 4. Fitted points on the different models of the benchmark, the color illustrates the error with respect to the ground truth.

Figure 5. Initial patch fitting (left): we clearly see the "un-mixed" colors due to the disagreeing patches. After optimization (right), all patches are overlapping, producing a mix of colors.

Figure 8. Percentage of fitted vertices (y-axis) reaching a given error (d_rec→GT, x-axis) for different methods on the Anchor and Gargoyle models; the compared methods are APSS, EAR, Fourier, IMLS, MPU, MPUSm, Poisson, Screened Poisson, RBF, Scattered, SPSS, Wavelet, and ours. The errors are measured as the distance from the input data to the fitted surface. The plots for the remaining models of the dataset are provided in the supplementary document.

Figure 6. Diagram illustrating the evaluation measures we use to compare different reconstructions: d_inp→rec is measured from the input to the reconstructed surface, and d_rec→GT from the reconstruction to the ground truth.

Figure 7. Percentage of fitted vertices (y-axis, log scale) reaching a given error (d_inp→rec, x-axis) for different methods on the Anchor and Gargoyle models. The errors are computed from the fitted surface to the ground truth. The plots for the remaining models of the dataset are provided in the supplementary document.

Figure 9. EAR (left) versus Ours (right). The input point cloud contains noise which is smoothed out by our reconstruction (while preserving sharp features quite well) but interpolated by EAR. The result is that EAR produces spurious points and visible artifacts.

Figures 7 and 8 show respectively the percentage of vertices of Ŷ and X such that d_rec→GT and d_inp→rec are below a given error.

Our technique outperforms all the techniques we tested, and it is on par with the state-of-the-art EAR method [19], which achieves a similar score for these 5 models. But, as we will discuss in the next paragraph, EAR is unable to cope with noisy inputs. This is a remarkable result considering that our method, differently from all the others, produces an explicit parametrization of the surface, which can be resampled at arbitrary resolutions, used for texture mapping, or to compute analytic normals and curvature values.

EAR is an interpolative method, which by construction satisfies X ⊂ Ŷ. It follows that the noise in the measurements is necessarily transferred to the surface reconstruction.


Figure 9 illustrates that our deep geometric prior preserves the sharp structures while successfully denoising the input point cloud. The mathematical analysis of such implicit regularisation is a fascinating direction of future work.

Noise and Sharp Features. As discussed above, the behavior of surface reconstruction methods is particularly challenging in the presence of sharp features and noise in the input measurements. We performed two additional experiments to compare the behaviour of our architecture with the most representative traditional methods on both noisy point clouds and models with sharp features. Screened Poisson Surface Reconstruction [22] (Figure 10, left) is very robust to noise, but fails at capturing sharp features. EAR (Figure 10, middle) is the opposite: it captures sharp features accurately, but being an interpolatory method it fails at reconstructing noisy inputs, thus introducing spurious points and regions during the reconstruction. Our method (Figure 10, right) does not suffer from these limitations, robustly fitting noisy inputs and capturing sharp features.

Figure 10. Example of reconstruction in extreme conditions for our method versus the most representative traditional methods (rows: sharp features and noise; columns: Reference, Poisson, EAR, Ours). The red dots in the Reference are the input points for the reconstruction.

Generating a Triangle Mesh. Our method generates a collection of local charts, which can be sampled at an arbitrary resolution. We can generate a triangle mesh by using off-the-shelf techniques such as Poisson Surface Reconstruction [22] on our dense point clouds. We provide meshes reconstructed in this way for all the benchmark models at https://github.com/fwilliams/deep-geometric-prior.
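As a rough illustration of this resampling step (not the authors' released pipeline), one can evaluate each fitted chart on a dense grid of the unit square and concatenate the results; `sample_atlas` and the `charts` list below are hypothetical names, assuming PyTorch.

```python
import torch

def sample_atlas(charts, resolution=128):
    """Evaluate every chart on a dense grid of the unit square and concatenate the results."""
    u = torch.linspace(0.0, 1.0, resolution)
    uv = torch.cartesian_prod(u, u)               # (resolution^2, 2) grid over the unit square
    with torch.no_grad():
        return torch.cat([phi(uv) for phi in charts], dim=0)   # dense point cloud in R^3

# The resulting dense cloud can then be meshed with an off-the-shelf method such as
# screened Poisson reconstruction, as described above.
```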
Comparison with AtlasNet [12]. Our atlas construction is related to the AtlasNet model introduced in [12]. AtlasNet is a data-driven reconstruction method using an autoencoder architecture. While their emphasis was on leveraging semantic similarity of shapes and images on several 3D tasks, we focus on high-fidelity point cloud reconstruction in data-sparse regimes, i.e. in the absence of any training data. Our main contribution is to show that in such regimes, an even simpler neural network yields state-of-the-art results on surface reconstruction. We also note the following essential differences between our method and AtlasNet.

• No Learning: Our model does not require any training data, and, as a result, we do not need to consider an architecture with specific encoders.

• Transition Functions: Since we have pointwise correspondences, we can define a transition function between overlapping patches V_p and V_q by consistently triangulating corresponding parametric points in V_p and V_q. In contrast, AtlasNet does not have such correspondences and thus does not produce a true manifold atlas.

• Patch Selection: We partition the input into point-sets X_p that we fit separately. While it is theoretically attractive to attempt to fit each patch to the whole set as it is done in AtlasNet, and let the algorithm figure out the patch partition automatically, in practice the difficulty of the optimization problem leads to unsatisfactory results. In other words, AtlasNet is approximating a global matching whereas our model only requires local matchings within each patch.

• Wasserstein vs. Chamfer Distance: As discussed above, the EMD automatically provides transition maps across local charts. AtlasNet considers instead the Chamfer distance between point clouds, which is more efficient to compute but sacrifices the ability to construct bijections in the overlapping regions. Moreover, as illustrated in Figure 11, we observe that Chamfer distances may result in undesirable effects even within local charts.

• Chart Consistency: We explicitly enforce consistency (4), which has a significant effect on quality, as illustrated in Section 4, whereas AtlasNet does not produce a real manifold atlas, since it has no definition of transition maps.

Figure 11. Surface reconstruction of stacked cubes using a single chart, with two different choices of metric (Chamfer, and Wasserstein with Sinkhorn regularization). We verify that the Wasserstein metric, even with the Sinkhorn entropy-regularization, provides a more uniform parametrisation, as well as bijective correspondences between overlapping charts.

We provide quantitative and qualitative comparisons to assess the impact of our architecture choices by adapting AtlasNet to a data-free setting. In this setting, we overfit AtlasNet on a single model with the same number of patches used for our method. Figure 12 reports both d_rec→GT and d_inp→rec cumulative histograms on a twisted cube surface using 10 local charts. We verify that when the AtlasNet architecture is trained to fit the surface using our experimental setup, it is clearly outperformed both quantitatively and qualitatively by our deep geometric prior. We emphasize however that AtlasNet is designed as a data-driven approach, and as such it can leverage semantic information from large training sets.

Figure 12. Comparison with AtlasNet. Top: d_rec→GT and d_inp→rec cumulative histograms. Bottom: surface reconstruction visualization. Observe that local charts do not have consistent transitions in the AtlasNet output.

5. Discussion

Neural networks – particularly in the overparametrised regime – are remarkably efficient at curve fitting in high-dimensional spaces. Despite recent progress in understanding the dynamics of gradient descent in such regimes, their ability to learn and generalize by overcoming the curse of dimensionality remains a major mystery. In this paper, we bypass this high-dimensional regime and concentrate on a standard low-dimensional interpolation problem: surface reconstruction. We demonstrate that in this regime neural networks also have an intriguing capacity to reconstruct complex signal structures while providing robustness to noise.

Our model is remarkably simple, combining two key principles: (i) construct local piece-wise linear charts by means of a vanilla ReLU fully-connected network, and (ii) use Wasserstein distances in each neighborhood, enabling consistent transitions across local charts. The resulting architecture, when combined with gradient descent, provides a "deep geometric prior" that is shown to outperform existing surface-reconstruction methods, which rely on domain-specific geometric assumptions. The theoretical analysis of this deep geometric prior is our next focus, which should address questions such as how the geometry of the surface informs the design of the neural network architecture, or why gradient descent is biasing towards locally regular reconstructions.

Despite these promising directions, we also note the limitations our approach is facing. In particular, our method is currently substantially more expensive than alternatives. One natural possibility to accelerate it would be to train a separate neural network model to provide an efficient initialization for the local chart minimisation (2), similarly as in neural sparse coders [11]. Another important question for future research is adaptive patch selection, which would leverage the benefits of multiscale approximations.

6. Acknowledgements

We are grateful to the NYU HPC staff for providing computing cluster service. This project was partially supported by the NSF CAREER award 1652515, the NSF grant IIS-1320635, the NSF grant DMS-1436591, the NSF grant DMS-1835712, the NSF grant RI-IIS 1816753, the SNSF grant P2TIP2 175859, the Alfred P. Sloan Foundation, the Moore-Sloan Data Science Environment, the DARPA D3M program, NVIDIA, Samsung Electronics, Labex DigiCosme, DOA W911NF-17-1-0438, a gift from Adobe Research, and a gift from nTopology. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of DARPA. The authors would also like to thank the anonymous reviewers for their time and effort.

References

[1] A. Adamson and M. Alexa. Approximating and intersecting surfaces from points. In SGP, page 239, 2003.
[2] M. Alexa, J. Behr, D. Fleishman, D. Levin, and C. Silva. Point Set Surfaces. In IEEE VIS, page 21, 2001.
[3] J. Altschuler, J. Weed, and P. Rigollet. Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration. In Advances in Neural Information Processing Systems, pages 1964–1974, 2017.
[4] R. Basri and D. Jacobs. Efficient representation of low-dimensional manifolds using deep networks. arXiv preprint arXiv:1602.04723, 2016.
[5] M. Berger, J. A. Levine, L. G. Nonato, G. Taubin, and C. T. Silva. A benchmark for surface reconstruction. ACM TOG, 32(2):20:1–20:17, 2013.
[6] M. Berger, A. Tagliasacchi, L. M. Seversky, P. Alliez, G. Guennebaud, J. A. Levine, A. Sharf, and C. T. Silva. A survey of surface reconstruction from point clouds. CGF, 36(1):301–329, 2017.
[7] J. Bowers, R. Wang, L.-Y. Wei, and D. Maletz. Parallel Poisson disk sampling with spectrum analysis on surfaces. In ACM Transactions on Graphics (TOG), volume 29, page 166. ACM, 2010.
[8] M. Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. In Advances in Neural Information Processing Systems, pages 2292–2300, 2013.
[9] A. Dai, C. R. Qi, and M. Nießner. Shape completion using 3D-encoder-predictor CNNs and shape synthesis. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), volume 3, 2017.
[10] S. S. Du, X. Zhai, B. Poczos, and A. Singh. Gradient descent provably optimizes over-parameterized neural networks. arXiv preprint arXiv:1810.02054, 2018.
[11] K. Gregor and Y. LeCun. Learning fast approximations of sparse coding. In Proceedings of the 27th International Conference on Machine Learning, pages 399–406. Omnipress, 2010.
[12] T. Groueix, M. Fisher, V. G. Kim, B. C. Russell, and M. Aubry. AtlasNet: A papier-mâché approach to learning 3D surface generation. arXiv preprint arXiv:1802.05384, 2018.
[13] G. Guennebaud and M. Gross. Algebraic Point Set Surfaces. SIGGRAPH, 26:23–9, 2007.
[14] P. Guerrero, Y. Kleiman, M. Ovsjanikov, and N. J. Mitra. PCPNet: Learning local shape properties from raw point clouds. Computer Graphics Forum, 37(2):75–85, 2018.
[15] S. Gunasekar, J. Lee, D. Soudry, and N. Srebro. Characterizing implicit bias in terms of optimization geometry. arXiv preprint arXiv:1802.08246, 2018.
[16] X. Han, Z. Li, H. Huang, E. Kalogerakis, and Y. Yu. High-resolution shape completion using deep neural networks for global structure and local geometry inference. In Proceedings of IEEE International Conference on Computer Vision (ICCV), 2017.
[17] H. Hoppe, T. DeRose, T. Duchamp, J. McDonald, and W. Stuetzle. Surface reconstruction from unorganized points. In SIGGRAPH, pages 71–78, 1992.
[18] H. Huang, S. Wu, M. Gong, D. Cohen-Or, U. M. Ascher, and H. R. Zhang. Edge-aware point set resampling. ACM TOG, 32(1):9:1–9:12, 2013.
[19] Q. Huang, F. Wang, and L. Guibas. Functional map networks for analyzing and exploring large shape collections. In SIGGRAPH, 2014.
[20] M. Kazhdan. Reconstruction of solid models from oriented point sets. In SGP, page 73, 2005.
[21] M. Kazhdan, M. Bolitho, and H. Hoppe. Poisson surface reconstruction. In SGP, page 70, 2006.
[22] M. M. Kazhdan and H. Hoppe. Screened Poisson surface reconstruction. ACM TOG, 32(3):29:1–29:13, 2013.
[23] D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
[24] R. Kolluri. Provably Good Moving Least Squares. In SODA, pages 1008–1018, 2005.
[25] H. W. Kuhn. The Hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2(1-2):83–97, 1955.
[26] Y. Li, T. Ma, and H. Zhang. Algorithmic regularization in over-parameterized matrix recovery. arXiv preprint arXiv:1712.09203, 2017.
[27] J. Manson, G. Petrova, and S. Schaefer. Streaming surface reconstruction using wavelets. 27(5):1411–1420, 2008.
[28] Y. Nagai, Y. Ohtake, and H. Suzuki. Smoothing of Partition of Unity Implicit Surfaces for Noise Robust Surface Reconstruction. 28(5):1339–1348, 2009.
[29] Y. Ohtake, A. Belyaev, M. Alexa, G. Turk, and H. Seidel. Multi-level partition of unity implicits. ACM TOG, 22(3):463–470, 2003.
[30] Y. Ohtake, A. Belyaev, and H. Seidel. An integrating approach to meshing scattered point data. In Symposium on Solid and Physical Modeling, page 69, 2005.
[31] Y. Ohtake, A. G. Belyaev, and H.-P. Seidel. 3D scattered data interpolation and approximation with multilevel compactly supported RBFs. Graphical Models, 67(3):150–165, 2005.
[32] N. Pietroni, M. Tarini, O. Sorkine, and D. Zorin. Global parametrization of range image sets. In ACM Transactions on Graphics (TOG), volume 30, page 149. ACM, 2011.
[33] C. R. Qi, H. Su, K. Mo, and L. J. Guibas. PointNet: Deep learning on point sets for 3D classification and segmentation. In CVPR, 2017.
[34] C. R. Qi, L. Yi, H. Su, and L. J. Guibas. PointNet++: Deep hierarchical feature learning on point sets in a metric space. arXiv preprint arXiv:1706.02413, 2017.
[35] D. Soudry, E. Hoffer, M. S. Nacson, S. Gunasekar, and N. Srebro. The implicit bias of gradient descent on separable data. arXiv preprint arXiv:1710.10345, 2017.
[36] S. Sukhbaatar, R. Fergus, et al. Learning multiagent communication with backpropagation. In Advances in Neural Information Processing Systems, pages 2244–2252, 2016.
[37] D. Ulyanov, A. Vedaldi, and V. Lempitsky. Deep image prior. arXiv preprint arXiv:1711.10925, 2017.
[38] J. Varley, C. DeChant, A. Richardson, J. Ruales, and P. Allen. Shape completion enabled robotic grasping. In Intelligent Robots and Systems (IROS), 2017 IEEE/RSJ International Conference on, pages 2442–2447. IEEE, 2017.
[39] O. Vinyals, S. Bengio, and M. Kudlur. Order matters: Sequence to sequence for sets. arXiv preprint arXiv:1511.06391, 2015.
[40] S. Xiong, J. Zhang, J. Zheng, J. Cai, and L. Liu. Robust surface reconstruction via dictionary learning. ACM Transactions on Graphics (TOG), 33(6):201, 2014.
[41] L. Yu, X. Li, C.-W. Fu, D. Cohen-Or, and P.-A. Heng. PU-Net: Point cloud upsampling network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2790–2799, 2018.