Computer Vision and Image Understanding xxx (2011) xxx–xxx
Contents lists available at ScienceDirect
Computer Vision and Image Understanding
journal homepage: www.elsevier.com/locate/cviu
GPU-friendly multi-view stereo reconstruction using surfel representation and graph cuts ⇑ Ju Yong Chang a, Haesol Park a, In Kyu Park b, , Kyoung Mu Lee a, Sang Uk Lee a a Department of Electrical Engineering and Computer Science, ASRI, Seoul National University, Seoul 151-742, Republic of Korea b School of Information and Communication Engineering, Inha University, Incheon 402-751, Republic of Korea article info abstract
Article history: In this paper, we present a new surfel (surface element) based multi-view stereo algorithm that runs Received 19 February 2010 entirely on GPU. We utilize the flexibility of surfel-based 3D shape representation and global optimiza- Revised 1 November 2010 tion by graph cuts in the same framework. Unlike previous works, the algorithm is optimized to massive Accepted 1 November 2010 parallel processing on GPU. First, we construct surfel candidates by local stereo matching and voting. Available online xxxx After refining the position and orientation of the surfel candidates, we extract the optimal surfels by employing graph cuts under photo-consistency and surfel orientation constraints. In contrast to the con- Keywords: ventional voxel based methods, the proposed algorithm utilizes more accurate photo-consistency and Multi-view stereo reconstructs the 3D shape up to sub-voxel accuracy. The orientation of the constructed surfel candidates 3D shape reconstruction Surfel imposes an effective constraint that reduces the effect of the minimal surface bias. The entire processing Graph cuts pipeline is implemented on the latest GPU to significantly speed up the processing. The experimental GPU results show that the proposed approach reconstructs the 3D shape of an object accurately and effi- Massive parallel processing ciently, which runs more than 100 times faster than on CPU. Ó 2010 Elsevier Inc. All rights reserved.
1. Introduction surfel set as the output. Moreover, the proposed pipeline can be fully implemented on a massive parallel GPU (graphics processing Reconstructing 3D shapes from images has been an important unit). We provide details of the parallel algorithm development, topic in computer vision. Among several approaches, the which is customized for the GPU architecture. The parallel algo- multi-view stereo has received a lot of attention since it recon- rithm reduces the runtime to a fraction compared to the CPU based structs accurate 3D shapes without using an expensive active implementation. device like a laser scanner. Until now, a substantial amount of A brief description of the proposed method is as follows. First, the methodologies have been proposed [34]. The main criteria for initial surfel candidates are determined based on surface likelihood evaluating the performance of multi-view stereo methods are and structured to make the subsequent graph cuts optimization reconstruction accuracy and computational complexity. Early vox- applicable. More specifically, we propose a method of parallel vot- el based methods such as voxel coloring [35] and space carving ing of surfel position and orientation from multiple depth maps, [19] lack accuracy, and even the recent methods based on graph in which visible cameras for each surfel are implicitly determined. cuts [13,39,40] have limited accuracy due to their unadaptable The initial surfel candidates are then refined by parallel refinement voxel structure. Moreover, the computational complexity of mul- based on photo-consistency. Finally, optimal surfels are determined ti-view stereo is considerable since the amount of data is signifi- by globally optimizing the energy functional via parallel graph cuts. cant and the solution space for optimization is vast. In addition to the conventional photo-consistency term, our energy In this paper, we propose a multi-view stereo reconstruction functional includes another term based on the normal vectors of pipeline to simultaneously achieve these two objectives, i.e., high surfels. This acts as a novel ballooning constraint helpful to reduce accuracy and speed, in a single framework. In order to achieve high the minimal surface bias in graph cuts based multi-view stereo accuracy, we exploit the flexibility of surfel (surface element) based [39,40]. 3D shape representation and the global optimization by graph cuts. The contributions of our work can be summarized as follows: Consequently, the proposed method produces a globally optimized We effectively combine a surfel-based surface representation with the optimization framework using graph cuts [3], allowing
⇑ Corresponding author. Mobile: +82 10 6347 9834; fax: +82 32 873 8970. the proposed method to produce the optimal surfels in a princi- E-mail address: [email protected] (I.K. Park). pled way.
1077-3142/$ - see front matter Ó 2010 Elsevier Inc. All rights reserved. doi:10.1016/j.cviu.2010.11.017
Please cite this article in press as: J.Y. Chang et al., GPU-friendly multi-view stereo reconstruction using surfel representation and graph cuts, Comput. Vis. Image Understand. (2011), doi:10.1016/j.cviu.2010.11.017 2 J.Y. Chang et al. / Computer Vision and Image Understanding xxx (2011) xxx–xxx