Klassifikation Urbaner Punktwolken Mittels 3D CNNs In Kombination mit Rekonstruktion von Gehsteigen
DIPLOMARBEIT
zur Erlangung des akademischen Grades
Diplom-Ingenieurin
im Rahmen des Studiums
Visual Computing
eingereicht von
Lisa Maria Kellner,BSc Matrikelnummer 01428183
an der Fakultät für Informatik der Technischen Universität Wien Betreuung: Univ.-Prof.Dipl.-Ing. Dr.techn. Dr.h.c. Werner Purgathofer Mitwirkung: Dipl.-Ing. Dr.Stefan Maierhofer
Wien, 23. Februar 2021 Lisa Maria Kellner Werner Purgathofer
Technische UniversitätWien A-1040 Wien Karlsplatz 13 Tel. +43-1-58801-0 www.tuwien.ac.at
Classification of Urban Point CloudsUsing 3D CNNs In Combination with Reconstruction of Sidewalks
DIPLOMA THESIS
submitted in partial fulfillmentofthe requirements forthe degree of
Diplom-Ingenieurin
in
Visual Computing
by
Lisa Maria Kellner,BSc Registration Number 01428183
to the Faculty of Informatics at the TU Wien Advisor: Univ.-Prof.Dipl.-Ing. Dr.techn. Dr.h.c. Werner Purgathofer Assistance: Dipl.-Ing. Dr.Stefan Maierhofer
Vienna, 23rd February, 2021 Lisa Maria Kellner Werner Purgathofer
Technische UniversitätWien A-1040 Wien Karlsplatz 13 Tel. +43-1-58801-0 www.tuwien.ac.at
Erklärung zur Verfassungder Arbeit
Lisa Maria Kellner,BSc
Hiermit erkläre ich, dass ichdieseArbeit selbständig verfasst habe,dass ichdie verwen- detenQuellenund Hilfsmittel vollständig angegeben habeund dass ichdie Stellen der Arbeit–einschließlichTabellen,Karten undAbbildungen –, dieanderen Werken oder dem Internet im Wortlaut oder demSinn nach entnommensind, aufjeden Fall unter Angabeder Quelleals Entlehnung kenntlich gemacht habe.
Wien, 23. Februar 2021 Lisa Maria Kellner
v
Danksagung
Es wareine langeReise vomBeginnmeinesStudiums bis zu dieser Diplomarbeit und ichhabewährenddessensehrviel Unterstützung vonFamilie,Freunden undKollegen erhalten.Deshalb möchte ichmichandieser Stelle bedanken. Zuerst,möchteich mich beiVRVis Zentrum für Virtual Realityund Visualisierung Forschungs-GmbH bedanken, welches mir diese Diplomarbeit ermöglicht hat. Die VRVis Forschungs-GmbH wirdimRahmen vonCOMET –Competence Centers for Excellent Technologies(854174, 879730) durchBMK, BMDW, Land Steiermark,Steirische Wirt- schaftsförderung–SFG, Land Tirol und WirtschaftsagenturWien –Ein Fonds der Stadt Wien gefördert.Das Programm COMET wird durchdie FFGabgewickelt. All meinen Kollegen am VRVis sei ebenfalls einDankausgesprochen, für die fachliche Hilfe,Dis- kussionen,Anregungen und Unterstützung während meines Studiumsund dieserArbeit. Dabei ist einganz besonderer Dank an Stefan, Attila,Michi, Harri, Georg, Andi,Thomas, Lui,Rebecca, Janne, Mariaund Dani gerichtet. Ebenfalls möchte ichmichbei meinenEltern bedanken, welche michwährend desgesamten Studiumsunterstützt haben.Hier seiauchein großerDankanmeine restlicheFamilie und Freunde gerichtet,die mir während meinesStudiumszur Seite gestandenhaben. Außerdem möchte ichmichbei CycloMedia Deutschland,LiDAR PointCloud und StadtentwässerungsbetriebeKöln,AöR für das Bereitstellender Daten aus Köln bedanken.
vii
Acknowledgements
It hasbeenalong journey from thebeginning of my studies to this thesisand Ihave received alot of support fromfamily,friends and colleagues during this time. Therefore, Iwouldliketotakethis opportunity to thank them. First, Iwouldliketothank VRVisZentrum für VirtualRealityund Visualisierung Forschungs-GmbH formaking this thesispossible. VRVis is funded by BMK, BMDW, Styria,SFG,Tyrol and Vienna BusinessAgency in thescopeofCOMET -Competence Centers for ExcellentTechnologies(854174, 879730) which is managed by FFG. Iwould alsoliketothank all my colleagues at VRVis fortheir professionalhelp, discussions, suggestions and support duringmystudies and this thesis.Special thanksgotoStefan, Attila,Michi, Harri,Georg, Andi, Thomas,Lui, Rebecca, Janne, Maria and Dani. Iwouldalso liketothank my parents, who supported me throughoutmystudies. Iwould alsoliketothank the rest of my family and friends,who have supportedmeduring my studies. Furthermore, Iwouldliketothank CycloMedia Deutschland,LiDAR PointCloud and StadtentwässerungsbetriebeKöln,AöR for providing the data fromCologne.
ix
Kurzfassung
LiDAR-Gerätesind in der Lage diephysische Welt sehr genau zu erfassen. Daher werden sie häufig fürdie 3D-Rekonstruktionverwendet. Leider könnensolcheDaten sehrschnell sehrgroßwerden und meist ist nurein Bruchteilder Punktwolketatsächlichvon Interesse. Daher wirddie Punktwolkevorher gefiltert, um Algorithmennur auf diefür sie relevanten Punkte anzuwenden. Eine semantische Informationüberdie Punkte kann für einesolche Filterung verwendet werden. Die semantische Segmentierung einer Punktwolkeist ein beliebtes Forschungsgebietund auchhier gibtesinden letzten Jahren einenTrendin Richtung DeepLearning. Jedoch sindPunktwolken,imGegensatzzuBildern, nicht strukturiert.Daher werdenPunktwolken oft rasterisiert, wasjedochineinerWeise geschehen muss, sodass die zugrunde liegende Struktur gut erhaltenbleibt. In dieser Arbeitwird ein 3D Convolutional Neural Network für die semantische Segmen- tierung einer LiDAR-Punktwolkeentwickelt und trainiert. Dabei wirdeine Punktwolke mittelseinerOctree-Datenstruktur repräsentiert, welche es ermöglicht, nurrelevante Teile zu rasterisieren. Denn nurdichteTeile derPunktwolke, in denen sichwichtige Informatio- nen über die Struktur befinden, werdenimmer weiter unterteilt.Das ermöglicht es, die Knoten eines gewissen LevelsimOctree zu nehmenund diese als Daten zu rasterisieren. Es gibtviele Anwendungsgebietefür 3D Rekonstruktionen basierendauf Punktwolken. In einemstädtischen Szenario könnendas zumBeispiel ganzeStadtmodelle oder einzelne Gebäudesein. Jedoch wird in dieser Arbeitdie Rekonstruktionvon Gehwegen untersucht. Da beiÜberflutungssimulationen in Städten eineErhöhung vonein paar Zentimetern einengroßen Unterschied machen kann und Wissen über die Bordsteinegeometrie hilft die Simulation genauer zu machen. Bei derGehwegrekonstruktionwird zuerst, basierend auf dersemantischenInformationdes 3D CNNs, die Punktwolkegefiltertund dann werdenPunktmerkmale berechnet, um die Punkte des Bordsteins zu erkennen. Mit diesen Bordsteinpunkten wirddie Geometrie der Bordsteins, Gehweges und derStraße berechnet. Insgesamtwird in dieser Arbeitein Proof-of-Concept Prototypfür semantische Punktwolken- Segmentierung mittels 3D CNNs undbasierend auf dieser einDetektions- undRekon- struktionsalgorithmus für Bordsteine entwickelt.
xi
Abstract
LiDAR devices are abletocapture the physical world very accurately.Therefore, they are often used for 3D reconstruction.Unfortunately,suchdata can become extremely large very quickly and usuallyonly asmallpart of thepoint cloud is of interest.Thus, the pointcloud is filteredbeforehand in order to applyalgorithms only on those points thatare relevant for it.Asemanticinformationabout the points canbeused for sucha filtering.Semanticsegmentation of pointclouds is apopular field of research andhere there has been atrend towards deep learning in recentyears too. However, contrary to images, pointcloudsare unstructured. Hence,point cloudsare often rasterized,but this hastobedone, suchthatthe underlying structure is represented well. In this thesis, a3DConvolutional Neural Network is developedand trained forasemantic segmentation of LiDAR pointclouds. Thereby,apointcloud is represented with an octreedata structure, whichmakes it easy to rasterizeonlyrelevantparts.Since, just dense parts of thepoint cloud,inwhichimportantinformation about thestructure is located, are subdividedfurther. This allowstosimply takenodes of acertainlevel of the octree and rasterize them as datasamples. There are manyapplicationareas for 3D reconstructionsbasedonpoint clouds. In an urbanscenario, thesecan be forexample whole citymodelsorbuildings. However, in this thesis,the reconstruction of sidewalks is explored. Since, forfloodsimulations in cities,an increase in heightofafew centimeterscan makeagreatdifferenceand information about the curbgeometryhelps to makethem more accurate. In the sidewalk reconstruction process,the pointcloud is filtered first, basedonasemanticsegmentation of a3DCNN, and then pointcloud featuresare calculated to detect curbpoints. With these curb points, the geometryofthe curb, sidewalk and streetare computed. Takenall together,this thesis develops aproof-of-conceptprototypefor semanticpoint cloud segmentation using 3D CNNs and based on that,acurb detection and reconstruction algorithm.
xiii
Contents
Kurzfassung xi
Abstract xiii
Contents xv
1Introduction1 1.1 Motivation ...... 1 1.2 Problem Statement...... 2 1.3 Aim of this Thesis ...... 3 1.4 Structure of thisThesis ...... 4
2Related Work 5 2.1 Semantic PointCloud Segmentation...... 5 2.2 CurbDetection ...... 17
3Semantic PointCloudSegmentation 21 3.1 DataExtraction ...... 22 3.2 TrainingDataset...... 26 3.3 Network Learning Basics ...... 28 3.4 Network Architecture ...... 31 3.5 PointCloud Segmentation ...... 34
4Curb Detection and Fitting35 4.1 PointCloud Prefiltering ...... 35 4.2 Feature Extraction ...... 36 4.3 False PositiveFiltering ...... 42 4.4 CurbFitting ...... 45
5Neural Network Training49 5.1 Datasets ...... 49 5.2 Hyperparameter-Tuning ...... 50 5.3 Network Using SmallGrid Size...... 52 5.4 Network Using MiddleGrid Size...... 54
xv 5.5 Network Using BigGrid Size...... 56
6Results 61 6.1 SemanticSegmentation of theSemantic3d Dataset...... 61 6.2 SemanticSegmentation and CurbFitting of theCologneDataset... 69 6.3 Critical Reflection ...... 82
7Conclusion and Future Work 85
AHyperparameterTuning 87
BNetwork Architectures93 B.1Network Using SmallGrid Size...... 93 B.2Network Using MiddleGrid Size...... 95
List of Figures 97
ListofTables 101
Bibliography 103 CHAPTER 1
Introduction
1.1 Motivation
3D reconstruction is rebuildingthe physical world basedoncaptured data.
Reconstructions have abroad field of applications, like preservation of cultural heritage as statues from Michelangelo, relictsfrom Egypt or Buddha statues[GBS14]orserious games about it [MCB+14]. Further use casesare underwater robotic surveys [JRPWM10] or indoor sceneslikeKinectFusion, whichcan be usedfor augmented reality[IKH+11]. Alsoreconstructionsofcities are of great interest,because theycan be used forplanning, city climateornoisepropagationapplications [Bre00]aswellasemergency management, disaster controland formovies or computergames [MWA + 13].
Suchsimple citymodelsare usedinVisdom[Vis], asimulationand visualization application for disaster managementoffloods. Ascreenshot of this software canbeseen in Figure1.1.Theyassist decision making in emergencies of floodscaused by heavy rainfall, tsunamisorotherdisasters.The impact of short-term protections, like sandbags, or long-termprotections canbesimulated in advance to support decisions aboutsafety arrangements.Based on this, emergency personnelcan be trained accordinglyand possibleevacuationplans canbeexplored beforehand.Inaddition, the sewersystem of a city can be added to thesimulationtotaketheircapacity and potential bottlenecks into account.
In case of floods, an increaseinheightofacoupleofcentimetres or ahousedriveway with aslopedowncan makeagreatdifferenceinsafetyarrangements. Therefore, ageometric informationabout drivewaysorsidewalks would be of great interest for theVisdom- teamtomaketheirsimulationevenmore accurate. Therefore, this thesisaddresses the detectionand reconstruction of sidewalk surfaces.
1 1. Introduction
Figure 1.1: Screenshot of theVisdomfloodsimulationsoftware. (Source: [Vis])
1.2 ProblemStatement
Acquiring respectivedata fora3D reconstruction is nowadays probablyaseasy as never before. Various technical instrumentsare available. Mobilephones are abletotake high-resolutionphotoswith meta-data likecameraparameters or GPSlocation. Moreover, photos are not limitedtothe ground, because drones allowsustocapture the world from above too. LiDAR devices scan theirsurroundingwith highprecisionand yield dense pointclouds. This data canthen be combinedwith other information like cadastre data or with datafrom Open Streets Maps,anopensource map with further geospatial tags. Each type of data hasits ownadvantagesand disadvantages. Everyone cantakepho- tos with theirsmartphone, butthe qualityofphotosstrongly depends on thelighting conditionsand are thereforeprone to over- as well as underexposure. Airbornephotogra- phyisnot abletocapture small structures and has to copewith occlusionsfrom trees. Although,they are agoodextensionfor terrestriallaser scanning datatoclose gaps. Photogrammetric pointcloudsare notasfine-grained as LiDAR data, but on the other hand,LiDAR datacan become so huge that it can be challenging to process it efficiently. Geospatial data does notprovide enough informationfor reconstruction,but is agood enhancementtoimprovealgorithms. Therefore, to solve aproblem it is inescapableto use the type of dataaswellasdata structure fitting best to theproblem andtocombine different sources of data. The reconstruction of sidewalks is basedonLiDAR data, because laser scannersare able to capture subtle structures likeanelevation of ten to fifteen centimetres.Laser scan
2 1.3. Aim of this Thesis data of awhole cityhas up to several billion points and curbs are just atinypart of them. It is thereforenot feasibletosearch curb points in the whole pointcloud.This would notonly take manycomputational resources, but also asatisfying curbdetection result would be hard to achieve. Forthe simplereason that by applyingthis problemtothe wholecity, it caneitherbeover-modelled, which would resultinmanyfalsepositivesor under-modelled,whichwouldnot detect manycurb points. Sincesidewalksare located on theground and besides streets, the majorityofpoints can be filtered beforehand to reduce the searcharea.This could either be achieved with some kind of groundplane filtering or by using semantic information. With semantic informationappliedtopoints, they canbefiltered more precisely.For example,acar is on the groundand at or nearby astreettoo,but during the detectionprocess,those carpointswouldlead to manyfalse positivecurb points. Therefore, it would be great to remove car points fromthe point cloud beforehand too. Anotheradvantage is, that this would notonly help to limit the search area for this problem, sincethe semantic information couldalso be usedfor further reconstructions,likebuildings, or for filteringvegetationand other unwelcome points. Hence,sidewalksare located nearroads;informationabout the street network would be another great sourceofdata. OpenStreetMap [Opea]providesitand many further geospatial information. However, the location of those tags is savedinanothercoordinate system as theLiDAR dataand hastobeconverted beforehand.Due to the semantic filtering of thepoint cloud and thecombination with further datasources, thesearch areaisinsomuch restricted, thatdetectingcurb points canbeachievedwith basicspatial featuresand geometry can be generated withsimple algorithms.
1.3Aim of this Thesis
The aim of this thesisistodevelopaproof-of-conceptprototypefor semantic pointcloud segmentation and based on that, acurb detectionand 3D reconstruction algorithm. Convolutionalneural networks areprobably the most commondeeplearning method nowadays.They are used in awide rangeoffields, such as image or speechrecognition, facedetection, text or video analysis and self-driving cars. They are outperformingother supervised classificationmethods by far, because during theconvolutional operations not onlylow-level features are detected,but alsohigh-level featuresand essential structures are distinguished. The essential processes like convolutionorpooling can be easily adaptedtothe three-dimensionalspace. Therefore, asemanticsegmentation of point cloudsisachievedbybuilding and training 3D convolutional neuralnetworks. To decrease memory consumption and training time, the pointcloud willnot just be represented in a uniform gridstructure, but with an octree.The cells of theoctreeinacertainlevel are then voxelized and used forthe segmentation. Forcurb detection, justground points are used,whichcan easily be selected based on the semanticinformation. Multiple geometric features are calculated for eachpointand if a pointfulfilsall features, it is labelledas“curb”.Afterwardsafiltering step removesfalse positive “curb” points and finally geometryisgeneratedfor sidewalks, curbsand street.
3 1. Introduction
Therefore, thework of this thesiscan be divided into twoparts. First, forasemantic segmentation of apointcloud,a3D convolutional neuralnetwork willbedeveloped and trained.Secondly,curb points willbecalculated with geometric pointcloud featuresand basedonthem, geometryfor thecurb, sidewalk and streetwillbecomputed.
1.4Structure of thisThesis
First, an overview about related work in pointcloud segmentation as well as curb detectionisgiven in Chapter 2, where section 2.1 discusses semanticsegmentation based on handcrafted features,artificial neural networks and convolutionalneural networks. Convolutional neuralnetworks are categorized in 2D image segmentation networks,where the pointcloud is projected into images first, and 3D convolutionalneural networks, which aredirectly actingonthe pointcloud.Section 2.2 describes thecurbdetection process in pointcloudsonwhichthe curbdetection steps in this thesisare based on. In thecourse of this process,the pointcloud is prefiltered, then spatialfeatures are calculated and based on them apointisclassified as curb or not.Afterwards, false positivelabelled points are filtered and finally geometryiscomputed. Chapter3describes semantic pointcloud segmentation and therebyaddresses data extraction in Section 3.1, thedataset used for training in Section 3.2, some network learning basics in Section3.3,aswellasthe finalnetwork architecture in Section 3.4. Furthermore, network classification and segmentation of an unknown pointcloud is shortlydepicted in Section 3.5. Chapter4is dedicatedtothe differentsteps of thecurb detection and fitting process. Section 4.1 describes prefiltering of apointcloud and Section 4.2 explainsand visualizes spatial pointcloud features, including the integration of OpenStreetMap data. Section 4.3 addressesfalse positivefilteringand finally,Section 4.4 discusses geometry fitting. Chapter5addressesthe network trainingprocess and itsresults in detail, with the composition of thetraining setinSection 5.1 and thehyperparameter tuninginSection 5.2. Furthermore, threenetworks with differentgrid sizes areevaluated: anetwork using asmallgrid sizeinSection 5.3, anetwork using amiddlegrid sizeinSection 5.4 and a network using abig gridsize in Section 5.5. Chapter6showsand discusses the resultsofthe network training and semantic segmen- tationofthe trainingset in Section 6.1 as well as the semantic segmentation and curb fitting results of real-worlddatainSection 6.2. Acriticalreflection on thework of this thesis is given in Section 6.3 and finally,Chapter7gives aconclusion and shortoutlook into futurework.
4 CHAPTER 2
Related Work
Thischaptergives an overview about related work in pointcloud segmentationand curb detection.Section 2.1 addresses semanticpointcloud segmentation based on handcraftedfeatures as well as artificial neural networks.Therebyafocusisgiven to 2-dimensional convolutionalneural networks, which are mapping thepoint cloud into images, 3-dimensional convolutionalneural networks, whichprocess the pointcloud directly,aswellasspecial for pointcloudsdeveloped optimizations, like tree-based networks for example. Section 2.2 describes research in roadboundary andcurb detectionofself-drivingvehicles.Emphasisisgiven to the pipelineused inthis thesis, where thepoint cloud is filtered first,then candidate points are detected and filtered for false positives, and finally geometry is fitted to thedetected curb points.
2.1Semantic PointCloudSegmentation
Point clouds are aset of unstructuredpoints either calculated by photogrammetryfrom severalphotosorcaptured directly by ascanningdevice. The basic principle of generating photogrammetricpointclouds is computing 2D image- featuresand determining their3Dposition by finding the same feature in multiple overlapping photos. The most commonly usedfeature is theScale Invariant Feature Transform(SIFT), which is invariant to translation, scalingand rotationofthe image [Low04]. Structure-from-Motioncalculates thelocation and orientation of thephotos as well as scene geometry, withoutany prior knowledge about camerapositions, by using bundle adjustmentand suchimage-features.3Dpoints, and thus thescene geometry, are computed by knowledgeacquiredinthis processabout thesame feature in several photos and by using triangulation [WBG+12]. Figure 2.1 shows such aphotogrammetric point cloud.
5 2. RelatedWork
Figure 2.1: Aphotogrammetric pointcloud with location and orientation of thephotos. (Source: adapted from [SKMW17])
LightDetection and Ranging (LiDAR) devices,orcolloquially called laserscanners, are abletocapture the physical world by shooting out laserbeams. Whenalaser beam hits an object,itisreflected and returns to thescanning device. The elapsed time, until the return of thebeam, is measured and based on that,the distance fromthe object to the laser scanner is obtained and thereforethe 3D position of thehit-pointaswell. Point cloudscaptured with LiDAR devices are invariant to illumination, more accurateand denser thanphotogrammetric pointclouds. Suchdevices are abletocapture huge areas, but the resulting pointcloudscould become very largevery quickly. Semantic point cloud segmentation is the assignmentofdiscrete class labels for eachpoint in the pointcloud individually. In this thesisthe main focus is on supervised learning techniques,whereaclassifier learns the correlation of set X := {x1,x2,..., xn} of n data samples to the corresponding set Y := {y1,y2,..., yn} of n groundtruth labels.Atrained classifier takes adata sample xi as inputand predicts aclass label yi,where i is the indexofthe data sample and its according groundtruth value in the sets X and Y respectively. As mentioned above, theset X is consisting of n datasamples, where eachdata sample is an m-dimensional feature vectorbyitself.Therefore, X is nm-dimensional. The set Y encodesthe affiliation of datasamples to c classesand is commonly represented by a one-hot encoding of thecategorical classvalues. Thus, Y is nc-dimensional. One-hotencodingisthe mapping of categorical valuestobinary feature vectors.An exampleisgiven in Table 2.1, where three categories “building”,“vegetation”and “terrain” areeachencodedintoathree-dimensionalbinary feature vector, where every category is assignedtoafixedposition in thevector. Thevalues in afeature vector are zeroeverywhere, exceptfor theposition of therepresentedclass, which hasvalue one. The according feature vectors of our exampleare [1, 0, 0], [0, 1, 0] and [0, 0, 1] respectively. Thismethodisnot restricted to class labels; it is usedtoencode categorical valuesin data samples as well. Sinceanumerical encoding for categories, like in our example1,2
6 2.1. SemanticPointCloud Segmentation and 3, would cause amachine learningalgorithmtorate“terrain”higher as “building”, becauseof3>1and one-hot encoding solves this problem.
categoricalname numerical encoding one-hot encoding building 1 100 vegetation 2 010 terrain 3 001
Table 2.1: An exampleofmappingcategories to numerical valuesand an according one-hot encoding of them.
2.1.1 Handcrafted Features Handcrafted features are calculated for each singlepoint individually.They usually considerinformation aboutapoint’sneighbourhood,whichcan be defined by the number of thenearest k points or acertain radius. Commonly usedfeatures describingthe geometric shapeofapoint’s neighbourhood are based on theeigenvalues λi,where λ1 ≥ λ2 ≥ λ3,ofthe covariance matrix. The covariance matrixΣofapoint p and itsneighbourhood consisting of k points is defined by: 1 k Σ= (p − µ)(p − µ)T k i i i=1 (2.1) 1 k µ = p k i i=1
Suchfeatures -amongst manyothers -are linearity,planarity, sphericity,omnivariance, sumofeigenvalues, curvature and eigenentropy, which are listed in Table2.2 [WJHM15] [HWS16] [TGDM18]. Weinmann et al. [WUH+15]are optimizingtheir pointcloud featuresbychoosing the best neighbourhoodsize for asingle point. This makesfeaturesbetterdistinguishable and thereforeimprovesthe classification. They are calculating theoptimalamountofk nearest neighbours forapoint p individually by building thecovariance matrixofthe neighbourhood around p with an altering k.The best k is chosen based on theminimum Shannon entropy[Sha48]ofthe normalized eigenvaluesorfeatures.Adownside of this approachisthe increasedruntime,since the optimal neighbourhood is computed for eachpointindividually.Hackeletal. [HWS16]are approximatingthe neighbourhood by iteratively downsampling the pointcloud to generate amulti-scale pyramid.The features are then computed perscale with afixed k.This methodcomputesfeatures very fast and concurrently achieves high classificationsresults. Thomas et al. [TGDM18]are using a spherical neighbourhood basedonaradius instead. Theyuse asphericalneighbourhood instead of afixed number of neighbours, because featureswith suchaneighbourhood have aconsistentgeometrical meaningand the number of points in the neighbourhood
7 2. RelatedWork
become afeature by themselves.Besides, classifierwith those featuresare performing better thanwith featurescomputed with afixed amountofpoints.
Linearity λ1−λ2 λ1
Planarity λ2−λ3 λ1
Sphericity λ3 λ1 √ 3 Omnivariance λ1λ2λ3
Sum of eigenvalues λ1 + λ2 + λ3
Curvature λ3 λ1+λ2+λ3