Music Similarity: Learning Algorithms and Applications

Total Page:16

File Type:pdf, Size:1020Kb

Music Similarity: Learning Algorithms and Applications UC San Diego UC San Diego Electronic Theses and Dissertations Title More like this : machine learning approaches to music similarity Permalink https://escholarship.org/uc/item/8s90q67r Author McFee, Brian Publication Date 2012 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, SAN DIEGO More like this: machine learning approaches to music similarity A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Brian McFee Committee in charge: Professor Sanjoy Dasgupta, Co-Chair Professor Gert Lanckriet, Co-Chair Professor Serge Belongie Professor Lawrence Saul Professor Nuno Vasconcelos 2012 Copyright Brian McFee, 2012 All rights reserved. The dissertation of Brian McFee is approved, and it is ac- ceptable in quality and form for publication on microfilm and electronically: Co-Chair Co-Chair University of California, San Diego 2012 iii DEDICATION To my parents. Thanks for the genes, and everything since. iv EPIGRAPH I’m gonna hear my favorite song, if it takes all night.1 Frank Black, “If It Takes All Night.” 1Clearly, the author is lamenting the inefficiencies of broadcast radio programming. v TABLE OF CONTENTS Signature Page................................... iii Dedication...................................... iv Epigraph.......................................v Table of Contents.................................. vi List of Figures....................................x List of Tables.................................... xi Acknowledgements................................. xii Vita......................................... xiv Abstract of the Dissertation............................. xvi Chapter 1 Introduction.............................1 1.1 Music information retrieval..................1 1.2 Summary of contributions..................1 1.3 Preliminaries.........................2 Chapter 2 Learning multi-modal similarity..................5 2.1 Introduction..........................5 2.1.1 Contributions.....................8 2.1.2 Preliminaries.....................9 2.2 A graphical view of similarity................ 10 2.2.1 Similarity graphs................... 12 2.2.2 Graph simplification................. 12 2.3 Partial order embedding.................... 14 2.3.1 Linear projection................... 14 2.3.2 Non-linear projection via kernels........... 16 2.3.3 Connection to GNMDS................ 19 2.4 Multiple kernel embedding.................. 20 2.4.1 Unweighted combination............... 21 2.4.2 Weighted combination................ 21 2.4.3 Concatenated projection............... 23 2.4.4 Diagonal learning................... 25 2.5 Experiments.......................... 28 2.5.1 Toy experiment: taxonomy embedding....... 28 2.5.2 Musical artist similarity............... 30 vi 2.6 Hardness of dimensionality reduction............ 38 2.7 Conclusion.......................... 39 2.A Embedding partial orders................... 41 2.B Solver............................. 42 2.C Relationship to AUC..................... 44 Chapter 3 Metric learning to rank....................... 46 3.1 Introduction.......................... 46 3.1.1 Related work..................... 47 3.1.2 Preliminaries..................... 48 3.2 Structural SVM review.................... 48 3.2.1 Optimization..................... 49 3.2.2 Ranking with structural SVM............ 50 3.3 Metric learning to rank.................... 51 3.3.1 Algorithm....................... 52 3.3.2 Implementation.................... 52 3.4 Ranking measures....................... 54 3.4.1 AUC......................... 55 3.4.2 Precision-at-k ..................... 55 3.4.3 Average Precision................... 56 3.4.4 Mean Reciprocal Rank................ 57 3.4.5 Normalized Discounted Cumulative Gain...... 57 3.5 Experiments.......................... 57 3.5.1 Classification on UCI data.............. 58 3.5.2 eHarmony data.................... 59 3.6 Conclusion.......................... 62 Chapter 4 Faster structural metric learning.................. 63 4.1 Introduction.......................... 63 4.2 Structural metric learning................... 64 4.3 Alternating direction optimization.............. 67 4.3.1 Dual optimization................... 68 4.3.2 Multiple kernel projection.............. 70 4.3.3 Implementation details................ 71 4.4 Experiments.......................... 72 4.4.1 UCI data....................... 72 4.4.2 Multimedia data................... 76 4.5 Conclusion.......................... 77 4.A Derivation of eq. (4.11).................... 79 4.B Axis-aligned learning..................... 80 vii Chapter 5 Similarity from a collaborative filter................ 82 5.1 Introduction.......................... 82 5.1.1 Related work..................... 84 5.1.2 Contributions..................... 85 5.2 Learning similarity...................... 86 5.2.1 Collaborative filters.................. 87 5.3 Audio representation..................... 88 5.3.1 Codebook training.................. 89 5.3.2 (Top-τ) Vector quantization............. 89 5.3.3 Histogram representation and distance........ 91 5.4 Experiments.......................... 92 5.4.1 Data.......................... 93 5.4.2 Procedure....................... 94 5.4.3 Comparisons..................... 96 5.5 Results............................ 99 5.6 Conclusion.......................... 105 Chapter 6 Large-scale similarity search.................... 107 6.1 Introduction.......................... 107 6.2 Related work......................... 108 6.3 Spatial trees.......................... 109 6.3.1 Maximum variance KD-tree............. 110 6.3.2 PCA-tree....................... 111 6.3.3 2-means........................ 111 6.3.4 Random projection.................. 112 6.3.5 Spill trees....................... 112 6.3.6 Retrieval algorithm and analysis........... 113 6.4 Experiments.......................... 115 6.4.1 Audio representation................. 116 6.4.2 Representation evaluation.............. 117 6.4.3 Tree evaluation.................... 117 6.4.4 Retrieval results.................... 119 6.4.5 Timing results.................... 120 6.5 Conclusion.......................... 121 Chapter 7 Modeling playlists......................... 123 7.1 Introduction.......................... 123 7.2 A brief history of playlist evaluation............. 124 7.2.1 Human evaluation.................. 124 7.2.2 Semantic cohesion.................. 125 7.2.3 Sequence prediction................. 125 7.3 A natural language approach................. 126 7.3.1 Evaluation procedure................. 127 viii 7.4 Playlist dialects........................ 128 7.5 Hyper-graph random walks.................. 129 7.5.1 The user model.................... 129 7.5.2 The playlist model.................. 130 7.5.3 Learning the weights................. 131 7.6 Data collection........................ 132 7.6.1 Playlists: Art of the Mix 2011............ 132 7.6.2 Edge features..................... 133 7.7 Experiments.......................... 135 7.7.1 Experiment 1: Does dialect matter?......... 135 7.7.2 Experiment 2: Do transitions matter?........ 136 7.7.3 Experiment 3: Which features matter?........ 136 7.7.4 Example playlists................... 139 7.8 Conclusion.......................... 140 Appendix A Optimizing Average Precision................... 145 A.1 Cutting plane optimization of AP............... 145 A.2 Most violated constraint DP................. 146 A.2.1 DP algorithm..................... 147 A.2.2 Complexity...................... 148 Appendix B Online k-Means........................... 150 B.1 Introduction.......................... 150 B.2 Hartigan’s method and online k-means............ 151 ix LIST OF FIGURES Figure 2.1: An overview of multi-modal feature integration...........8 Figure 2.2: Graphical representation of relative comparisons........... 11 Figure 2.3: Two variants of multiple kernel-embedding.............. 25 Figure 2.4: The label taxonomy for the experiment in section 2.5.1........ 29 Figure 2.5: Experimental results for section 2.5.1................. 30 Figure 2.6: aset400 embedding results for each of the base kernels........ 34 Figure 2.7: aset400 embedding results with multiple-kernel embedding..... 35 Figure 2.8: The example-kernel weighting learned by algorithm 2.4....... 36 Figure 2.9: t-SNE visualizations of the learned embedding of aset400...... 37 Figure 2.10: The effect of constraint processing on embedding accuracy..... 38 Figure 2.11: A constraint set that cannot be embedded in R1............ 40 Figure 3.1: Dimensionality reduction for UCI data sets.............. 60 Figure 4.1: Comparison of ITML, LMNN, and MLR on UCI data........ 74 Figure 4.2: Effective rank of learned metrics with noisy dimensions....... 75 Figure 4.3: Comparison of projections for MLR-ADMM and MLR-Proj..... 75 Figure 4.4: Comparison of MLR-ADMM and MLR-Proj for music similarity.. 78 Figure 5.1: Query-by-example retrieval...................... 83 Figure 5.2: Vector quantization: hard and top-τ.................. 90 Figure 5.3: Schematic diagram of audio similarity learning and retrieval..... 93 Figure 5.4: Retrieval accuracy with vector quantized audio representations... 100 Figure 5.5: The effective dimensionality of audio codeword histograms..... 101 Figure 5.6: t-SNE visualization of optimized music similarity space....... 102 Figure 5.7: Comparison of VQ-based audio similarity to alternative methods.. 104 Figure 6.1: Splitting data with a spatial partition tree............... 109
Recommended publications
  • CNPEM – Campus Map
    1 2 CNPEM – Campus Map 3 § SUMMARY 11 Presentation 12 Organizers | Scientific Committee 15 Program 17 Abstracts 18 Role of particle size, composition and structure of Co-Ni nanoparticles in the catalytic properties for steam reforming of ethanol addressed by X-ray spectroscopies Adriano H. Braga1, Daniela C. Oliveira2, D. Galante2, F. Rodrigues3, Frederico A. Lima2, Tulio R. Rocha2, 4 1 1 R. J. O. Mossanek , João B. O. Santos and José M. C. Bueno 19 Electro-oxidation of biomass derived molecules on PtxSny/C carbon supported nanoparticles A. S. Picco,3, C. R. Zanata,1 G. C. da Silva,2 M. E. Martins4, C. A. Martins5, G. A. Camara1 and P. S. Fernández6,* 20 3D Studies of Magnetic Stripe Domains in CoPd Multilayer Thin Films Alexandra Ovalle1, L. Nuñez1, S. Flewett1, J. Denardin2, J.Escrigr2, S. Oyarzún2, T. Mori3, J. Criginski3, T. Rocha3, D. Mishra4, M. Fohler4, D. Engel4, C. Guenther5, B. Pfau5 and S. Eisebitt6. 1 21 Insight into the activity of Au/Ti-KIT-6 catalysts studied by in situ spectroscopy during the epoxidation of propene reaction A. Talavera-López *, S.A. Gómez-Torres and G. Fuentes-Zurita 22 Nanosystems for nasal isoniazid delivery: small-angle x-ray scaterring (saxs) and rheology proprieties A. D. Lima1, K. R. B. Nascimento1 V. H. V. Sarmento2 and R. S. Nunes1 23 Assembly of Janus Gold Nanoparticles Investigated by Scattering Techniques Ana M. Percebom1,2,3, Juan J. Giner-Casares1, Watson Loh2 and Luis M. Liz-Marzán1 24 Study of the morphology exhibited by carbon nanotube from synchrotron small angle X-ray scattering 1 2 1 1 Ana Pacheli Heitmann , Iaci M.
    [Show full text]
  • MODULE 11: GLOSSARY and CONVERSIONS Cell Engines
    Hydrogen Fuel MODULE 11: GLOSSARY AND CONVERSIONS Cell Engines CONTENTS 11.1 GLOSSARY.......................................................................................................... 11-1 11.2 MEASUREMENT SYSTEMS .................................................................................. 11-31 11.3 CONVERSION TABLE .......................................................................................... 11-33 Hydrogen Fuel Cell Engines and Related Technologies: Rev 0, December 2001 Hydrogen Fuel MODULE 11: GLOSSARY AND CONVERSIONS Cell Engines OBJECTIVES This module is for reference only. Hydrogen Fuel Cell Engines and Related Technologies: Rev 0, December 2001 PAGE 11-1 Hydrogen Fuel Cell Engines MODULE 11: GLOSSARY AND CONVERSIONS 11.1 Glossary This glossary covers words, phrases, and acronyms that are used with fuel cell engines and hydrogen fueled vehicles. Some words may have different meanings when used in other contexts. There are variations in the use of periods and capitalization for abbrevia- tions, acronyms and standard measures. The terms in this glossary are pre- sented without periods. ABNORMAL COMBUSTION – Combustion in which knock, pre-ignition, run- on or surface ignition occurs; combustion that does not proceed in the nor- mal way (where the flame front is initiated by the spark and proceeds throughout the combustion chamber smoothly and without detonation). ABSOLUTE PRESSURE – Pressure shown on the pressure gauge plus at- mospheric pressure (psia). At sea level atmospheric pressure is 14.7 psia. Use absolute pressure in compressor calculations and when using the ideal gas law. See also psi and psig. ABSOLUTE TEMPERATURE – Temperature scale with absolute zero as the zero of the scale. In standard, the absolute temperature is the temperature in ºF plus 460, or in metric it is the temperature in ºC plus 273. Absolute zero is referred to as Rankine or r, and in metric as Kelvin or K.
    [Show full text]
  • Thermodynamics Introduction and Basic Concepts
    Thermodynamics Introduction and Basic Concepts by Asst. Prof. Channarong Asavatesanupap Mechanical Engineering Department Faculty of Engineering Thammasat University 2 What is Thermodynamics? Thermodynamics is the study that concerns with the ways energy is stored within a body and how energy transformations, which involve heat and work, may take place. Conservation of energy principle , one of the most fundamental laws of nature, simply states that “energy cannot be created or destroyed” but energy can change from one form to another during an energy interaction, i.e. the total amount of energy remains constant. 3 Thermodynamic systems or simply system, is defined as a quantity of matter or a region in space chosen for study. Surroundings are physical space outside the system boundary. Surroundings System Boundary Boundary is the surface that separates the system from its surroundings 4 Closed, Open, and Isolated Systems The systems can be classified into (1) Closed system consists of a fixed amount of mass and no mass may cross the system boundary. The closed system boundary may move. 5 (2) Open system (control volume) has mass as well as energy crossing the boundary, called a control surface. Examples: pumps, compressors, and water heaters. 6 (3) Isolated system is a general system of fixed mass where no heat or work may cross the boundaries. mass No energy No An isolated system is normally a collection of a main system and its surroundings that are exchanging mass and energy among themselves and no other system. 7 Properties of a system Any characteristic of a system is called a property.
    [Show full text]
  • Insights Into the Unification of Forces
    Insights into the unification of forces John A. Macken Previously unknown simple equations are presented which show a close relationship between the gravitational force and the electromagnetic force. For example, the gravitational force can be expressed as the square of the electromagnetic force for a fundamental set of conditions. These equations also imply that the wave properties of particles are an important component in the generation of these forces. These insights contradict previously held assumptions about gravity. In response to the question “Which of our basic physical assumptions are wrong?” this article proposes that the following are erroneous assumptions about gravity: 1 Gravity is not a true force. For example, the standard model does not include gravity and general relativity characterizes gravity as resulting from the geometry of spacetime. Therefore general relativity does not consider gravity to be a true force. 2 Gravity was united with the other forces at the start of the Big Bang but today gravity is completely decoupled from the other forces. For example, at the start of the Big Bang fundamental particles could have energy close to Planck energy and the gravitational force between two such particles was comparable to the electrostatic force. However, today the gravitational force is vastly weaker than the other forces and is assumed to be completely different than the other forces. 3 The forces are transferred by messenger particles. For example, the electromagnetic force is believed to be transferred by virtual photons and the gravitational force is believed by many physicists to be transferred by gravitons. Gravity has always been the most mysterious of the forces 1 ‐ 3.
    [Show full text]
  • Fast, Computer Supported Experimental Determination of Absolute Zero Temperature at School
    European J of Physics Education Volume 5 Issue 1 2013 Bogacz & Pedziwiatr Fast, Computer Supported Experimental Determination of Absolute Zero Temperature at School Bogdan F. Bogacz Antoni T. Pędziwiatr M. Smoluchowski Institute of Physics Department of Methodics of Teaching and Methodology of Physics Jagiellonian University Reymonta 4, 30-059 Cracow, Poland E-mail: [email protected] (Received: 14.11. 2013, Accepted: 31.12.2013) Abstract A simple and fast experimental method of determining absolute zero temperature is presented. Air gas thermometer coupled with pressure sensor and data acquisition system COACH is applied in a wide range of temperature. By constructing a pressure vs temperature plot for air under constant volume it is possible to obtain - by extrapolation to zero pressure - a reasonable value of absolute zero temperature. The proposed way of conducting experiment allows students to "discover" intuitively the existence of minimal possible temperature and thus to understand the reason for introducing the concept of absolute zero temperature and absolute temperature Kelvin scale. It is a convincing and straightforward method to enhance understanding the general concept of temperature and support teaching thermodynamics and heat - mainly at secondary schools. This experiment was used with our university students who are being prepared to teach physics in secondary schools. They prepared physics lesson based on this material and then discussed didactic value of this material both for pupils and for teachers. Keywords: Physics education, absolute temperature, absolute zero determination, air gas thermometer. Introduction Temperature is a thermodynamic concept characterizing the state of thermal equilibrium of a macroscopic system. Equality of temperatures is a necessary and sufficient condition for reaching thermal equilibrium.
    [Show full text]
  • Norms and Facts in Measurement
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Lirias Fundamenta/ measurement concepts Norms and facts in measurement J. van Brakel Foundations of Physics, Physics Laboratory, State University of Utrecht, PO Box 80.000, 3508 TA Utrecht, The Netherlands Publications concerned with the foundations of measurement often accept uncritically the theory/observation and the norm/fact distinction. However, measurement is measurement-in-a-context. This is analysed in the first part of the paper. Important aspects of this context are: the purpose of the particular measurement; everything we know; everything we can do; and the empirical world. Dichotomies, such as definition versus empirica//aw, are by themselves not very useful because the 'rules of measure- ment' are facts or norms depending on the context chosen or considered. In the second part of the paper, three methodological premises for a theory of measurement are advocated: reduce arbitrariness, against fundamentalism, and do not be afraid of realism. The problem of dichotomies • Extensive quantities are additive (mass, velocity, utility) • The empirical relation > (for mass, length, temperature, In the literature on the foundations of measurement, a ...) is transitive (Schleichert, 1966; Carnap, 1926; number of dichotomies are used in describing properties of Janich et al, 1974) statements about various aspects of measurement. One of • The existence of an absolute unit for velocity (a maxi- the most common distinctions is that between a convention mum velocity) (Wehl, 1949) and an empirical fact, or between an (a priori) definition • The existence of an absolute zero for temperature and an (a posterior/) empirical law. Some other dichotomies that roughly correspond to the two just mentioned are These are examples of statements for which it is not given in Table 1.
    [Show full text]
  • Lect. 3 : Molec Energy, Temperature, and Pure Subst. Topic 1: Molecular
    ME 10.311 Thermo I Lect 3.docx Page 1 of 4 Lect. 3 : Molec Energy, Temperature, and Pure Subst. Delivery Notes: Class L.O.: 1. Initially draw the pressure scale 1. Able to describe what Energy is to a RU freshmen and describe three diagram, before primary forms of energy, from a molec. level. class starts 2. Able to describe what the temperature equality and zeroth law mean 3. Able to describe various temperature scales Temperature Scale “An upside down F” 4. Able to explain the temperature and volume behavior of a pure substance under constant pressure. 5. Able to interpret the various phase regions on a phase diagram 6. Able to explain and calculate quality Class M.Map: P_abs = P_atm +P_gauge 2. Lect 2 quick review. ALE = active learning exercise. TPS = think pair shair ALE Rowan Power Tour Minute Paper: Two ideas that “stuck” in your mind. Å Minute Paper Topic 1: Molecular forms of energy General Comments Regarding Energy • Energy = ability to produce an effect ( it’s a scalar!) How do you think • Energy is a mathematical abstraction – it only exists as it relates to other energy – at the variables we can measure – KE or PE, for example molecular level in a fluid - is stored? Î Chambers w/ Diff. Temperatures before & after mixing ME 10.311 Thermo I Lect 3.docx Page 2 of 4 PRIMARY FORMS OF INTERNAL ENERGY, symbol U or u Draw. monatomic: helium, neon, argon, krypton, xenon and radon. What forms of energy come to mind? hydrogen, nitrogen, oxygen, and carbon monoxide. 1.
    [Show full text]
  • Measuring Temperature
    PhyzGuide: Measuring Temperature THE THERMOSCOPE AND THE THERMOMETER Galileo is given credit for developing the first temperature-measuring device toward the end of the 1500s. Air in a tube with a bulb at the top is heated. The open end of the tube is immersed in water or wine. As the air in the tube cools, it contracts and is drawn into the tube. Once an equilibrium point is established, a rise in temperature increases the volume of the gas, forcing fluid back down; a fall in temperature reduces the volume of the gas, drawing more HOT COLD fluid into the tube. This device was Galileo’s thermoscope. Early in the 1700’s, Gabriel Daniel Fahrenheit developed a more reliable temperature measuring device. A thin glass tube holds a volume of mercury in a bulb at the bottom. When heated, the mercury expands upward in the tube. The tube could be sealed and allowed for accurate, reproducible temperature readings. Because this device could be reproduced and marked with a numerical scale, it was called a thermometer. FAHRENHEIT AND CELSIUS: UNRELATED RELATIVE SCALES As inventor of a reliable thermometer, Fahrenheit was afforded the luxury of developing a temperature scale to his liking. He defined a temperature scale on which 0 was the coldest temperature he could reproduce in the lab (the temperature of an ice-salt mixture) and 96 as human body temperature.The fact that 96 is divisible by 2, 3, and 4 made it easy to mark off increments on the glass tube between 0 and 96 once those endpoints were established.
    [Show full text]
  • Lesson 2: Temperature
    Lesson 2: Temperature Notes from Prof. Susskind video lectures publicly available on YouTube 1 Units of temperature and entropy Let’s spend a few minutes talking about units, in partic- ular the Boltzmann constant kB, a question about which last chapter ended with. What is the Boltzmann constant? Like many constants in physics, kB is a conversion factor. The speed of light is a conversion factor from distances to times. If something is going at the speed of light, and in a time t covers a distance x, then we have x = ct (1) And of course if we judiciously choose our units, we can set c = 1. Typically these constants are conversion factors from what we can call human units to units in some sense more fun- damental. Human units are convenient for scales and mag- nitudes of quantities which people can access reasonably easily. We haven’t talked about temperature yet, which is the sub- ject of this lesson. But let’s also say something about the units of temperature, and see where kB – which will simply be denoted k when there is no possible confusion – enters the picture. Units of temperature were invented by Fahren- heit1 and by Celsius2, in both cases based on physical phe- 1Daniel Gabriel Fahrenheit (1686 - 1736), Polish-born Dutch physicist and engineer. 2Anders Celsius (1701 - 1744), Swedish physicist and astronomer. 2 nomena that people could easily monitor. The units we use in physics laboratories nowadays are Kelvin3 units, which are Celsius or centrigrade degrees just shifted by an additive constant, so that the 0 °C of freezing water is 273:15 K and 0 K is absolute zero.
    [Show full text]
  • Quantum Theory Without Planck's Constant Arxiv:1203.5557V1 [Hep-Ph]
    Quantum Theory without Planck’s Constant John P. Ralston Department of Physics & Astronomy The University of Kansas, Lawrence KS 66045 Abstract Planck’s constant was introduced as a fundamental scale in the early history of quantum mechanics. We find a modern approach where Planck’s constant is absent: it is unobservable except as a con- stant of human convention. Despite long reference to experiment, re- view shows that Planck’s constant cannot be obtained from the data of Ryberg, Davisson and Germer, Compton, or that used by Planck himself. In the new approach Planck’s constant is tied to macroscopic conventions of Newtonian origin, which are dispensable. The pre- cision of other fundamental constants is substantially improved by eliminating Planck’s constant. The electron mass is determined about 67 times more precisely, and the unit of electric charge determined 139 times more precisely. Improvement in the experimental value of the fine structure constant allows new types of experiment to be compared towards finding “new physics.” The long-standing goal of eliminating reliance on the artifact known as the International Proto- type Kilogram can be accomplished to assist progress in fundamental arXiv:1203.5557v1 [hep-ph] 25 Mar 2012 physics. 1 The Evolution of Physical Constants More than a century after Planck’s work, why are we still using Planck’s constant? The answer seems self-evident. Planck’s discovery led to quan- tum mechanics, which explains experimental data. Yet there exists a pic- 1 ture of quantum theory where Planck’s constant is spurious. It cannot by found by fitting quantum mechanics to data.
    [Show full text]
  • Habitat Mapping in Rugged Terrain Using Multispectral Ikonos Images
    06-088.qxd 10/11/08 3:48 AM Page 1325 Habitat Mapping in Rugged Terrain Using Multispectral Ikonos Images Janet Nichol and Man Sing Wong Abstract et al., 1993). Even with digital photography, large area Due to the significant time and cost requirements of coverage would require orthorectification of individual traditional mapping techniques, accurate and detailed images, resulting in potentially prohibitive costs. habitat maps for large areas are uncommon. This study The first ever habitat survey covering the whole 1,100 investigates the application of Ikonos Very High Resolution km2 of Hong Kong at 1:20 000 scale was undertaken by (VHR) images to habitat mapping in the rugged terrain of the Worldwide Fund for Nature in 1993 (Ashworth et al., Hong Kong’s country parks. A required mapping scale of 1993), using a combination of fieldwork and air photo 1:10 000, a minimum map object size of 150 m2 on the interpretation. The resulting accuracy was low (R. Corlett, ground, and a minimum accuracy level of 80 percent were personal communication), and the maps were little used. set as the mapping standards. Very high quality aerial Another such project, the 1:10 000 scale habitat survey photographs and digital topographic maps provided of Northumberland National Park, which was part of the accurate reference data for the image processing and Phase I national survey of Great Britain (Walton, 1993), was habitat classification. A comparison between manual carried out by air photo interpretation and required a total stereoscopic aerial photographic interpretation and image of 717 people-days. The exercise was deemed too expensive classification using pixel-based and object-based classifiers to be repeated (Mehner et al., 2004) given the low accuracy was carried out.
    [Show full text]
  • Ground Plane Based Absolute Scale Estimation for Monocular Visual Odometry Dingfu Zhou, Yuchao Dai, Member, IEEE, and Hongdong Li, Member, IEEE
    IEEE TRANSACTIONS ON XXXX XXXX 1 Ground Plane based Absolute Scale Estimation for Monocular Visual Odometry Dingfu Zhou, Yuchao Dai, Member, IEEE, and Hongdong Li, Member, IEEE Abstract—Recovering absolute metric scale from a monocular absolute scale, at least one piece of metric information is camera is a challenging but highly desirable problem for monoc- required. This cue may come from prior scene knowledge ular camera-based systems. By using different kinds of cues, var- (e.g., camera height, object size, vehicle speed, baseline of ious approaches have been proposed for scale estimation, such as camera height, object size etc. In this paper, firstly, we summarize stereo camera etc.) or from other sensors, such as IMU (Inertial different kinds of scale estimation approaches. Then, we propose Measurement Unit) or GPS (Global Positioning System) etc. If a robust divide and conquer absolute scale estimation method two cameras are provided, i.e. forming a binocular system, the based on the ground plane and camera height by analyzing the absolute scale can be recovered based on the baseline of two advantages and disadvantages of different approaches. By using cameras. Alternatively, the absolute scale can also be recovered the estimated scale, an effective scale correction strategy has been proposed to reduce the scale drift during the Monocular Visual by using IMU, GPS [5], [6] or wheel odometry [7], etc. Odometry (VO) estimation process. Finally, the effectiveness and Furthermore, prior geometric knowledge, such as the camera robustness of the proposed method have been verified on both height, object size, has also been employed for absolute scale public and self-collected image sequences.
    [Show full text]