2011 IEEE International Conference on Acoustics, Speech and Signal Processing
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011)
Prague, Czech Republic
22 – 27 May 2011
Pages 1-844
IEEE Catalog Number: CFP11ICA-PRT
ISBN: 978-1-4577-0538-0
1/7

TABLE OF CONTENTS

AASP-L1: ACOUSTIC SOURCE SEPARATION I

AASP-L1.1: COMBINING HMM-BASED MELODY EXTRACTION AND NMF-BASED SOFT MASKING FOR SEPARATING VOICE AND ACCOMPANIMENT FROM MONAURAL AUDIO ..... 1
Yun Wang, Zhijian Ou, Tsinghua University, China

AASP-L1.2: ADAPTATION OF SOURCE-SPECIFIC DICTIONARIES IN NON-NEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION ..... 5
Xabier Jaureguiberry, Pierre Leveau, Simon Maller, Juan José Burred, Audionamix, France

AASP-L1.3: AN ACOUSTICALLY-MOTIVATED SPATIAL PRIOR FOR UNDER-DETERMINED REVERBERANT SOURCE SEPARATION ..... 9
Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval, INRIA / Centre de Rennes - Bretagne Atlantique, France

AASP-L1.4: RESOLVING FD-BSS PERMUTATION FOR ARBITRARY ARRAY IN PRESENCE OF SPATIAL ALIASING ..... 13
Jani Even, Norihiro Hagita, ATR, Intelligent Robotics and Communication Laboratories, Japan

AASP-L1.5: A NON-NEGATIVE APPROACH TO SEMI-SUPERVISED SEPARATION OF SPEECH FROM NOISE WITH THE USE OF TEMPORAL DYNAMICS ..... 17
Gautham J. Mysore, Adobe Systems Inc., United States; Paris Smaragdis, University of Illinois Urbana-Champaign, United States

AASP-L1.6: ITAKURA-SAITO NONNEGATIVE MATRIX FACTORIZATION WITH GROUP SPARSITY ..... 21
Augustin Lefevre, Francis Bach, Ecole Normale Superieure, France; Cédric Févotte, CNRS LTCI / Télécom ParisTech, France

AASP-L2: MUSIC SIGNAL PROCESSING I

AASP-L2.1: MULTIPITCH ESTIMATION BY JOINT MODELING OF HARMONIC AND TRANSIENT SOUNDS ..... 25
Jun Wu, The University of Tokyo, Japan; Emmanuel Vincent, INRIA, France; Stanislaw Raczynski, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama, The University of Tokyo, Japan

AASP-L2.2: FREQUENCY SELECTIVE PITCH TRANSPOSITION OF AUDIO SIGNALS ..... 29
Sascha Disch, Fraunhofer Institute for Integrated Circuits (IIS), Germany; Bernd Edler, International Audio Laboratories Erlangen, Germany

AASP-L2.3: IMPROVING MELODY EXTRACTION USING PROBABILISTIC LATENT COMPONENT ANALYSIS ..... 33
Jinyu Han, Northwestern University, United States; Ching-Wei Chen, Gracenote, United States

AASP-L2.4: POLYPHONIC MUSIC TRANSCRIPTION USING NOTE ONSET AND OFFSET DETECTION ..... 37
Emmanouil Benetos, Simon Dixon, Queen Mary University of London, United Kingdom

AASP-L2.5: AUTOMATIC MUSICAL THUMBNAILING BASED ON AUDIO OBJECT LOCALIZATION AND ITS EVALUATION ..... 41
Hiroyuki Nawata, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano, Nara Institute of Science and Technology, Japan

AASP-L2.6: SCORE INFORMED AUDIO SOURCE SEPARATION USING A PARAMETRIC MODEL OF NON-NEGATIVE SPECTROGRAM ..... 45
Romain Hennequin, Bertrand David, Roland Badeau, Institut TELECOM / TELECOM ParisTech, France

AASP-L3: SPATIAL AND MULTICHANNEL SIGNAL PROCESSING

AASP-L3.1: EFFICIENT RANGE EXTRAPOLATION OF HEAD-RELATED IMPULSE RESPONSES BY WAVE FIELD SYNTHESIS TECHNIQUES ..... 49
Sascha Spors, Jens Ahrens, Deutsche Telekom Laboratories, Germany

AASP-L3.2: EFFICIENCY EVALUATION AND ORTHOGONAL BASIS DETERMINATION IN FUNCTIONAL HRTF MODELING ..... 53
Mengqiu Zhang, Rodney A. Kennedy, Thushara D. Abhayapala, Australian National University, Australia

AASP-L3.3: SPATIAL SOUND REPRODUCTION SYSTEMS USING HIGHER ORDER LOUDSPEAKERS ..... 57
Mark Poletti, Industrial Research Ltd, New Zealand; Thushara D. Abhayapala, Australian National University, Australia

AASP-L3.4: CONVERTING 5.1 AUDIO RECORDINGS TO B-FORMAT FOR DIRECTIONAL AUDIO CODING REPRODUCTION ..... 61
Mikko-Ville Laitinen, Ville Pulkki, Aalto University, Finland

AASP-L3.5: AN ANALYTICAL APPROACH TO LOCAL SOUND FIELD SYNTHESIS USING LINEAR ARRAYS OF LOUDSPEAKERS ..... 65
Jens Ahrens, Sascha Spors, Deutsche Telekom Laboratories, Germany

AASP-L3.6: A METHODOLOGY FOR EVALUATING THE ACCURACY OF WAVE FIELD RENDERING TECHNIQUES ..... 69
Antonio Canclini, Politecnico di Milano, Italy; Paolo Annibale, University Erlangen-Nuremberg, Germany; Fabio Antonacci, Augusto Sarti, Politecnico di Milano, Italy; Rudolf Rabenstein, University Erlangen-Nuremberg, Germany; Stefano Tubaro, Politecnico di Milano, Italy

AASP-L4: ECHO CANCELLATION

AASP-L4.1: A PROPORTIONATE ADAPTIVE ALGORITHM WITH VARIABLE PARTITIONED BLOCK LENGTH FOR ACOUSTIC ECHO CANCELLATION ..... 73
Pradeep Loganathan, Emanuel A.P. Habets, Patrick Naylor, Imperial College London, United Kingdom

AASP-L4.2: AN EFFICIENT VARIABLE STEP-SIZE PROPORTIONATE AFFINE PROJECTION ALGORITHM ..... 77
Constantin Paleologu, University Politehnica of Bucharest, Romania; Jacob Benesty, University of Quebec, Canada; Felix Albu, Silviu Ciochina, University Politehnica of Bucharest, Romania

AASP-L4.3: RELATIVE PROPORTIONATE NLMS: IMPROVING CONVERGENCE FOR ACOUSTIC CHANNEL IDENTIFICATION ..... 81
Tao Yu, John Hansen, The University of Texas at Dallas, United States

AASP-L4.4: FOURIER EXPANSION OF HAMMERSTEIN MODELS FOR NONLINEAR ACOUSTIC SYSTEM IDENTIFICATION ..... 85
Sarmad Malik, Gerald Enzner, Ruhr-Universität Bochum, Germany

AASP-L4.5: ROBUST AND LOW-COST CASCADED NON-LINEAR ACOUSTIC ECHO CANCELLATION ..... 89
Moctar Mossi Idrissa, Christelle Yemdji, Nicholas Evans, EURECOM, France; Christophe Beaugeant, Philippe Degry, Infineon, France

AASP-L4.6: SPATIO-TEMPORAL SIGNAL PREPROCESSING FOR MULTICHANNEL ACOUSTIC ECHO CANCELLATION ..... 93
Karim Helwani, Sascha Spors, Herbert Buchner, Deutsche Telekom Laboratories / Technische Universität Berlin, Germany

AASP-L5: MICROPHONE ARRAY SIGNAL PROCESSING

AASP-L5.1: BROADBAND DIRECTION ESTIMATION METHOD UTILIZING COMBINED PRESSURE AND ENERGY GRADIENTS FROM OPTIMIZED MICROPHONE ARRAY ..... 97
Jukka Ahonen, Ville Pulkki, Aalto University, Finland

AASP-L5.2: DESIGN OF ROBUST STEERABLE BROADBAND BEAMFORMERS INCORPORATING MICROPHONE GAIN AND PHASE ERROR CHARACTERISTICS ..... 101
Chiong Ching Lai, Sven Nordholm, Yee Hong Leung, Curtin University, Australia

AASP-L5.3: DIRECTION-OF-ARRIVAL ESTIMATION USING ACOUSTIC VECTOR SENSORS IN THE PRESENCE OF NOISE ..... 105
Dovid Levin, Bar-Ilan University, Israel; Emanuel A.P. Habets, Imperial College London, United Kingdom; Sharon Gannot, Bar-Ilan University, Israel

AASP-L5.4: MICROPHONE POSITION OPTIMIZATION FOR PLANAR SUPERDIRECTIVE BEAMFORMING ..... 109
Ina Kodrasi, University of Oldenburg, Germany; Thomas Rohdenburg, Fraunhofer IDMT, Germany; Simon Doclo, University of Oldenburg, Germany

AASP-L5.5: JOINT DOA AND TDOA ESTIMATION FOR 3D LOCALIZATION OF REFLECTIVE SURFACES USING EIGENBEAM MVDR AND SPHERICAL MICROPHONE ARRAYS ..... 113
Haohai Sun, Norwegian University of Science and Technology, Norway; Edwin Mabande, Konrad Kowalczyk, Walter Kellermann, University of Erlangen-Nuremberg, Germany

AASP-L5.6: ROBUST LOCALIZATION OF MULTIPLE SOURCES IN REVERBERANT ENVIRONMENTS USING EB-ESPRIT WITH SPHERICAL MICROPHONE ARRAYS ..... 117
Haohai Sun, Norwegian University of Science and Technology, Norway; Heinz Teutsch, Avaya Labs, Germany; Edwin Mabande, Walter Kellermann, University of Erlangen-Nuremberg, Germany

AASP-P1: LOUDSPEAKER AND MICROPHONE ARRAY SIGNAL PROCESSING

AASP-P1.1: PERFORMANCE ANALYSIS OF A RANDOMLY SPACED WIRELESS MICROPHONE ARRAY ..... 121
Shmulik Markovich Golan, Sharon Gannot, Bar-Ilan University, Israel; Israel Cohen, Technion / Israel Institute of Technology, Israel

AASP-P1.2: A GENERALIZED DESIGN METHOD FOR DIRECTIVITY PATTERNS OF SPHERICAL MICROPHONE ARRAYS ..... 125
Enzo De Sena, Huseyin Hacihabiboglu, Zoran Cvetkovic, King’s College London, United Kingdom

AASP-P1.3: SIMULATING ROOM IMPULSE RESPONSES FOR SPHERICAL MICROPHONE ARRAYS ..... 129
Daniel Jarrett, Emanuel A.P. Habets, Mark Thomas, Patrick Naylor, Imperial College London, United Kingdom

AASP-P1.4: RESOLVING SPATIAL SAMPLING EFFECTS IN PARAMETRIC DIRECTIONAL FILTERING ..... 133
Markus Kallinger, Michael Buerger, Oliver Thiergart, Fabian Kuech, Dirk Mahne, Fraunhofer Institute for Integrated Circuits (IIS), Germany

AASP-P1.5: A DATA-DRIVEN POST-FILTER DESIGN BASED ON SPATIALLY AND TEMPORALLY SMOOTHED A PRIORI SNR ..... 137
Huajun Yu, Tim Fingscheidt, TU Braunschweig, Germany

AASP-P1.6: DESIGN OF MULTIPOLE LOUDSPEAKER ARRAY BASED ON SPHERICAL HARMONIC EXPANSION ..... 141
Yoichi Haneda, Ken’ichi Furuya, Hiroaki Itou, NTT Corporation, Japan

AASP-P1.7: A WAVENUMBER-FITTING EXTRAPOLATION