Challenges and Perspectives on Real-Time Singing Voice Synthesis S´Intesede Voz Cantada Em Tempo Real: Desafios E Perspectivas

Total Page:16

File Type:pdf, Size:1020Kb

Challenges and Perspectives on Real-Time Singing Voice Synthesis S´Intesede Voz Cantada Em Tempo Real: Desafios E Perspectivas Revista de Informatica´ Teorica´ e Aplicada - RITA - ISSN 2175-2745 Vol. 27, Num. 04 (2020) 118-126 RESEARCH ARTICLE Challenges and Perspectives on Real-time Singing Voice Synthesis S´ıntesede voz cantada em tempo real: desafios e perspectivas Edward David Moreno Ordonez˜ 1, Leonardo Araujo Zoehler Brum1* Abstract: This paper describes the state of art of real-time singing voice synthesis and presents its concept, applications and technical aspects. A technological mapping and a literature review are made in order to indicate the latest developments in this area. We made a brief comparative analysis among the selected works. Finally, we have discussed challenges and future research problems. Keywords: Real-time singing voice synthesis — Sound Synthesis — TTS — MIDI — Computer Music Resumo: Este trabalho trata do estado da arte do campo da s´ıntese de voz cantada em tempo real, apre- sentando seu conceito, aplicac¸oes˜ e aspectos tecnicos.´ Realiza um mapeamento tecnologico´ uma revisao˜ sistematica´ de literatura no intuito de apresentar os ultimos´ desenvolvimentos na area,´ perfazendo uma breve analise´ comparativa entre os trabalhos selecionados. Por fim, discutem-se os desafios e futuros problemas de pesquisa. Palavras-Chave: S´ıntese de Voz Cantada em Tempo Real — S´ıntese Sonora — TTS — MIDI — Computac¸ao˜ Musical 1Universidade Federal de Sergipe, Sao˜ Cristov´ ao,˜ Sergipe, Brasil *Corresponding author: [email protected] DOI: http://dx.doi.org/10.22456/2175-2745.107292 • Received: 30/08/2020 • Accepted: 29/11/2020 CC BY-NC-ND 4.0 - This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. 1. Introduction voice synthesis embedded systems, through the description of its concept, theoretical premises, main used techniques, latest From a computational vision, the aim of singing voice syn- developments and challenges for future research. This work thesis is to generate a song, given its musical notes and lyrics is organized as follows: Section 2 describes the theoretical [1]. Hence, it is a branch of text-to-speech (TTS) technology requisites that serve as base for singing voice synthesis in gen- [2] with the application of some techniques of musical sound eral; Section 3 discusses about singing synthesis techniques; synthesis. Section 4 presents a technological mapping of the patents An example of application of singing voice synthesizers registered for this area; in Section 5 the systematic review of is in the educational area. Digital files containing the singing literature is shown. Section 6 contains a comparative analysis voice can be easily created, shared and modified in order to among the selected works; Section 7 discusses the challenges facilitate the learning process, which dispenses the human and future tendencies for this field; finally, Section 8 presents presence of a singer as reference or even recordings. a brief conclusion. This kind of synthesis can also be used for artistic pur- poses [3]. Investments on the “career” of virtual singers, like 2. Theoretical framework Hatsue Miku, in Japan, have been made, which includes live The problem domain on singing voice synthesis is multidis- shows where the singing voice is generated by Vocaloid syn- ciplinary: beyond computer science, it depends on concepts thesizer, from Yamaha, and the singer image is projected by from acoustics, phonetics, music theory and signal processing. holograms. The following subsections present some concepts from each The applications on singing voice synthesis technology mentioned knowledge area. have been increased by the development of real-time synthe- sizers, like Vocaloid Keyboard [4], whose virtual singer was 2.1 Elements of Acoustics implemented by an embedded system into a keytar, allowing Sound is a physical phenomenon produced by a variation its user the execution of an instrumental performance. of air pressure levels over time, coming from a vibratory The present article presents a review about real-time singing source, for example, and a guitar string. Due to its undulatory Challenges on Real-time Singing Synthesis nature, sound is measured by physical quantities like period, frequency, and amplitude. Period consists in the duration of a complete wave cycle; frequency is the inverse of period, and indicates the quantity of cycles per second that the sound wave presents; amplitude is the maximum value of the pressure variation in relation to the equilibrium point of the oscillation [5]. The period and amplitude of a simple soundwave can be viewed in Figure 1. Figure 2. Envelope shape and its four phases.[5] 2.2 Sound qualities and music theory In regard of the perception of the sound phenomenon by the human sense of audition, the physical components previously described have a relation with the so-called sound qualities: Figure 1. A simple soundwave. pitch, which permits us to distinguish between bass and treble sounds, is directly proportional to the fundamental fre- The simplest soundwaves are the so-called sinusoids, which quency; consist in a single frequency, but this kind of sound is pro- intensity, which depends on amplitude and allows us to duced neither by nature, nor by conventional musical instru- stablish the difference between strong sounds and weak ones; ments. Such sources generate complex sounds, composed by The timbre, related to the waveshape and its envelope, is several frequencies. The lowest of them is called fundamental the quality that makes possible the perception of each sound frequency. When the values of the other frequencies are mul- source as they have their own “voice”. tiples of the fundamental one, the sound is said to be periodic. The sounds of a piano and a guitar, for example, are dis- When the opposite is true, the sound is called aperiodic. The tinguishable by their timbre, even if they have the same pitch superposition of frequencies results in a waveshape, which is and intensity. The fourth quality of sound is its duration over typical for each sound source. It describes a curve called en- time. velope, obtained from the maximum values of the waveshape It is important to remark that complex periodic sounds oscillation. are usually perceived as they have a definite pitch, which cor- The envelope shape is commonly decomposed into four responds to the fundamental frequency, for these sounds are phases, denominated by the following abbreviation ADSR. called musical sounds. On the other hand, aperiodic sounds ADSR means: do not have a distinguishable pitch and they are denominated as noise, although they are also employed in music, especially (i) attack, which corresponds to the time between the be- in percussion instruments. ginning of the execution of the sound and its maximum Since the development of the music theory in the Western amplitude; World, some ways of selection and organization of musical sounds for artistic purposes are established, as well graphical (ii) decay, the necessary time from the maximum amplitude representations of them, according to their qualities. The to a constant one; sensation of similitude between a certain sound and another one with twice the value of the fundamental frequency of the (iii) sustain, the interval of time where the amplitude keeps first, allowed the division of the range of audible frequency such constant state; and into musical scales, composed by individual sounds called by the music theory as notes. The succession of sounds with (iv) release, from the end of the constant state to the return different pitches or, in other words, the succession of different to silence [5]. musical notes is the part of music denominated melody. The rhythm is another part of music and it can be defined Figure 2 shows a schematic chart with an envelope shape as the movement of sounds regulated by their duration. The and its four phases indicated by its initial letters. duration of notes is measured by and arbitrary unit called R. Inform. Teor.´ Apl. (Online) • Porto Alegre • V. 27 • N. 4 • p.119/126 • 2020 Challenges on Real-time Singing Synthesis tempo, based on the proportion that the notes keep in relation vibrations that result from the obstruction of the airflow by to each other’s durations. The tempos are grouped in equal body parts such as lips or tongue. On the other hand, vowels portions into structures called bars. have a periodic nature. In modern musical notation, melodies are written in two Another difference between vowels and consonants is re- dimensions: symbols with different shapes, called musical lated to the role that each one plays in the syllable, which is figures, indicate the duration of sounds. Their succession is a unit whose definition is difficult to be given, but it can be made horizontally in a staff, which is a set of five lines and described as a sonorous chain composed by acclivities, peaks four spaces that indicate, according to the position that the and declivities of sonority. According to the frame/content musical figures occupy in it vertically, the different pitches. theory [7], speech is organized in syllabic frames, which con- Figure 3 shows a melody written in a staff. sist in cycles of opening and closure of mouth. In each frame, there is a segmental content, the phonemes. This content has Three structures: attack, nucleus, and coda. Consonantal phonemes are located in attack or coda, while a vowel form the syllable nucleus [8]. Furthermore, there are phonemes called semivowels, which are present in diphthongs and can Figure 3. Melody written in a staff. be found in the syllabic boundaries. In an acoustic perspective, the syllable is a waveshape where consonants and semivowels The quality of intensity is treated by a part of music the- occupy the phases of attack (acclivity) and release (decliv- ory called dynamics. It is indicated in musical notation by ity), while vowels are in sustain phase (peak or nucleus). In symbols located below the staff.
Recommended publications
  • UNIVERZITET UMETNOSTI U BEOGRADU FAKULTET MUZIČKE UMETNOSTI Katedra Za Muzikologiju
    UNIVERZITET UMETNOSTI U BEOGRADU FAKULTET MUZIČKE UMETNOSTI Katedra za muzikologiju Milan Milojković DIGITALNA TEHNOLOGIJA U SRPSKOJ UMETNIČKOJ MUZICI Doktorska disertacija Beograd, 2017. Mentor: dr Vesna Mikić, redovni profesor, Univerzitet umetnosti u Beogradu, Fakultet muzičke umetnosti, Katedra za muzikologiju Članovi komisije: 2 Digitalna tehnologija u srpskoj umetničkoj muzici Rezime Od prepravke vojnog digitalnog hardvera entuzijasta i amatera nakon Drugog svetskog rata, preko institucionalnog razvoja šezdesetih i sedamdesetih i globalne ekspanzije osamdesetih i devedesetih godina prošlog veka, računari su prešli dug put od eksperimenta do podrazumevanog sredstva za rad u gotovo svakoj ljudskoj delatnosti. Paralelno sa ovim razvojem, praćena je i nit njegovog „preseka“ sa umetničkim muzičkim poljem, koja se manifestovala formiranjem interdisciplinarne umetničke prakse računarske muzike koju stvaraju muzički inženjeri – kompozitori koji vladaju i veštinama programiranja i digitalne sinteze zvuka. Kako bi se muzički sistemi i teorije preveli u računarske programe, bilo je neophodno sakupiti i obraditi veliku količinu podataka, te je uspostavljena i zajednička humanistička disciplina – computational musicology. Tokom osamdesetih godina na umetničku scenu stupa nova generacija autora koji na računaru postepeno počinju da obavljaju sve više poslova, te se pojava „kućnih“ računara poklapa sa „prelaskom“ iz modernizma u postmodernizam, pa i ideja muzičkog inženjeringa takođe proživljava transformaciju iz objektivističke, sistematske autonomne
    [Show full text]
  • The Race of Sound: Listening, Timbre, and Vocality in African American Music
    UCLA Recent Work Title The Race of Sound: Listening, Timbre, and Vocality in African American Music Permalink https://escholarship.org/uc/item/9sn4k8dr ISBN 9780822372646 Author Eidsheim, Nina Sun Publication Date 2018-01-11 License https://creativecommons.org/licenses/by-nc-nd/4.0/ 4.0 Peer reviewed eScholarship.org Powered by the California Digital Library University of California The Race of Sound Refiguring American Music A series edited by Ronald Radano, Josh Kun, and Nina Sun Eidsheim Charles McGovern, contributing editor The Race of Sound Listening, Timbre, and Vocality in African American Music Nina Sun Eidsheim Duke University Press Durham and London 2019 © 2019 Nina Sun Eidsheim All rights reserved Printed in the United States of America on acid-free paper ∞ Designed by Courtney Leigh Baker and typeset in Garamond Premier Pro by Copperline Book Services Library of Congress Cataloging-in-Publication Data Title: The race of sound : listening, timbre, and vocality in African American music / Nina Sun Eidsheim. Description: Durham : Duke University Press, 2018. | Series: Refiguring American music | Includes bibliographical references and index. Identifiers:lccn 2018022952 (print) | lccn 2018035119 (ebook) | isbn 9780822372646 (ebook) | isbn 9780822368564 (hardcover : alk. paper) | isbn 9780822368687 (pbk. : alk. paper) Subjects: lcsh: African Americans—Music—Social aspects. | Music and race—United States. | Voice culture—Social aspects— United States. | Tone color (Music)—Social aspects—United States. | Music—Social aspects—United States. | Singing—Social aspects— United States. | Anderson, Marian, 1897–1993. | Holiday, Billie, 1915–1959. | Scott, Jimmy, 1925–2014. | Vocaloid (Computer file) Classification:lcc ml3917.u6 (ebook) | lcc ml3917.u6 e35 2018 (print) | ddc 781.2/308996073—dc23 lc record available at https://lccn.loc.gov/2018022952 Cover art: Nick Cave, Soundsuit, 2017.
    [Show full text]
  • Expression Control in Singing Voice Synthesis
    Expression Control in Singing Voice Synthesis Features, approaches, n the context of singing voice synthesis, expression control manipu- [ lates a set of voice features related to a particular emotion, style, or evaluation, and challenges singer. Also known as performance modeling, it has been ] approached from different perspectives and for different purposes, and different projects have shown a wide extent of applicability. The Iaim of this article is to provide an overview of approaches to expression control in singing voice synthesis. We introduce some musical applica- tions that use singing voice synthesis techniques to justify the need for Martí Umbert, Jordi Bonada, [ an accurate control of expression. Then, expression is defined and Masataka Goto, Tomoyasu Nakano, related to speech and instrument performance modeling. Next, we pres- ent the commonly studied set of voice parameters that can change and Johan Sundberg] Digital Object Identifier 10.1109/MSP.2015.2424572 Date of publication: 13 October 2015 IMAGE LICENSED BY INGRAM PUBLISHING 1053-5888/15©2015IEEE IEEE SIGNAL PROCESSING MAGAZINE [55] noVEMBER 2015 voices that are difficult to produce naturally (e.g., castrati). [TABLE 1] RESEARCH PROJECTS USING SINGING VOICE SYNTHESIS TECHNOLOGIES. More examples can be found with pedagogical purposes or as tools to identify perceptually relevant voice properties [3]. Project WEBSITE These applications of the so-called music information CANTOR HTTP://WWW.VIRSYN.DE research field may have a great impact on the way we inter- CANTOR DIGITALIS HTTPS://CANTORDIGITALIS.LIMSI.FR/ act with music [4]. Examples of research projects using sing- CHANTER HTTPS://CHANTER.LIMSI.FR ing voice synthesis technologies are listed in Table 1.
    [Show full text]
  • NEA Grant Search - Data As of 02-10-2020 532 Matches
    NEA Grant Search - Data as of 02-10-2020 532 matches Bay Street Theatre Festival, Inc. (aka Bay Street Theater and Sag Harbor 1853707-32-19 Center for the Arts) Sag Harbor, NY 11963-0022 To support Literature Live!, a theater education program that presents professional performances based on classic literature for middle and high school students. Plays are selected to support the curricula of local schools and New York State learning standards. The program includes talkbacks with the cast, and teachers are provided with free study guides and lesson plans. Fiscal Year: 2019 Congressional District: 1 Grant Amount: $10,000 Category: Art Works Discipline: Theater Grant Period: 06/2019 - 12/2019 Herstory Writers Workshop, Inc 1854118-52-19 Centereach, NY 11720-3597 To support writing workshops in correctional facilities and for public school students. Herstory will offer weekly literary memoir writing workshops for women and adolescent girls in Long Island jails. In addition, the organization's program for young writers will bring students from Long Island and Queens County school districts to college campuses to develop their craft. Fiscal Year: 2019 Congressional District: 1 Grant Amount: $20,000 Category: Art Works Discipline: Literature Grant Period: 06/2019 - 05/2020 Lindenhurst Memorial Library 1859011-59-19 Lindenhurst, NY 11757-5399 To support multidisciplinary performances and public programming in community locations throughout Lindenhurst, New York. Programming will include events such as live performances, exhibitions, local author programs, and other arts activities selected based on feedback from local residents. The library will feature cultural events reflecting the diversity of the area. Fiscal Year: 2019 Congressional District: 2 Grant Amount: $10,000 Category: Challenge America: Arts Discipline: Arts Engagement in American Grant Period: 07/2019 - 06/2020 Engagement in American Communities Communities Quintet of the Americas, Inc.
    [Show full text]
  • What Is an Instrument?
    What is an instrument? • Anything with which we can make music…? • Rhythm, melody, chords, • Pitched or un-pitched sounds… • Narrow definitions vs. Extremely broad definitions Tone Colour or Timbre (pronounced TAM-ber) • Refers to the sound of a note or pitch – Not the highness or lowness of the pitch itself • Different instruments have different timbres – We use words like smooth, rough, sweet, dark – Ineffable? Range • Instruments and voices have a range of notes they can play or sing – Demo guitar and voice • Lowest to highest sounds • Ways to push beyond the standard range Five Categories of Musical Instruments Classification system devised in India in the 3rd or 4th century B.C. 1. Aerophones • Wind instruments, anything using air 2. Chordophones • Stringed instruments 3. Membranophones • Drums with heads 4. Idiophones • Non-drum percussion 5. Electrophones • Electronic sounds 1. Aerophones • Wind instruments, anything using air • Aerophones are generally either: • Woodwind (Doesn’t have to be wood i.e. flute) • Reed (Small piece of wood i.e. saxophone) • Brass (Lip vibration i.e. trumpet) Flute • Woodwind family • At least 30,000 years old (bone) Ex: Claude Debussy – “Syrinx” (1913) https://www.youtube.com/watch?v=C_yf7FIyu1Y Ex: Jurassic 5 – “Flute Loop” (2000) Ex: Van Morrison – “Moondance” (1970) (chorus) Ex: Gil Scott-Heron – “The Bottle” (1974) Ex: Anchorman “Jazz Flute”(0:55) https://www.youtube.com/watch?v=Dh95taIdCo0 Bass Flute • One octave lower than a regular flute Ex: Overture from The Jungle Bookhttps://www.youtube.com/watch?v=UUH42ciR5SA • Other related instruments: • Piccolo (one octave higher than a flute) • Pan flutes • Bone or wooden flutes Accordion • Modern accordion: early 19th C.
    [Show full text]
  • Attuning Speech-Enabled Interfaces to User and Context for Inclusive Design: Technology, Methodology and Practice
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Springer - Publisher Connector Univ Access Inf Soc (2009) 8:109–122 DOI 10.1007/s10209-008-0136-x LONG PAPER Attuning speech-enabled interfaces to user and context for inclusive design: technology, methodology and practice Mark A. Neerincx Æ Anita H. M. Cremers Æ Judith M. Kessens Æ David A. van Leeuwen Æ Khiet P. Truong Published online: 7 August 2008 Ó The Author(s) 2008 Abstract This paper presents a methodology to apply 1 Introduction speech technology for compensating sensory, motor, cog- nitive and affective usage difficulties. It distinguishes (1) Speech technology seems to provide new opportunities to an analysis of accessibility and technological issues for the improve the accessibility of electronic services and soft- identification of context-dependent user needs and corre- ware applications, by offering compensation for limitations sponding opportunities to include speech in multimodal of specific user groups. These limitations can be quite user interfaces, and (2) an iterative generate-and-test pro- diverse and originate from specific sensory, physical or cess to refine the interface prototype and its design cognitive disabilities—such as difficulties to see icons, to rationale. Best practices show that such inclusion of speech control a mouse or to read text. Such limitations have both technology, although still imperfect in itself, can enhance functional and emotional aspects that should be addressed both the functional and affective information and com- in the design of user interfaces (cf. [49]). Speech technol- munication technology-experiences of specific user groups, ogy can be an ‘enabler’ for understanding both the content such as persons with reading difficulties, hearing-impaired, and ‘tone’ in user expressions, and for producing the right intellectually disabled, children and older adults.
    [Show full text]
  • Voice Synthesizer Application Android
    Voice synthesizer application android Continue The Download Now link sends you to the Windows Store, where you can continue the download process. You need to have an active Microsoft account to download the app. This download may not be available in some countries. Results 1 - 10 of 603 Prev 1 2 3 4 5 Next See also: Speech Synthesis Receming Device is an artificial production of human speech. The computer system used for this purpose is called a speech computer or speech synthesizer, and can be implemented in software or hardware. The text-to-speech system (TTS) converts the usual text of language into speech; other systems display symbolic linguistic representations, such as phonetic transcriptions in speech. Synthesized speech can be created by concatenating fragments of recorded speech that are stored in the database. Systems vary in size of stored speech blocks; The system that stores phones or diphones provides the greatest range of outputs, but may not have clarity. For specific domain use, storing whole words or suggestions allows for high-quality output. In addition, the synthesizer may include a vocal tract model and other characteristics of the human voice to create a fully synthetic voice output. The quality of the speech synthesizer is judged by its similarity to the human voice and its ability to be understood clearly. The clear text to speech program allows people with visual impairments or reading disabilities to listen to written words on their home computer. Many computer operating systems have included speech synthesizers since the early 1990s. A review of the typical TTS Automatic Announcement System synthetic voice announces the arriving train to Sweden.
    [Show full text]
  • Observations on the Dynamic Control of an Articulatory Synthesizer Using Speech Production Data
    Observations on the dynamic control of an articulatory synthesizer using speech production data Dissertation zur Erlangung des Grades eines Doktors der Philosophie der Philosophischen Fakultäten der Universität des Saarlandes vorgelegt von Ingmar Michael Augustus Steiner aus San Francisco Saarbrücken, 2010 ii Dekan: Prof. Dr. Erich Steiner Berichterstatter: Prof. Dr. William J. Barry Prof. Dr. Dietrich Klakow Datum der Einreichung: 14.12.2009 Datum der Disputation: 19.05.2010 Contents Acknowledgments vii List of Figures ix List of Tables xi List of Listings xiii List of Acronyms 1 Zusammenfassung und Überblick 1 Summary and overview 4 1 Speech synthesis methods 9 1.1 Text-to-speech ................................. 9 1.1.1 Formant synthesis ........................... 10 1.1.2 Concatenative synthesis ........................ 11 1.1.3 Statistical parametric synthesis .................... 13 1.1.4 Articulatory synthesis ......................... 13 1.1.5 Physical articulatory synthesis .................... 15 1.2 VocalTractLab in depth ............................ 17 1.2.1 Vocal tract model ........................... 17 1.2.2 Gestural model ............................. 19 1.2.3 Acoustic model ............................. 24 1.2.4 Speaker adaptation ........................... 25 2 Speech production data 27 2.1 Full imaging ................................... 27 2.1.1 X-ray and cineradiography ...................... 28 2.1.2 Magnetic resonance imaging ...................... 30 2.1.3 Ultrasound tongue imaging ...................... 33
    [Show full text]
  • Fwg 1010 English
    FWG 1010 ENGLISH UHF Wireless System™ ESPAÑOL FRANÇAIS PORTUGUÊS ITALIANO DEUTSCH OWNER’S MANUAL — p. 1 MANUAL DE INSTRUCCIONES — p. 6 MODE D’EMPLOI — p. 11 MANUAL DO PROPRIETÁRIO — p. 16 MANUALE UTENTE — p. 21 BEDIENUNGSHANDBUCH — S. 26 FWG 1010 Symbols Used Environment The lightning flash with arrow point in an equilateral trian- gle means that there are dangerous voltages present • The power supply unit consumes a small amount of electricity within the unit. even when the unit is switched off. To save energy, unplug the power supply unit from the socket if you are not going to be us- The exclamation point in an equilateral triangle on the ing the unit for some time. equipment indicates that it is necessary for the user to ENGLISH refer to the User Manual. In the User Manual, this symbol • The packaging is recyclable. Dispose of the packaging in an ap- marks instructions that the user must follow to ensure propriate recycling collection system. safe operation of the equipment. • If you scrap the unit, separate the case, electronics and cables and dispose of all the components in accordance with the appro- Safety and Environment priate waste disposal regulations. FCC STATEMENT Safety The transmitter has been tested and found to comply with the limits for a low-power auxiliary station pursuant to Part 74 of the • Do not spill any liquids on the equipment. FCC Rules. The receiver has been tested and found to comply with the limits for a Class B digital device, pursuant to Part 15 of the FCC • Do not place any containers containing liquid on the device or Rules.
    [Show full text]
  • Make It New: Reshaping Jazz in the 21St Century
    Make It New RESHAPING JAZZ IN THE 21ST CENTURY Bill Beuttler Copyright © 2019 by Bill Beuttler Lever Press (leverpress.org) is a publisher of pathbreaking scholarship. Supported by a consortium of liberal arts institutions focused on, and renowned for, excellence in both research and teaching, our press is grounded on three essential commitments: to be a digitally native press, to be a peer- reviewed, open access press that charges no fees to either authors or their institutions, and to be a press aligned with the ethos and mission of liberal arts colleges. This work is licensed under the Creative Commons Attribution- NonCommercial- NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/ by-nc-nd/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, California, 94042, USA. DOI: https://doi.org/10.3998/mpub.11469938 Print ISBN: 978-1-64315-005- 5 Open access ISBN: 978-1-64315-006- 2 Library of Congress Control Number: 2019944840 Published in the United States of America by Lever Press, in partnership with Amherst College Press and Michigan Publishing Contents Member Institution Acknowledgments xi Introduction 1 1. Jason Moran 21 2. Vijay Iyer 53 3. Rudresh Mahanthappa 93 4. The Bad Plus 117 5. Miguel Zenón 155 6. Anat Cohen 181 7. Robert Glasper 203 8. Esperanza Spalding 231 Epilogue 259 Interview Sources 271 Notes 277 Acknowledgments 291 Member Institution Acknowledgments Lever Press is a joint venture. This work was made possible by the generous sup- port of
    [Show full text]
  • A Unit Selection Text-To-Speech-And-Singing Synthesis Framework from Neutral Speech: Proof of Concept 39 II.1 Introduction
    Adding expressiveness to unit selection speech synthesis and to numerical voice production 90) - 02 - Marc Freixes Guerreiro http://hdl.handle.net/10803/672066 Generalitat 472 (28 de Catalunya núm. Rgtre. Fund. ADVERTIMENT. L'accés als continguts d'aquesta tesi doctoral i la seva utilització ha de respectar els drets de ió la persona autora. Pot ser utilitzada per a consulta o estudi personal, així com en activitats o materials d'investigació i docència en els termes establerts a l'art. 32 del Text Refós de la Llei de Propietat Intel·lectual undac F (RDL 1/1996). Per altres utilitzacions es requereix l'autorització prèvia i expressa de la persona autora. En qualsevol cas, en la utilització dels seus continguts caldrà indicar de forma clara el nom i cognoms de la persona autora i el títol de la tesi doctoral. No s'autoritza la seva reproducció o altres formes d'explotació efectuades amb finalitats de lucre ni la seva comunicació pública des d'un lloc aliè al servei TDX. Tampoc s'autoritza la presentació del seu contingut en una finestra o marc aliè a TDX (framing). Aquesta reserva de drets afecta tant als continguts de la tesi com als seus resums i índexs. Universitat Ramon Llull Universitat Ramon ADVERTENCIA. El acceso a los contenidos de esta tesis doctoral y su utilización debe respetar los derechos de la persona autora. Puede ser utilizada para consulta o estudio personal, así como en actividades o materiales de investigación y docencia en los términos establecidos en el art. 32 del Texto Refundido de la Ley de Propiedad Intelectual (RDL 1/1996).
    [Show full text]
  • Design for a New Type of Electronic Musical Instrument
    Reimagining the synthesizer for an acoustic setting; Design for a new type of electronic musical instrument Industrial Design Engineering Bachelor Thesis Arvid Jense s0180831 5-3-2013 University of Twente Assessors: 1st: Ing. P. van Passel 2nd: Dr.Ir. M.C. van der Voort Reimagining the synthesizer for an acoustic setting; Design for a new type of electronic musical instrument By: Arvid Jense S0180831 Industrial Design Engineering University of Twente Presentation: 5-3-2013 Contact: Create Digital Media/Meeblip Betahaus, to Peter Kirn Prinzessinnenstrasse 19-20 BERLIN, 10969 Germany www.creadigitalmusic.com www.meeblip.com Examination committee: Dr.Ir. M.C. van der Voort Ing. P. van Passel Peter Kirn ii iii ABSTRACT A design of a new type of electronic instrument was made to allow usage in a setting previously not suitable for these types of instrument. Following an open-ended design brief, an analysis of the Meeblip market, synthesizer design literature and three case-studies, a new usage scenario was chosen. The scenario describes a situation of spontaneous music creation at an outside location. A rapidly iterating design process produced a wooden, semi-computational operated synthesizer which has an integrated power supply, amplifier and speaker. Care was taken to allow for rich and musical interaction as well as making the sound quality of the instrument on a similar level as acoustic instruments. Keywords: Synthesizer design, Electronics design, Open source, Arduino, Meeblip, Interaction design SAMENVATTING (DUTCH) Een ontwerp is gemaakt voor een nieuw type elektronisch muziek instrument welke gebruikt kan worden in een setting die eerder niet geschikt was voor elektronische instrumenten.
    [Show full text]