Final Thesis 1.14 MB

Total Page:16

File Type:pdf, Size:1020Kb

Final Thesis 1.14 MB VYSOKE´ UCˇ ENI´ TECHNICKE´ V BRNEˇ BRNO UNIVERSITY OF TECHNOLOGY FAKULTA INFORMACˇ NI´CH TECHNOLOGII´ U´ STAV POCˇ ´ITACˇ OVE´ GRAFIKY A MULTIME´ DII´ FACULTY OF INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER GRAPHICS AND MULTIMEDIA HLASEM OVLADANA´ MAPA FIT BAKALA´ Rˇ SKA´ PRA´ CE BACHELOR’S THESIS AUTOR PRA´ CE PETR HUTA´ K AUTHOR BRNO 2012 VYSOKE´ UCˇ ENI´ TECHNICKE´ V BRNEˇ BRNO UNIVERSITY OF TECHNOLOGY FAKULTA INFORMACˇ NI´CH TECHNOLOGII´ U´ STAV POCˇ ´ITACˇ OVE´ GRAFIKY A MULTIME´ DII´ FACULTY OF INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER GRAPHICS AND MULTIMEDIA HLASEM OVLADANA´ MAPA FIT VOICE CONTROLED MAP OF FIT BAKALA´ Rˇ SKA´ PRA´ CE BACHELOR’S THESIS AUTOR PRA´ CE PETR HUTA´ K AUTHOR VEDOUCI´ PRA´ CE Ing. IGOR SZOKE,˝ Ph.D. SUPERVISOR BRNO 2012 Abstrakt Tato bakaláøská práce se zabývá návrhem a tvorbou navigačního systému po areálu Fa- kulty informačních technologií Vysokého učení technického v Brně s hlasovým ovládáním. Vysvětluje metody používané pro hledání nejkratší cesty na mapě, rozpoznávání øeèi a po- pisuje existující navigační systémy s hlasovým ovládáním. Práce je zaměřena především na tvorbu uživatelského rozhraní s hlasovým ovládáním. Cílem hlasového ovládání je, aby použití tohoto navigačního systému bylo přístupné a efektivní na veøejném místě. Hlasové ovládání je realizováno pomocí knihovny BSAPI. Abstract This bachelor's thesis describes the design and developement of navigation system for the Faculty of information technology, Brno university of technology with voice control. It explains the methods used for finding the shortest path on the map, speech recognition and describes existing navigation systems with voice control. The thesis is focused on creating a user interface with voice control. The purpose of voice control is to make usage of this navigation system more accesible and effective in a public place. Speech recognizing is realized with BSAPI library. Klíčová slova Mapa, grafové algoritmy, rozpoznávání øeèi, hlasové ovládání, BSAPI, Qt, C++. Keywords Map, graph algorithms, speech recognition, voice control, BSAPI, Qt, C++. Citace Petr Huták: Hlasem ovladaná mapa FIT, bakaláøská práce, Brno, FIT VUT v Brně, 2012 Hlasem ovladaná mapa FIT Prohlášení Prohla¹uji, že jsem tuto bakaláøskou práci vypracoval samostatně pod vedením pana Ing. Igora Sz}okeho, Ph.D. Uvedl jsem v¹echny literární prameny a publikace, ze kterých jsem èerpal. ....................... Petr Huták 10. května 2012 Poděkování Rád bych poděkoval vedoucímu mé bakaláøské práce panu Ing. Igorovi Sz}okemu, Ph.D. za odborné vedení, rady a jeho èas, které mi pøi tvorbě práce poskytl. c Petr Huták, 2012. Tato práce vznikla jako školní dílo na Vysokém učení technickém v Brně, Fakultě informa- čních technologií. Práce je chráněna autorským zákonem a její užití bez udělení oprávnění autorem je nezákonné, s výjimkou zákonem definovaných případů. Obsah 1 Úvod 4 2 Grafová reprezentace mapy5 2.1 Grafová reprezentace...............................5 2.2 Algoritmus A*..................................5 2.3 Dijkstrùv algoritmus...............................7 3 Rozpoznávání øeèi9 3.1 Rozdělení rozpoznávání øeèi...........................9 3.2 Dynamické borcení èasu { DTW........................9 3.3 Skryté Markovské Modely { HMM.......................9 3.4 Použitý rozpoznávaè { knihovna BSAPI.................... 10 4 Existující řešení 11 4.1 Navigon Plus 70................................. 11 4.2 Informační dotyková tabule nákupního centra................. 12 4.3 Další systémy s hlasovým ovládáním...................... 12 5 Návrh programu 13 5.1 Požadavky na navigační systém......................... 13 5.2 Knihovna pro tvorbu GUI............................ 13 5.2.1 wxWidgets................................ 13 5.2.2 Qt..................................... 14 5.3 Knihovny pro práci s grafy........................... 15 5.3.1 Knihovna Boost............................. 15 5.3.2 Další knihovny pro práci s grafy.................... 16 5.4 Knihovny pro zpracování zvuku......................... 16 5.4.1 knihovna PortAudio ........................... 17 5.4.2 knihovna libsndfile .......................... 17 5.5 Objektový návrh aplikace............................ 17 5.6 Návrh uživatelského rozhraní.......................... 18 5.7 Návrh formátu uložení grafu........................... 19 6 Implementace 21 6.1 Pøevod mapy z informáčního systému..................... 21 6.2 Tvorba seznamù klíčových slov......................... 22 6.3 Popis implementace tříd navigačního systému................. 22 6.3.1 Třída MainWindow ............................ 22 1 6.3.2 Třída MapObjects ............................ 24 6.3.3 Třída Ui MainWindow .......................... 25 6.3.4 Třída MapSearch ............................. 25 6.3.5 Třída MapSettingsDialog a MapStartPosition ........... 26 6.3.6 Třída MapAudioRecorder ........................ 27 6.3.7 Třídy MapRecognizer,MapRecognizerTarget a SErrorCallBackI . 28 6.3.8 Třída MapWidget ............................. 29 7 Testování 31 7.1 Metodika a podmínky testování......................... 31 7.2 Zhodnocení testování............................... 31 7.3 Postøehy uživatelù................................ 31 8 Závěr 32 A Kompilace zdrojových souborù 35 A.1 Požadavky..................................... 35 A.2 Kompilace..................................... 35 B Návod k použití 36 B.1 Spuštění a ovládání................................ 36 B.1.1 Základní hlasové příkazy......................... 36 B.1.2 Zadání jména a přijmení......................... 36 B.1.3 Korekce jména a příjmení........................ 37 B.1.4 Výběr osoby............................... 37 B.1.5 Výběr poschodí.............................. 37 C Obsah CD 38 2 Seznam obrázkù 2.1 příklad budovy..................................5 2.2 budova reprezentovaná grafem.........................6 4.1 GPS navigace Navigon Plus 70......................... 11 4.2 Informační tabule v nákupním centru, autor: Pavel ©evela / Wikimedia Com- mons [19]..................................... 12 5.1 Zobrazení principu èinnosti aplikace...................... 14 5.2 wxWidgets { příklad vykonání obsluhy stisku tlačítka [13].......... 15 5.3 Qt { princip signálù a slotù [9]......................... 16 5.4 Zpracování zvukového vstupu knihovnami PortAudio a libsndfile .... 17 5.5 Objektový návrh aplikace............................ 18 5.6 Návrh rozmístění prvkù uživatelského rozhraní................ 19 6.1 Interakce mezi třídami aplikace......................... 23 6.2 Třída MainWindow ................................ 24 6.3 Pøechody mezi stavy třídy MainWindow .................... 24 6.4 Třída MapObjects ................................ 25 6.5 Grafické uživatelské rozhraní aplikace..................... 26 6.6 Třída MapSearch ................................. 26 6.7 Třída MapAudioRecorder ............................ 27 6.8 Třídy MapRecognizer,MapRecognizerTarget a SErrorCallBackI ..... 28 6.9 Mapa vykreslená třídou MapWidget ...................... 30 3 Kapitola 1 Úvod Jak jistě mnozí zažili, není jednoduché se orientovat v neznámém prostředí. Například pøi příchodu do rozlehlého, nám neznámého, areálu budov není snadné udržet správnou orientaci, následně pak pøi hledání konretní osoby na konkrétním místě, a někdy i v kon- krétním èase, èasto propadáme panice a jsme rádi za každou radu èi pomoc pøi nasměrování k vytouženému cíli. V těchto případech jsou obèas k nápomoci plánky, směrové tabule, èi kolemjdoucí lidé. Snaha ulehèit si život v¹ak doprovází lidstvo od prvopoèátku, a tak i v ob- lasti navigačních systémù jsme od směrových ukazatelù dospěli do bodu, kdy nám dostupné technologie umožňují na základě vstupních informací vybrat nejideálnější trasu k na¹emu cíli. V případě navigačních systémù budov umístěných na veøejném místě pro využití ¹iro- kou věøejností nám pak vznikají nová specifika. Èasto okolní prostředí, èi situace nedovoluje uživatelùm tohoto systému zadávat vstupní informace klasickým zpùsobem, tedy pøes klá- vesnici, a nabízejí se tak možnosti pro využití technologií dotykových obrazovek, èi pro èlověka pøirozené hlasové komunikace, která je předmětem i této práce. Cílem bakaláøské práce byla implementace navigačního systému pro areál Fakulty in- formačních technologií Vysokého učení technického v Brně, přičemž ovládání tohoto sys- tému se děje prostřednictím hlasových příkazù zadaných uživatelem. Prvním úkolem bylo navrhnout vhodný zpùsob datové reprezentace mapy a informací o osobách. To v¹e za úèe- lem pozdějšího vyhledávání osob, místoností a optimálních cest nebo efektivního zobrazení mapy v grafické podobě. Následující fáze se týkala návrhu a implementace grafického uži- vatelského rozhraní spravujícího zobrazení mapy areálu fakulty s možností vyhledání cesty na základě definovaného poèátku a cíle. Implementace systému proběhla v programovacím jazyce C++. Po dokončení této fáze a otestování dosavadních výsledků pøi¹lo na øadu pøi- pojení hlasového ovládání, kdy pro rozpoznání ovládacích příkazù bylo využito knihovny BSAPI, kterou vyvinula výzkumná skupina Speech@FIT. Na závěr byl systém podroben otestování, pøi kterém byla hodnocena především efektivita hlasového ovládání. Celá práce je rozčleněna do osmi kapitol. Druhá kapitola (2) se ve struènosti zabývá grafovou reprezentací mapy a algoritmy pro nalezení nejkratší cesty v grafu. Následující kapitola (3) je věnována rozpoznávání øeèi. Jsou zde uvedeny základní metody rozpoznávání a popis rozpoznávaèe použitého v této práci. Kapitola (4) dává náhled na aktuální navigační systémy využívající ovládání hlasem.
Recommended publications
  • Porting Portaudio API on ASIO
    GRAME - Computer Music Research Lab. Technical Note - 01-11-06 Porting PortAudio API on ASIO Stéphane Letz November 2001 Grame - Computer Music Research Laboratory 9, rue du Garet BP 1185 69202 FR - LYON Cedex 01 [email protected] Abstract This document describes a port of the PortAudio API using the ASIO API on Macintosh and Windows. It explains technical choices used, buffer size adaptation techniques that guarantee minimal additional latency, results and limitations. 1 The ASIO API ASIO (Audio Streaming Input Ouput) is an API defined and proposed by Steinberg. It addesses the area of efficient audio processing, synchronization, low latency and extentibility on the hardware side. ASIO allows the handling of multi-channel professional audio cards, and different sample rates (32 kHz to 96 kHz), different sample formats (16, 24, 32 bits of 32/64 floating point formats). ASIO is available on MacOS and Windows. 2 PortAudio API PortAudio is a library that provides streaming audio input and output. It is a cross-platform API that works on Windows, Macintosh, Linux, SGI, FreeBSD and BeOS. This means that programs that need to process or generate an audio signal, can run on several different computers just by recompiling the source code. PortAudio is intended to promote the exchange of audio synthesis software between developers on different platforms. 3 Technical choices Porting PortAudio to ASIO means that some technical choices have to be made. The life cycle of the ASIO driver must be “mapped” to the life cycle of a PortAudio application. Each PortAudio function will be implemented using one or more ASIO functions.
    [Show full text]
  • Python-Sounddevice Release 0.4.2
    python-sounddevice Release 0.4.2 Matthias Geier 2021-07-18 Contents 1 Installation 2 2 Usage 3 2.1 Playback................................................3 2.2 Recording...............................................4 2.3 Simultaneous Playback and Recording................................4 2.4 Device Selection...........................................4 2.5 Callback Streams...........................................5 2.6 Blocking Read/Write Streams.....................................6 3 Example Programs 6 3.1 Play a Sound File...........................................6 3.2 Play a Very Long Sound File.....................................8 3.3 Play a Very Long Sound File without Using NumPy......................... 10 3.4 Play a Web Stream.......................................... 12 3.5 Play a Sine Signal........................................... 14 3.6 Input to Output Pass-Through..................................... 15 3.7 Plot Microphone Signal(s) in Real-Time............................... 17 3.8 Real-Time Text-Mode Spectrogram................................. 19 3.9 Recording with Arbitrary Duration.................................. 21 3.10 Using a stream in an asyncio coroutine............................... 23 3.11 Creating an asyncio generator for audio blocks........................... 24 4 Contributing 26 4.1 Reporting Problems.......................................... 26 4.2 Development Installation....................................... 28 4.3 Building the Documentation..................................... 28 5 API Documentation
    [Show full text]
  • The Open Master Hearing Aid (Openmha) Application Engineers' Manual
    The Open Master Hearing Aid (openMHA) 4.16.1 Application Engineers’ Manual © 2005-2021 by HörTech gGmbH, Marie-Curie-Str. 2, D–26129 Oldenburg, Germany The Open Master Hearing Aid (openMHA) – Application Engineers’ Manual HörTech gGmbH Marie-Curie-Str. 2 D–26129 Oldenburg iii LICENSE AGREEMENT This file is part of the HörTech Open Master Hearing Aid (openMHA) Copyright © 2005 2006 2007 2008 2009 2010 2012 2013 2014 2015 2016 HörTech gGmbH. Copyright © 2017 2018 2019 2020 2021 HörTech gGmbH. openMHA is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, version 3 of the License. openMHA is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License, version 3 for more details. You should have received a copy of the GNU Affero General Public License, version 3 along with openMHA. If not, see <http://www.gnu.org/licenses/>. © 2005-2021 HörTech gGmbH, Oldenburg Contents 1 Introduction 1 1.1 Structure........................................ 1 1.2 Platform Services and Conventions......................... 2 2 The openMHA configuration language4 2.1 Structure of the openMHA configuration language................. 4 2.2 Communication between openMHA Plugins .................... 7 3 The openMHA host application8 3.1 Invocation of ’mha’ .................................. 8 3.2 Configuration variables of the openMHA host application............. 10 3.3 States of the openMHA host application ...................... 11 3.4 Audio abstraction layer................................ 11 4 GNU Octave/MATLAB tools 15 4.1 "mhactl_wrapper" - openMHA control interface for GNU Octave and MATLAB .
    [Show full text]
  • The Pulseaudio Sound Server Linux.Conf.Au 2007
    Introduction What is PulseAudio? Usage Internals Recipes Outlook The PulseAudio Sound Server linux.conf.au 2007 Lennart Poettering [email protected] Universit¨atHamburg, Department Informatik University of Hamburg, Department of Informatics Hamburg, Germany January 17th, 2007 Lennart Poettering The PulseAudio Sound Server 2 What is PulseAudio? 3 Usage 4 Internals 5 Recipes 6 Outlook Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction Lennart Poettering The PulseAudio Sound Server 3 Usage 4 Internals 5 Recipes 6 Outlook Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction 2 What is PulseAudio? Lennart Poettering The PulseAudio Sound Server 4 Internals 5 Recipes 6 Outlook Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction 2 What is PulseAudio? 3 Usage Lennart Poettering The PulseAudio Sound Server 5 Recipes 6 Outlook Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction 2 What is PulseAudio? 3 Usage 4 Internals Lennart Poettering The PulseAudio Sound Server 6 Outlook Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction 2 What is PulseAudio? 3 Usage 4 Internals 5 Recipes Lennart Poettering The PulseAudio Sound Server Introduction What is PulseAudio? Usage Internals Recipes Outlook Contents 1 Introduction 2 What is PulseAudio? 3 Usage 4 Internals 5 Recipes 6 Outlook Lennart Poettering The PulseAudio Sound Server Introduction What is PulseAudio? Usage Internals Recipes Outlook Who Am I? Student (Computer Science) from Hamburg, Germany Core Developer of PulseAudio, Avahi and a few other Free Software projects http://0pointer.de/lennart/ [email protected] IRC: mezcalero Lennart Poettering The PulseAudio Sound Server Introduction What is PulseAudio? Usage Internals Recipes Outlook Introduction Lennart Poettering The PulseAudio Sound Server It’s a mess! There are just too many widely adopted but competing and incompatible sound systems.
    [Show full text]
  • Mixxx User Manual Release 2.1.0
    Mixxx User Manual Release 2.1.0 The Mixxx Development Team Apr 15, 2018 Contents 1 Introduction to Mixxx 1 1.1 How Mixxx Works............................................1 2 Getting Started 3 2.1 Opening Mixxx..............................................3 2.2 Importing your audio files........................................3 2.3 Analyze your library...........................................4 2.4 Configuring sound input and output...................................4 3 An Overview of the Mixxx Interface7 3.1 The Deck Sections............................................8 3.2 The Mixer Section............................................ 17 3.3 The Sampler Section........................................... 20 3.4 The Effects Section............................................ 22 3.5 The Microphone Section......................................... 22 3.6 Preview Deck Section.......................................... 22 4 The Mixxx Library 24 4.1 Overview of the Library features..................................... 24 4.2 Tracks - View and edit your whole collection.............................. 25 4.3 Loading Tracks.............................................. 28 4.4 Finding Tracks (Search)......................................... 29 4.5 Previewing Tracks............................................ 30 4.6 Edit metadata of audio files....................................... 30 4.7 Auto DJ - Automate your mix...................................... 33 4.8 Playlists - Arranging tracks in a set order................................ 33 4.9 Crates -
    [Show full text]
  • Audiomath Documentation Release 1.16.1
    audiomath Documentation Release 1.16.1 Jeremy Hill Sep 25, 2021 Table of Contents 1 Installation 3 1.1 Compatibility and requirements.....................................3 1.2 Normal installation............................................3 1.3 Advanced installation (from version-controlled sources)........................4 2 How to. 5 2.1 Read a sound from a file into memory..................................5 2.2 Write a sound from memory into a file.................................5 2.3 Create a sound from scratch, in memory.................................6 2.4 Define a functionally-generated sound..................................6 2.5 Perform simple arithmetic........................................7 2.6 Extract a segment of a sound.......................................7 2.7 Extract selected channels from multi-channel sounds..........................8 2.8 Concatenate sounds in time (splicing)..................................8 2.9 Stack channels (multiplexing)......................................9 2.10 Pitch-shift or time-stretch a sound.................................... 10 2.11 Preprocess a sound using SoX...................................... 10 2.12 Manipulate a sound in other miscellaneous ways............................ 10 2.13 Plot a sound............................................... 11 2.14 Play sounds................................................ 11 2.15 Play sounds with more-precise latency, via PsychPortAudio .................... 14 2.16 Record a sound into memory....................................... 15 2.17 Record
    [Show full text]
  • Linux Audio Conference 2019
    Proceedings of the Linux Audio Conference 2019 March 23rd – 26th, 2019 Center for Computer Research in Music and Acoustics (CCRMA) Stanford University, USA “In Ping(uins) e trust! Published by CCRMA, Stanford University, California, US March 2019 All copyrights remain with the authors http://lac.linuxaudio.org/2019 ISBN 978-0-359-46387-9 Credits Layout: Frank Neumann and Romain Michon Typesetting: LATEX and pdfLaTeX Logo Design: The Linuxaudio.org logo and its variations copyright Thorsten Wilms c 2006, imported into "LAC 2014" logo by Robin Gareus Thanks to: Martin Monperrus for his webpage "Creating proceedings from PDF files" ii Partners and Sponsors Linuxaudio.org iii iv Foreword Welcome everyone to LAC 2019 at CCRMA! For the second time in its seventeen year history, the Linux Audio Conference (LAC) is hosted in the United Stated of America by the Center for Computer Research in Mu- sic and Acoustics (CCRMA) at Stanford University. With its informal workshop-like at- mosphere, LAC is a blend of scientific and technical papers, tutorials, sound installations, and concerts centered on the free GNU/Linux operating system and open-source free soft- ware for audio, multimedia, and musical applications. LAC is a unique platform during which members of this community gather to exchange ideas, draft new projects, and see old friends. In these times of increasing political tensions and of rising extremism throughout the world, we believe that emphasizing and promoting the universality of this type of event is of the utmost importance. The Linux audio community exists worldwide; we believe it should remain a priority to diversify LAC’s geographical location from year to year for the benefit of those who can’t afford to travel to the other side of the world.
    [Show full text]
  • Purebasic Vs Freepascal
    Purebasic vs freepascal click here to download Most mentioned BASIC dialects are not as multi platform as Lazarus or even Freepascal except Purebasic and Realbasic. Logged. Today I downloaded and installed Lazarus (based on Free Pascal) just to . Most of these extra libs for PureBasic are created in C/C++ or are Read DirectUIHWND class window and. But, if you want write native applications, not requiring a third-party dlls and platforms, or native and cross- platform applications, you can't use. I downloaded both Lazarus & PureBasic and tried both and they look OK. I can not decide I love the Laz IDE, it's much faster than www.doorway.ru and. If I could just install a file (like a mdb, or maybe a Firebird DB file), thats use this for its awesome platform support - have a look at PureBasic. There is a language out there called pure basic, which is actually the . just like I can see why people liked Delphi or FreePascal; it's a nicer. Today I'm writing about PureBasic, a portable programming language, that works on Compared to Lazarus which uses freepascal this is small. just use out of the box, e.g. Threading, DLL creation (or sharedobjects), XML, JSON, Sqlite. The following tables list notable software packages that are nominal IDEs; standalone tools . PureBasic · Fantaisie Software · Proprietary, Yes, Yes, Yes, AmigaOS .. Free Pascal IDE, Volunteers, / February 15, , Yes, Yes, Yes . PyQt · GPLv3 "or later", Yes, until version and since version , Yes, since. In my honest opinion, there is not any language which is the easiest to start with, you could start with C++ or with PureBasic, or with C.
    [Show full text]
  • Aalborg Universitet Towards an Open Digital Audio Workstation for Live
    Aalborg Universitet Towards an open digital audio workstation for live performance Dimitrov, Smilen DOI (link to publication from Publisher): 10.5278/vbn.phd.engsci.00028 Publication date: 2015 Document Version Publisher's PDF, also known as Version of record Link to publication from Aalborg University Citation for published version (APA): Dimitrov, S. (2015). Towards an open digital audio workstation for live performance: the development of an open soundcard. Aalborg Universitetsforlag. (Ph.d.-serien for Det Teknisk-Naturvidenskabelige Fakultet, Aalborg Universitet). DOI: 10.5278/vbn.phd.engsci.00028 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. ? Users may download and print one copy of any publication from the public portal for the purpose of private study or research. ? You may not further distribute the material or use it for any profit-making activity or commercial gain ? You may freely distribute the URL identifying the publication in the public portal ? Take down policy If you believe that this document breaches copyright please contact us at [email protected] providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from vbn.aau.dk on: April 30, 2017 THE DEVELOPMENT OF AN OPEN SOUNDCARD THE DEVELOPMENT OF FOR LIVE PERFORMANCE: AUDIO WORKSTATION AN OPEN DIGITAL TOWARDS TOWARDS AN OPEN DIGITAL AUDIO WORKSTATION FOR LIVE PERFORMANCE: THE DEVELOPMENT OF AN OPEN SOUNDCARD BY SMILEN DIMITROV DISSERTATION SUBMITTED 2015 SMILEN DIMITROV Towards an open digital audio workstation for live performance: the development of an open soundcard Ph.D.
    [Show full text]
  • Mixxx User Manual Izdaja 2.2.2
    Mixxx User Manual Izdaja 2.2.2 The Mixxx Development Team 27 apr., 2021 Contents 1 Uvod v Mixxx 1 1.1 Delovanje Mixxx-a ............................................ 1 2 Kako začeti 3 2.1 Zagon Mixxx ............................................... 3 2.2 Uvoz zvočnih datotek ........................................... 3 2.3 Analizira vašo knjižnico ......................................... 4 2.4 Nastavitve vhodnih in izhodnih naprav .................................. 5 3 Pregled Mixxx vmesnika 7 3.1 Section Skin Settings ........................................... 8 3.2 Razdelki s predvajalnikom ........................................ 8 3.3 The Mixer Section ............................................ 18 3.4 The Sampler Section ........................................... 22 3.5 Razdelek Efekti .............................................. 24 3.6 The Microphone Section ......................................... 24 3.7 Preview Deck Section .......................................... 25 4 Mixxx knjižnica 26 4.1 Pregled funkciji knjižnice ......................................... 26 4.2 Tracks - View and edit your whole collection ............................... 27 4.3 Loading Tracks .............................................. 30 4.4 Finding Tracks (Search) ......................................... 31 4.5 Previewing Tracks ............................................ 32 4.6 Edit metadata of audio files ........................................ 32 4.7 Auto DJ - Automate your mix ...................................... 35 4.8 Playlists - Arranging
    [Show full text]
  • Ultragrid Update
    ULTRAGRID UPDATE Miloš Liška CESNET April 2nd 2018 NPAPW19, Prague UltraGrid Platform Technology an affordable platform for very high-quality interactive video (up to 8K) and audio transmissions use of commodity (gaming) hardware Linux and Windows PC and Mac OS platforms commodity video capture cards commodity GPU cards commodity sound cards any reasonable network as low latency as possible on commodity hardware open-source software, BSD (GPL) license User support, community M. Liška, CESNET, 2/20 UltraGrid News Development towards UltraGrid 1.6 243 files changed, 30150 insertions(+), 34950 deletions(-) Audio reimplementation (including experiments with ASIO backend) New GPUJPEG, arithmetic coder in progress NDI support, AJA display, 12-bit video support, SMPTE VC-5 Cineform support GUI improvements Development towards 360 video support Cameras such as Panasonic 360 Live Cam already available, with limited capabilities and huge latency (398.4±16.6 ms) M. Liška, CESNET, 3/20 Community Update – Opravdoví OPRAVDOVÍ Networked Performance Premiéra 26.9.2018 Trutnov (UFFO) Plzeň (Moving Station) M. Liška, CESNET, 4/20 Community Update – APAN M. Liška, CESNET, 5/20 Community Update – APAN M. Liška, CESNET, 6/20 Community Update – ZHdK TwinLab Performance M. Liška, CESNET, 7/20 Community Update – ZHdK M. Liška, CESNET, 8/20 Community Update – SFJAZZ M. Liška, CESNET, 9/20 Multiplatform Audio Transmissions This won’t be about using SDI/HDMI embedded or analog audio with video capture cards (Blackmagic’s etc.) Always works as expected Audio synchronized with video frames Audio latency corresponds to the video framerate (and is higher) 60 fps corresponds to 16.6 ms of latency (not end-to-end at all) M.
    [Show full text]
  • A FAUST Tutorial Etienne Gaudrain, Yann Orlarey
    A FAUST Tutorial Etienne Gaudrain, Yann Orlarey To cite this version: Etienne Gaudrain, Yann Orlarey. A FAUST Tutorial. manual, 2003. hal-02158895 HAL Id: hal-02158895 https://hal.archives-ouvertes.fr/hal-02158895 Submitted on 18 Jun 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. A Faust Tutorial Grame, 9 rue du Garet, 69001 Lyon by Etienne Gaudrain, Yann Orlarey September 10, 2003 Special thanks to Samuel Bousard for the support, the reading and the corrections. Introduction Faust is a programming language designed for the creation of stream processors. The name Faust stands for Functional audio stream, and sug- gests both a musical and a devilish (!) side. It has been developed by Grame since 2000 [OFL02], to help programmers, but also composers and various sound hackers to build audio stream processors. The Faust language is based on functional programming paradigms [Hug89], but the Faust compiler pro- duces C++ code that is optimized by a λ-abstraction-based mechanism. The portability of C++ entails the portability of Faust if one can bring the good wrapper files that will make the C++ code compilable as a standalone application on your system.
    [Show full text]