Speech Recognition: the Interpretation of Training and Using Speech Recognition Software from the Perspectives of Postsecondary Students with Learning Challenges

Total Page:16

File Type:pdf, Size:1020Kb

Speech Recognition: the Interpretation of Training and Using Speech Recognition Software from the Perspectives of Postsecondary Students with Learning Challenges University of Northern Iowa UNI ScholarWorks Dissertations and Theses @ UNI Student Work 2006 Speech recognition: The interpretation of training and using speech recognition software from the perspectives of postsecondary students with learning challenges Delann Soenksen University of Northern Iowa Let us know how access to this document benefits ouy Copyright ©2006 Delann Soenksen Follow this and additional works at: https://scholarworks.uni.edu/etd Part of the Disability and Equity in Education Commons, and the Higher Education Commons Recommended Citation Soenksen, Delann, "Speech recognition: The interpretation of training and using speech recognition software from the perspectives of postsecondary students with learning challenges" (2006). Dissertations and Theses @ UNI. 765. https://scholarworks.uni.edu/etd/765 This Open Access Dissertation is brought to you for free and open access by the Student Work at UNI ScholarWorks. It has been accepted for inclusion in Dissertations and Theses @ UNI by an authorized administrator of UNI ScholarWorks. For more information, please contact [email protected]. SPEECH RECOGNITION: THE INTERPRETATION OF TRAINING AND USING SPEECH RECOGNITION SOFTWARE FROM THE PERSPECTIVES OF POSTSECONDARY STUDENTS WITH LEARNING CHALLENGES A Dissertation Submitted In Partial Fulfillment Of the Requirements for the Degree Doctor of Education Approved: Dr. Sandra Alper, Chair Dr. Deborah Gallagher,, Committee Member IDr. John Henning, Committed Member A '' 71/dasrr*.__________ Dr. Lauren Nelson, Committee Member Dr. Michael Waggoner, Committee Member Delann Soenksen University of Northern Iowa May 2006 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. UMI Number: 3208608 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 3208608 Copyright 2006 by ProQuest Information and Learning Company. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. ProQuest Information and Learning Company 300 North Zeeb Road P.O. Box 1346 Ann Arbor, Ml 48106-1346 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. SPEECH RECOGNITION: THE INTERPRETATION OF TRAINING AND USING SPEECH RECOGNITION SOFTWARE FROM THE PERSPECTIVES OF POSTSECONDARY STUDENTS WITH LEARNING CHALLENGES An Abstract of a Dissertation Submitted In Partial fulfillment Of the Requirements for the Degree Doctor of Education Approved: ndra Alper, Committee chaDr. Sandrandra Alper,Alper, Committee chaDr. Di Dean of the Graduate College Delann Soenksen University of Northern Iowa May 2006 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. ABSTRACT This study examined the utilization of speech recognition programs by five college students with learning difficulties. A qualitative approach was implemented over the course of two semesters, Fall 2003 and Spring 2004, to: (a) ascertain students’ perspectives on speech recognition, and (b) understand how students used speech recognition programs as a tool to meet the academic demands of postsecondary education. Data collection for this study included interviews, participant observations, field notes, videotape, and course assignments. This study provided qualitative data regarding an assistive technology tool that has traditionally been studied within a quantitative paradigm. Environment, individual characteristics, and life demands were found to impact student use of their speech recognition program. Implications for users, educational professionals, and researchers are presented in the final chapter of this paper. Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. ACKNOWLEDGEMENTS I would like to thank several individuals for their assistance in making this project a reality. First, to the participants in this study who shared their insights. Without their willingness and cooperation, this project would never have been completed. It is to them that I owe a great amount of gratitude. Second, I would like to thank my committee members for their assistance and guidance. I specifically, would like to thank Dr. Alper for the numerous opportunities I have been provided throughout my graduate program. The experiences I have received have contributed to my knowledge and growth as a professional. I thank the remaining committee members, Dr. Gallagher, Dr. Henning, Dr. Nelson, and Dr. Waggoner for their encouragement, support, expertise, and willingness to serve as committee members. I would also like to extend a thank you to many of my friends for their support and encouraging words throughout my graduate program and research. I also want to express my gratitude to those who read and assisted with the editing of my paper. In addition, I owe a lot of gratitude to my husband, Joel. The moral support, patience, and encouraging words whether it was from here or afar, have made my goals a reality. The completion of this project demonstrates that persistence, love, and faith can endure all obstacles and challenges. This research was, in part, funded by Congressional Appropriation Award #P116Z90009 administered by the Fund for Improvement in Postsecondary Education (FIPSE), #R215K000043 administered by the Fund for Improvement in Education (FIE), Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. and H324E012001 administered by the Office of Special Education and Rehabilitative Services (OSERS). No official endorsement of funding agencies is implied. Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. IV TABLE OF CONTENTS PAGE LIST OF FIGURES...............................................................................................................xi CHAPTER 1: INTRODUCTION.........................................................................................1 Scenario 1 .......................................................................................................................... 1 Scenario 2 .......................................................................................................................... 1 Scenario 3 ..........................................................................................................................2 My Experience................................................................................................................. 4 Dragon NaturallySpeaking ................................................................................... 4 iListen..........................................................................................................................7 ViaVoice .................................................................................................................. 9 Researcher’s Perspective ............................................................................................ 13 Initial Assumption ....................................................................................................... 16 Problem Statement......................................................................................................... 16 Purpose of the Study ...................................................................................................... 18 Research Questions ..................................................................................................... 19 Significance of the Study .............................................................................................. 19 Definition of Terms ....................................................................................................... 20 CHAPTER 2: LITERATURE REVIEW .......................................................................... 25 Speech Recognition Software ................................................................................... 25 Speech Recognition Research ................................................................................... 27 Speech Impairments ................................................................................................27 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. PAGE Controlling the Environment ................................................................................. 30 English as Second Language ................................................................................. 31 Academic Outcomes ............................................................................................ 32 Work Related Outcomes ..................................................................................... 40 Summary of Speech Recognition Research Literature .............................................41 Participants...............................................................................................................41 Independent Variables...........................................................................................
Recommended publications
  • It's Little Wonder That Longtime Windows Users Are Migrating in Droves to the New Mac
    Switching to the Mac: The Missing Manual, Tiger Edition By Adam Goldstein, David Pogue ............................................... Publisher: O'Reilly Pub Date: September 2005 ISBN: 0-596-00660-8 Pages: 520 Table of Contents | Index It's little wonder that longtime Windows users are migrating in droves to the new Mac. They're fed up with the virus-prone Windows way of life, and they're lured by Apple's well-deserved reputation for producing great all-around computers that are reliable, user-friendly, well designed, and now-- with the $500 Mac mini--extremely affordable, too. Whether you're drawn to the Mac's stability, its stunning digital media suite, or the fact that a whole computer can look and feel as slick as your iPod, you can quickly and easily become a Mac convert. But consider yourself warned: a Mac isn't just a Windows machine in a prettier box; it's a whole different animal and a whole new computing experience. If you're contemplating--or have already made--the switch from a Windows PC to a Mac, you need Switching to the Mac: The Missing Manual, Tiger Edition. This incomparable guide delivers what Apple doesn't: everything you need to know to successfully and painlessly move to a Mac. Missing Manual series creator and bestselling author David Pogue teams up with 17-year-old whiz kid and founder of GoldfishSoft (www.goldfishsoft.com) Adam Goldstein to cover every aspect of switching to a Mac--things like transferring email, files, and addresses from a PC to a Mac; getting acquainted with the Mac's interface; adapting to Mac versions of familiar programs (including Microsoft Office); setting up a network to share files with PCs and Macs; and using the printers, scanners, and other peripherals you already own.
    [Show full text]
  • Voice Input Computer Systems
    Voice Input Computer Systems AT Quick Reference Guide Computer Access Series Voice input computer systems (or speech recognition systems) learn how a particular user pronounces words and uses information about these speech patterns to guess what words are being spoken. Voice input systems are useful for the following problems: Trouble physically using the keyboard - Voice input systems can allow a person to operate a computer without using a keyboard or mouse. This can help people who are not able to use their hands at all, as well as others who can use their hands but are limited by speed, fatigue, or pain (e.g., carpal tunnel syndrome, one-handed keyboarders). Trouble creating text - Voice input systems can help a person, who has difficulty spelling words, create text. This can be particularly useful for some people with learning disabilities. However, the person's reading abilities need to be strong enough to recognize when the computer displays the wrong word. Tech Connections is a collaborative project of the United Cerebral Palsy Associations, the Center for Assistive Technology and Environmental Access at Georgia Tech., and the Southeast Disability and Business Technical Assistance Center. Funded by a grant from the National Institute on Disability and Rehabilitation Research of the Department of Education, Award #H133A980052. 490 Tenth St. NW, Atlanta, GA 30318 877-TEK-SEEK; Fax: 404-885-0641 FREQUENTLY ASKED QUESTIONS: How much does it cost? Most full systems cost about $200. For exact prices, check the manufacture web sites listed below. There are also several systems for less than $100, but these cheaper versions may not include all of the dictation features such as hands-free access or a text read-back feature.
    [Show full text]
  • Speech Recognition.Indd
    Speech Recognition Systems CALL Centre Speech Recognition Systems CALL Information Sheet 15 Revised January 2005 Communication Aids for Language and Learning The University of Edinburgh Patersonʼs Land, Holyrood Road Edinburgh EH8 8AQ Tel: 0131 651 6235 Fax: 0131 651 6234 Email: [email protected] http://www.callcentrescotland.org.uk CALL Centre 2005 1 Speech Recognition Systems Speech Recognition Systems What is Speech Recognition? Speech recognition (SR) systems allow people to control a computer by speaking to it through a micro- phone, either entering text, or issuing commands to the computer, e.g. to load a particular program, or to print a document. SR systems have been around for over twenty years, but the early systems were very expensive and required powerful computers to run. During recent years, manufacturers have released basic versions of their pro- grams selling for less than £50 and have also reduced the prices of the full versions. The technology behind speech output has also changed. Early systems used discrete speech, i.e. the user had to speak one word at a time, with a short pause between words. DragonDictate is the only discrete speech system still available commercially. Over the past few years most systems have used continuous speech, allowing the user to speak in a more natural way. The main continuous speech systems currently available for the PC are Dragon NaturallySpeaking and IBM ViaVoice. Microsoft have included their own speech recognition system within recent versions of Windows. There is now a version of IBM ViaVoice for recent Apple Mac computers. Hardware Issues At the same time as the software has been changing, computer prices have continued to drop and the power of a typical desktop machine has grown to such an extent that SR software will run on almost any standard PC manufactured in the past couple of years, provided that it has sufficient memory.
    [Show full text]
  • Digital Dictation Volume 4
    Digital Dictation Volume 4 1-800-962-7566 www.easterndistributing.com Welcome to Call 1-800-962-7566 Download catalogs and specials at www.easterndistributing.com User ID: professional Password: bonita Your Source for Dictation Certified Sales Team Eastern Distributing National Sales 800-962-7566 Fax 800-962-7567 Michael Whalen ext. 1029 [email protected] • Authorized Value Added Distributor for Chris Braune ext. 1063 [email protected] Dodi Cherry ext. 1041 [email protected] Matt Colonna ext. 1027 [email protected] Michael Drouet ext. 1012 [email protected] • Qualified Staff Available to Certify & Trish Giunta ext. 1051 [email protected] Train Prospective Professional Audio Steve Hand ext. 1036 [email protected] and Speech Recognition VARs Ivette Rodriguez ext. 1048 [email protected] • Certified Professional Digital Maryland Sales Office Dictation, Microsoft® and Speech Recognition Services Available! 888-333-7988 Fax 888-333-7985 Joel Chatkin [email protected] • Free Online Demonstration Training, South Carolina Sales Office Streaming Demos and Remote PC 888-352-6400 Fax 800-455-0034 Technical Support Brenda Cox [email protected] West Palm, Florida Sales Office • Next Day Delivery 70% of the United States, Same Day Shipping on All 800-981-8856 Fax 561-760-8244 In-Stock items Dave Trumpler [email protected] Austin, Texas Technical Support Center • West Coast and East Coast Sales 800-981-8973
    [Show full text]
  • Introducing Speech Recognition in Schools
    Introducing Speech Recognition in Schools A CALL Centre Project funded by the Scottish Executive Education Department Special Educational Needs Innovation Grants Programme FINAL PROJECT REPORT AUGUST 2002 1 Contents CONTENTS 2 AIMS 3 OVERVIEW 3 DISSEMINATION 5 EVALUATION 5 OUTCOMES AND DISCUSSION 6 1. Uptake by schools 6 2. Feedback on the CALL Training Pack 6 3. Students involved in the project 7 4. Effectiveness of the Training Pack 7 5. Reasons for not continuing to use speech recognition 7 6. Effects of practice 8 7. Performance of ViaVoice and NaturallySpeaking 9 8. Effect of student skills 13 9. Effect of speech recognition on student skills and work 19 10. Reasons for using speech recognition 22 SUMMARY 24 WEB SITES 25 BOOKS, ARTICLES AND PAPERS 26 APPENDICES 28 2 Aims There is growing awareness in schools of the potential of Speech Recognition (SR) to provide access to the curriculum for pupils with SEN. However, approaches and success with SR vary widely. In some schools there are pupils using SR as their main means of writing and recording work, whereas in other schools staff have found SR difficult to implement with any success at all. The aim of the CALL Introducing Speech Recognition in Schools project was to investigate best practice in schools where speech recognition was being used successfully, and develop and evaluate training materials to help staff and students to learn to use speech recognition productively. A number of research reports and case studies on speech recognition were reviewed before designing the project: particularly, the reports from Elaine Donald at Perth High School (Donald, 1998); Martin Miles and colleagues (Miles, Martin & Owen, 1998), and in particular, the reports and case studies produced by the BECTa speech recognition project (BECTa, 2000).
    [Show full text]
  • Integrating Speech Recognition Software Into Medical Practice: Frequently Asked Questions
    Integrating Speech Recognition Software into Medical Practice: Frequently Asked Questions Jon W. Wahrenberger, MD FAHA, FACC Speech Recognition Solutions GENERAL QUESTIONS ABOUT INSTALLATION AND USE OF DRAGON: What does speech recognition software do? In the simplest terms, speech recognition software allows your voice to substitutes for keyboard entry on a personal computer. In so doing, it allows for entry of text in a word processing document or text box within an Electronic Medical Record (EMR) and navigation on your computer simply by speaking into a microphone. In addition, speech recognition software has functionality which allows multi-step processes on the personal computer to be initiated with a single spoken command. What are the hardware requirements? In general, most recent vintage Windows-based computers support the use of speech recognition software. Specific requirements appear in the table below. Although a minimum of 1 GB of RAM (random access memory) is required, 2 GB or more is generally preferred. Processor RAM Free Hard L2 Cache Drive “Minimum” Intel Pentium 1 GHz 512 2.5 GB 512 KB or faster “Recommended” Intel Pentium 2.4 or 1 GB 2.5 GB 1 MB 1.6 dual core (2GB*) Will speech recognition software work on a Macintosh computer? Although there are several speech recognition program that will work on a Macintosh, including IBM ViaVoice and MacSpeech Dictate, neither of these programs offer the functionality of Dragon NaturallySpeaking, our recommended solution. If you are determined to run Dragon on a Mac, it is possible to do so in 3 ways if you have one of the new Intel Macs.
    [Show full text]
  • IBM Websphere Voice Systems Solutions
    Front cover IBM WebSphere Voice Systems Solutions Covers features and functionality of WebSphere Voice Server with WebSphere Voice Response V3.1 Includes scenarios of Dialogic, Cisco and speech technologies Highlights application development and SDK 3.1 Guy Kempny Suchita Dadhich Ekaterina Dietrich Hu Jiong Justin Poggioli ibm.com/redbooks International Technical Support Organization IBM WebSphere Voice Systems Solutions Implementation Guide January 2003 SG24-6884-00 Note: Before using this information and the product it supports, read the information in “Notices” on page xi. First Edition (January 2003) This edition applies to WebSphere Voice Server for Windows 2000 and AIX, V3.1, WebSphere Voice Response for AIX, V3.1, and WebSphere Voice Response for Windows NT and Windows 2000, V3.1. © Copyright International Business Machines Corporation 2003. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . .xi Trademarks . xii Preface . xiii The team that wrote this redbook. xiv Become a published author . xvi Comments welcome. xvi Chapter 1. Voice technologies . 1 1.1 Access to information through voice . 2 1.2 What are voice applications? . 3 1.3 Speech recognition . 4 1.4 Text-to-speech . 6 1.5 Terminology. 7 1.6 VoiceXML . 8 1.7 Application development . 11 1.7.1 Available tools . 11 1.7.2 Creating and deploying an application . 12 1.7.3 Integrating speech recognition and TTS . 13 1.8 Hardware technology . 14 1.8.1 IBM . 14 1.8.2 Intel Dialogic . 14 1.8.3 Aculab .
    [Show full text]
  • Voice Recognition
    Technical Glossary Adaptive Technology Resource Centre Faculty of Information University of Toronto Accessible Online Learning Tools ................................................. 6 Points to ponder - Questions to consider when shopping for Accessible Online Learning Tools - Online Education Sources ................................... 6 Solutions .................................................................................................. 6 Web Resources ....................................................................................... 8 Alternative Keyboards ................................................................. 9 Points to Ponder - Questions to consider when shopping for an alternative keyboard ................................................................................ 9 Non-Keyboard Based Enhancements: ...................................................... 9 Other Free Enhancements - Windows .................................................... 10 Other Free Enhancements - Macintosh .................................................. 10 Alternative Keyboards ............................................................................ 10 Miscellaneous Keyboard Enhancers ....................................................... 11 Resources .............................................................................................. 12 Alternative Mouse Systems ........................................................ 13 Points to ponder - Questions to consider when shopping for an alternative mouse system .....................................................................
    [Show full text]
  • What Good Is a Linux Client?
    Preparing Today for Linux® Tomorrow What Good is a Linux Client? A Linux White Paper Preface Anyone who has followed the progress of Linux the past few years has noted the surprising gains that Linux has made in server operating system market share and “mindshare” against the entrenched opposition, notably Microsoft® Windows NT®, Novell Netware and various varieties of UNIX®. A complete Linux network, of course, would require Linux clients (networked PCs or terminals) as well. Therefore, an important question is, “Is Linux a viable desktop operating system for general business use?” or must we continue to be confined to the Windows world for our non-server needs? And what about laptop support, or a Small Office/Home Office (SOHO) user or home user, who does not need a network server? Is there a place for Linux in a stand-alone environment? These are some of the questions this paper addresses. I must confess that until recently I was merely a bystander watching with curiosity from the sidelines—without any real involvement— the progress Linux has made. So writing this paper not only gave me the excuse to join in on the action, but it also provides you with the perspective of a Linux “newbie” who is attempting to find the applications and utilities needed to create a working system. The conventional wisdom so far is that Linux is not “ready for prime time” as anything but a network operating system. We shall test this notion to see if I can find commercial, shareware and/or freeware programs for Linux to replace the existing applications I currently use with Windows.
    [Show full text]
  • Tr2004005.Pdf
    THE LISTER HILL NATIONAL CENTER FOR BIOMEDICAL COMMUNICATIONS A research division of the U.S National Library of Medicine TECHNICAL REPORT LHNCBC-TR-2004-005 ASKLEPiOS Project; A Report to the Board of Scientific Counselors September 2004 R. P. Channing Rodgers U.S. National Library of Medicine, LHNCBC 8600 Rockville Pike, Building 38A Bethesda, MD 20894 Table of Contents 1. Introduction ...................................................................................................................... 4 2. Project Objectives ............................................................................................................ 5 3. Significance ............................................................................................................. 6 4. Project Components ......................................................................................................... 6 4.1 Software Applications ............................................................................................ 6 4.1.1 ICPC Multilingual Collaboratory (IMC) .................................................... 6 4.1.2 Internet-Based Teleconferencing [ ... ] AccessGrid .................................... 6 4.1.2.1 Robust Audio Tool (rat) .................................................................. 7 4.1.2.2 Video Conferencing Tool (vic) ........................................................ 7 4.1.2.3 Other UCL tools ............................................................................... 8 4.1.3 Brazil (Web Application Framework)
    [Show full text]
  • Dragon Dictate Medical
    Nuance Communications, Inc. END USER LICENSE AGREEMENT Your acceptance of the terms of this End User License Agreement (“Agreement”) is required before your use of the accompanying software. This Agreement is between you (“Licensee” or “you”) and Nuance Communications, Inc. and/or one or more of its affiliates (collectively, “Nuance”). By opening the sealed Software Package and/or by installing or otherwise using the software accompanying this Agreement (“Software”), you agree to be bound by the terms and conditions of this Agreement. The term “Software” shall also include any modified versions, updates, or upgrades of the Software licensed to you by Nuance. You may install and use a modified version, update, or upgrade of the Software only if you have a validly licensed existing version of the Software being modified, updated, or upgraded. If you download, install, copy, or otherwise use a modified version, update, or upgrade of the Software, then your license terminates as to the previous version of the Software, and you have a license only to such modified version, update, or upgrade of the Software under the terms of this Agreement. If you do not agree to the terms and conditions of this Agreement, you may not install or use the Software and must promptly return the Software and all accompanying materials to the entity from which you obtained this Software Package. THIS IS A LICENSE TO USE SOFTWARE AND NOT A SALE OF SOFTWARE CODE. This document is Licensee’s proof of a non-exclusive license to exercise the rights granted herein and must be retained by Licensee.
    [Show full text]
  • Digital Voice Recorder Online Instructions
    DIGITAL VOICE RECORDER DS-2 ONLINE INSTRUCTIONS Thank you for purchasing an Olympus Digital Voice Recorder. Please read these instructions for information about using the product correctly and safely. Keep the instructions handy for future reference. To ensure successful recordings, we recommend that you test the record function and volume before use. EN Table of Contents Table of Contents ...................................... 2 4 About Playback Safe and Proper Usage ............................. 4 Playing ..................................................... 33 General Precautions .................................. 4 Changing the Playback Speed 1 Getting Started (PLAY SPEED) ................................... 39 Continuous Playback (ALL PLAY) ........... 41 Main Features............................................ 6 Repeat Playback ..................................... 42 Identification of Parts................................. 9 Auto Backspace Review Inserting Batteries ................................... 11 (BACK SPACE) ................................. 44 Power Supply .......................................... 12 Skip Interval Setting (SKIP SPACE) ....... 45 Hold.......................................................... 13 Noise Cancel Function ............................ 47 Setting Time/Date (TIME&DATE) ............ 14 Alarm Playback Function (ALARM) ........ 48 2 Menu Setting 5 Other functions Menu Setting Method .............................. 16 Erasing ..................................................... 51 Menu List ................................................
    [Show full text]