End-To-End Neural Networks for Subvocal Speech Recognition

Total Page:16

File Type:pdf, Size:1020Kb

End-To-End Neural Networks for Subvocal Speech Recognition End-to-end neural networks for subvocal speech recognition Pol Rosello and Pamela Toman and Nipun Agarwala Stanford University fprosello, ptoman, [email protected] Abstract error rates on the order of 10-50% per speaker on the EMG-UKA test corpus of approximately 100 Subvocalization is a phenomenon ob- unique words (Wand et al., 2014) are far too high served while subjects read or think, char- to use subvocal speech recognition in a practical acterized by involuntary facial and laryn- setting. geal muscle movements. By measuring Approaches to EMG-based speech recognition this muscle activity using surface elec- have so far focused on HMM-GMM models. A tromyography (EMG), it may be possi- hybrid HMM-NN model for phone labeling has ble to perform automatic speech recog- also been briefly explored (Wand and Schultz, nition (ASR) and enable silent, hands- 2014). However, obtaining the ground truth phone free human-computer interfaces. In our alignments in EMG recordings is much more chal- work, we describe the first approach lenging than with sound. It is also unclear that toward end-to-end, session-independent laryngeal muscle movements can be classified subvocal speech recognition by leverag- into the same phonemes that are used for audible ing character-level recurrent neural net- speech. To address these challenges, we leverage works (RNNs) and the connectionist tem- recent techniques in end-to-end speech recogni- poral classification loss (CTC). We at- tion that do not require forced phoneme alignment tempt to address challenges posed by models. We also consider large speaker variability a lack of data, including poor gener- and noisy measurements. alization, through data augmentation of We use a baseline three-layer recurrent neural electromyographic signals, a specialized network using spectrogram features, and try to multi-modal architecture, and regulariza- improve performance through feature engineering tion. We show results indicating reason- and using an ensemble of LSTM-based recurrent able qualitative performance on test set ut- networks. We also explore a multi-modal RNN terances, and describe promising avenues architecture that manages to perform best for the for future work in this direction. audible EMG dataset, though the recurrent en- 1 Introduction semble models performed best for whispered and silent EMG data. Through our experiments in fea- Subvocalization is silent internal speech produced ture engineering, data augmentation and architec- while reading. It is characterized by small move- ture exploration, we achieve a character error rate ments in facial and laryngeal muscles measur- of 0.702 on the EMG-UKA dataset, which is an able by surface electromyography (EMG). A suc- improvement over the 0.889 CER of our baseline cessful subvocal speech recognizer would pro- model. vide a silent, hands-free human-computer inter- face. Such an interface can add confidential- 2 Related Work ity to public interactions, improve communica- tion in high-noise environments, and assist peo- Non-audible speech is a focus of ongoing re- ple with speech disorders. Although some work search. The first break-through paper in EMG has attempted to perform speech recognition on recording to speech recognition was Chan et al. EMG recordings of subvocalization, current word- in 2001 (Chan et al., 2001), who achieved an av- erage accuracy of 93% on a 10-word vocabulary loss (Graves and Jaitly, 2014) reframes the of English digits. In addition to EMG, researchers problem of automatic speech recognition from have explored automatic speech recognition using one in which speech is comprised of phones data from magnetic sensing (Hofe et al., 2013), which have a mapping to text, into one in which radar (Shin and Seo, 2016), and video (Wand et al., speech is decoded directly as text. By feeding a 2016). recurrent neural network architecture the speech Having been inspired by Chan et al., a series of signal or derived features from the speech signal papers on EMG-based subvocalization began be- at each time step, the network learns to generate ing published by Jou, Schultz, Wand, and others characters of text. With minimal postprocessing beginning in 2007 (Jou et al., 2007). In particular, to remove duplicated and “blank” characters, the Schultz working group has steadily improved the model’s predictions map very closely to the on models that use a traditional HMM acoustic ar- character sequence. Because each time step does chitecture using time-domain features of the EMG not need a hard and correct label reflecting the signal, with triphones, phonetic bundling, a lan- phone being uttered, this approach avoids many guage model based on broadcast news trigrams, of the assumptions that have posed a challenge and lattice rescoring to estimate the most likely for traditional HMM-based modeling. The Graves hypothesis word sequences. Because EMG data et al. paper introducing CTC showed an approx- differs from audio data, we believe that the pri- imately 5% improvement in label error rate on mary contribution of collaborative work of Schultz TIMIT, from a context-dependent HMM LER of and Wand comes in their development of features 35.21% to a CTC prefix-searech LER of 30.51%, for EMG data. Based on sampling frames of time, and the performance gap has continued to grow they build a feature with high and low frequency since 2014. components, which they then reduce in dimen- sionality using LDA (Schultz and Wand, 2010; 3 Approach Wand and Schultz, 2014). A 2014 paper from We use the public portion of the EMG-UKA elec- Wand that develops a neural network architecture tromyography dataset, and we derive four alter- for phone labeling (Wand and Schultz, 2014) may native feature types from that data. Because this perform somewhat better, though direct compar- dataset has poor phoneme-level alignments, we isons are challenging as the datasets and EMG col- use the character-level CTC problem formulation, lection devices differ. The current state-of-the-art which maps audio recordings directly to textual achieves a word error rate of 9.38% for the best transcription rather than predicting a phone as an speaker-session combination on a limited set of intermediary. In contrast to the existing work words; the reported interquartile range is approxi- in the literature, we strive to build a session- mately 22% to 45%. independent model that does not retrain for each An alternative arm of work by Freitas et al. new EMG session. We experiment with three that attempts to recover text from Portuguese approaches in this realm: a mode-independent EMG signals achieves best average performance model, an ensemble of mode-dependent models, of 22.5% word error rate, also under a traditional and a multi-modal model that uses weight sharing approach (Freitas et al., 2012). The authors find to reduce the number of parameters that must be that the nasality of vowels is a primary source of trained. error, and they suspect that the muscles activated in producing nasal vowels are not detected well by 3.1 Dataset the surface EMG. Freitas et al.’s focus on phones Our data is the public EMG-UKA trial cor- aligns with the work by Schultz and Wand on “ex- pus (Wand et al., 2014). The EMG-UKA trial cor- tracting a set of features which directly represent pus consists of about two hours of EMG record- certain articulatory activities, and, in turn, can be ings of four subjects reading utterances from linked to phonetic features” (Wand and Schultz, broadcast news. EMG recordings are available 2014), and the history of challenges with that ap- while subjects read utterances in three modes: au- proach motivate our application of the connection- dibly, silently, and while whispering. The record- ist temporal classification approach. ings contain 6 channels of EMG data collected at Connectionist temporal classification 600 Hz using multiple leads, a sound recording audible whispered silent 2011). The amount of adipose tissue, age and word 4.6 (62) 3.8 (65) 3.4 (57) slackness of skin, muscle cross-talk, and the sur- phone 3.6 (194) 0.8 (194) 0.2 (188) face nature of non-invasive EMG can also reduce signal quality (Kuiken et al., 2003). Additionally, Table 1: Quality of data labels provided in corpus session-to-session differences in electrode appli- on a 0-5 Likert scale. These results indicate that it cation can result in models that overfit to a single is inappropriate to use phone-level labels for whis- session. A significant challenge of our work is to pered and silent data. We approximate data quality therefore design a model that can generalize well by averaging across qualitative ratings of five ut- to unseen speakers, sessions, and utterances de- terances selected at random from each mode, and spite a considerable lack of data. we provide the sample size at each level in paren- While the full EMG-UKA corpus contains 8 theses. hours of recording data rather than the 2 hours available in the trial corpus, it is not publicly avail- collected at 1600 Hz, and a transcription of the ut- able. The authors of the corpus were not reachable terance. While sound recordings of utterances are for release of the full dataset, despite multiple at- available, at no point in our work do we use them tempts. Because of this, it is impossible for us to to train our models. directly compare the performance of our models Each sample in the corpus contains estimated against prior work on this dataset. phone and word alignments for the audible and whispered data based on an HMM model of the 3.2 EMG feature extraction audio track, and estimated phone and word align- Traditional features used in ASR such as MFCCs ments for the silent data based on a model that cannot be used for EMG data since they rely on maps the HMM results to the silent mode.
Recommended publications
  • FAR Interpretive Report Sample
    Interpretive Report by Steven G. Feifer, DEd, Rebecca Gerhardstein Nader, PhD, and PAR Staff Client Information Client name: Sample Client Client ID : LD Test date : 05/12/2017 Date of birth : 11/10/2009 Age : 7:6 Grade/Education: 2nd Gender : Female Examiner : Dr Williams This report is intended for use by qualified professionals only and is not to be shared with the student or any other unqualified persons. Sample Client Page 1 Interpretive Report 05/12/2017 Overview of This Report The Feifer Assessment of Reading (FAR) is an individually administered measure of reading ability normed for students in prekindergarten through college. The FAR contains individual tests of reading skills which are combined to form a Phonological Index (PI), a Fluency Index (FI), and a Comprehension Index (CI). The subtests which compose the PI assess phonological processing and decoding skills of words in isolation as well as in context. The FI is comprised of subtests that assess visual perception and orthographic processing of letters and words, as well as fluidity in pronouncing phonologically-irregular words. The CI contains subtests designed to assess the underlying factors involved in deriving meaning from print. The Mixed Index (MI), calculated by combining the PI and the FI, assesses for deficits in both phonological processing and orthographic processing skills. The FAR Total Index (TI), calculated by combining the PI, FI, and CI subtests, provides the most comprehensive and reliable assessment of overall reading proficiency. Each index score is expressed as a grade-corrected standard score scaled to a mean of 100 and a standard deviation of 15.
    [Show full text]
  • Subvocalization – Toward Hearing the Inner Thoughts of Developers
    2011 19th IEEE International Conference on Program Comprehension Subvocalization – Toward Hearing the Inner Thoughts of Developers Chris Parnin College of Computing Georgia Institute of Technology Atlanta, Georgia USA [email protected] Abstract—Some of the most fascinating feats of cognition any significant effects hoping to be found when evaluating are never witnessed or heard by others, yet they occur daily a new tool. With instrumentation data, experimenters have in the minds of software developers practicing their craft. recorded actions, but little context and must substitute cog- Researchers have desperately tried to glimpse inside, but with limited tools, the view into a developer’s internal mental pro- nitive measures such as cognitive effort or memory retention cesses has been dim. One available tool, so far overlooked but with metrics such as ratio of document navigations to edits widely used, has demonstrated the ability to measure the phys- or frequency of revisiting a method. Talk-aloud protocols, iological correlates of cognition. When people perform complex like surveys, rely on self-reporting and require considerable tasks, sub-vocal utterances (electrical signals sent to the tongue, manual transcription and analysis that garner valuable but lips, or vocal cords) can be detected. This phenomenon has long intrigued researchers, some likening sub-vocal signals to indefinite and inconsistent insight. the conduits of our thoughts. Recently, researchers have even Within the past few decades, modern research disci- been able to decode these signals into words. In this paper, plines, such as psychology and cognitive neuroscience, have we explore the feasibility of using this approach and report collectively embraced methods that measure physiological our early results and experiences in recording electromyogram correlates of cognition as a standard practice.
    [Show full text]
  • The Readability of Reading Passages in Prescribed English Textbooks and Thai National English Tests
    THE READABILITY OF READING PASSAGES IN PRESCRIBED ENGLISH TEXTBOOKS AND THAI NATIONAL ENGLISH TESTS BY MISS THANAPORN SRISUNAKRUA A DISSERTATION SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY IN ENGLISH LANGUAGE TEACHING LANGUAGE INSTITUTE THAMMASAT UNIVERSITY ACADEMIC YEAR 2018 COPYRIGHT OF THAMMASAT UNIVERSITY Ref. code: 25615621320109NCD THE READABILITY OF READING PASSAGES IN PRESCRIBED ENGLISH TEXTBOOKS AND THAI NATIONAL ENGLISH TESTS BY MISS THANAPORN SRISUNAKRUA A DISSERTATION SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY IN ENGLISH LANGUAGE TEACHING LANGUAGE INSTITUTE THAMMASAT UNIVERSITY ACADEMIC YEAR 2018 COPYRIGHT OF THAMMASAT UNIVERSITY Ref. code: 25615621320109NCD (1) Thesis Title THE READABILITY OF READING PASSAGES IN PRESCRIBED ENGLISH TEXTBOOKS AND THAI NATIONAL ENGLISH TESTS Author Ms. Thanaporn Srisunakrua Degree Doctor of Philosophy Major Field/Faculty/University English Language Teaching Language institute Thammasat University Thesis Advisor Associate Professor Tipamas Chumworatayee, Ph. D. Academic Year 2018 ABSTRACT Readability has long been regarded as a significant aspect in English language teaching as it provides the overall picture of a text’s difficulty level, especially in the context of teaching and testing. Readability is a practical consideration when making decisions on materials to match a text with target readers’ proficiency. A suitable difficulty level will ensure that students receive the most benefit from the materials. However, few studies have compared the readability levels of teaching and testing materials in terms of the difficulty of passages. The present study, therefore, aims to explore the readability of reading passages in English textbooks and Thai national English tests based on three aspects: readability level, linguistic characteristics, and topic areas.
    [Show full text]
  • 5. RHF Assumptions 041713A.Pptx
    Scholarship Policy Initiatives: The Case of the Common Core State Standards Elfrieda H. Hiebert TextProject & University of California, Santa Cruz Acknowledgments: ¡ Hiebert, E.H., & Van Sluys, K. (in press). Standard 10 of the Common Core State Standards: Examining three assumptions about text complexity. In K. Goodman, R.C. Calfee, & Y. Goodman (Eds.), Whose Knowledge Counts in Government Literacy Policies? New York, NY: Routledge. 2 Elfrieda H. Hiebert www.textproject.org Fundamental Premises ¡ An emphasis on increasing students’ capacity with increasingly more complex text across their school careers is one of the goals of reading instruction. Attention to this goal is long overdue. The students who most depend on schools for academic learning can and should be reading much more complex texts. ¡ AND….Whenever a new set of mandates is enacted quickly, especially when the scholarship for an area is scant (and the developers of the mandates do not have grounding in the existing scholarship), potential for misinterpretation can be great. 3 Elfrieda H. Hiebert www.textproject.org Three Assumptions on Text Complexity 1. Readability formulas provide sufficiently valid information to guide selections for instruction and assessment. 2. Text levels need to be accelerated at all levels to ensure college/career readiness. 3. Students at all levels can be rapidly stretched to read harder texts. 4 Elfrieda H. Hiebert www.textproject.org Assumption 1: Readability formulas provide sufficiently valid assessments of text complexity that they can be used as a guide for selections in instruction and assessment. ¡ “quantitative measures should identify the college- and career-ready reading level as one endpoint of the scale.
    [Show full text]
  • About This Program
    1 ABOUT THIS PROGRAM A child listens to a direction, a lecture, a conversation, or a story, yet does not completely understand what is heard. Bits and pieces are recalled-maybe the first thing that was said, maybe the last. At times, the information is remembered, but misconstrued.Inc. The speech sig~ nal may not have been heard clearly. Perhaps the temporal concepts are confusing to the child. The child's reading, spelling, and writing skills may be weak. After a while, the child may "tune out" when people are talking. These are the hallmark behaviors of a child with an auditory processing disorder. While the evaluation process affords the clinician the opportunityPRO-ED, to quantify the child's present level of performance on a variety of standardized tasks, it also should provide the cli~ nician a window into how the child is approaching auditoryby tasks. After studying available re~ search and years of carefully comparing the testing behaviors of children with above average L auditory skills with those of children scoring poorly on auditory tests, I have compiled a list of behavioral differences in the way these two groups of children approach challenging auditory l testing tasks (see Table 1). Examples of challengingmaterial listening tasks include following complex oral directions and listening to orally presented passages and then answering comprehension L questions. As noted in Table 1, children with good auditory processing skills seem to naturally use strategies such as controlling their body, paraphrasing, subvocalizing, visualizing, and request~ l ing clarification during demanding auditory tasks, whereas children with poor auditory processing skills often docopyrighted not.
    [Show full text]
  • The Effectiveness of Using Inner Speech and Communicative Speech in Reading Literacy Development: a Synthesis of Research
    International Journal of Social Science and Humanity, Vol. 5, No. 8, August 2015 The Effectiveness of Using Inner Speech and Communicative Speech in Reading Literacy Development: A Synthesis of Research Yuan-Hsuan Lee rehearsal, and gave examples that suggest inner speech can Abstract—This paper provides a theoretical understanding of improve reading ability. inner speech and communicative speech and their impact on the process of reading literacy. The aim of this paper is to probe into questions, such as what the characteristics of inner speech II. THEORETICAL FRAMEWORK are, what the relationship between inner speech and subvocalization is, what inner speech can do to help reading, and what technology can do to help practice of inner speech and A. Huey’s Inner Speech literacy development. Review of literature has suggested that (1) Huey held that inner speech is part of the ordinary reading inner speech are abbreviated form of easily stored meaning [5]. Huey conducted experiments in which unrelated words units; (2) inner speech as a form of subvocal rehearsal can were exposed four seconds each and the readers are required retrieve information in the working memory and help solve to state what they saw [5]. He found that "the inner reading problems; (3) inner speech is a mechanism that put chunks of text into compact meaning units and that elicits pronunciation of these words and of the suggested phrases meanings when reading cognition becomes problematic; (4) constituted much of the most prominent part of the reader's technology can help literacy development but more emphasis consciousness of them." (p.
    [Show full text]
  • Reading: a Study Guide
    t DOCUMENT RESUME ED 075 811 CS 000 534 AUTHOR Laughter, Mabel Y. TITLE Introduction to Reading: A Study Guide. PUB DATE 72 NOTE 349p.; Unpublished study t EDRS PRICE MF-$0.65 HC-$13.16 DESCRIPTORS *Annotated Bibliographies; College Students; Content t Reading; Developmental Reading; Grouping (Instructional Purposes); *Literature Reviews; *Reading; Reading Development; *Reading Itruction; Reading Materials; Reading Processes; Rea( tig Research; Reading Skills; *Reference Materials; Remedial Reading; Teacher Education ABSTRACT Intended as an aid to college students in reading, this reference guide is composed of eight major topics for the study of reading instruction: overview of the reading process, prerequisites for success in reading, development of reading power and efficiency, reading in the content fields, grouping for instruction, teaching methodology and materials, appraisal of reading development, and readers with special needs. An additional category is provided for those research summaries and bibliographies which apply to reading instruction in general. Each major topic is subdivided into its component elements with further division when necessary for, understanding. TI" reference entries for each subtopic are catalogued according to the type of literature available--reviews and summaries of research and bibliographies, research studies, and theoretical discussions. A bibliographic author index is also included in the documeAt. (Author /TO) U 5 OEPARTMENT OF HEALTH EOU:ATION WELF ARE NATIONAL INSTITUTE OF EOUCATION DOCQh.+E%- AS BEEN REPRO D.JCE0 EXAC'LY A$ ECE ivED .ROM ,HE PEPSON OPGAN,ZA.ON OR.GtN T pC,,T,T$OF OPiN ON5 40-NECESSAR,LY REPRE STATED DO OF SENT OF F,C,AL NATIONAL EOJCATiON POST.O% ORPO,;C, FILMED FROM BEST AVAILABLE COPY INTRODUCTION TO READING: A STUDY GUIDE Mabel Y.
    [Show full text]
  • Republic of Turkey Çanakkale Onsekiz Mart University Institute of Social Sciences Department of English Language Teaching an I
    REPUBLIC OF TURKEY ÇANAKKALE ONSEKİZ MART UNIVERSITY INSTITUTE OF SOCIAL SCIENCES DEPARTMENT OF ENGLISH LANGUAGE TEACHING AN INVESTIGATION INTO THE IMPACT OF CULTURAL SCHEMA AND READING ACTIVITIES ON READING COMPREHENSION MA THESIS Supervisor Asst. Prof. Dr. İsmail Hakkı ERTEN Submitted by Salim RAZI Çanakkale-2004 Sosyal Bilimler Enstitüsü Müdürlüğüne Salim RAZI’ya ait ‘An investigation into the impact of cultural schema and reading activities on reading comprehension’ Adlı çalışma, jürimiz tarafından Yabancı Diller Eğitimi Anabilim Dalı İngilizce Öğretmenliği Programında YÜKSEK LİSANS TEZİ olarak kabul edilmiştir. Başkan Doç. Dr. Dinçay KÖKSAL Akademik Ünvanı, Adı Soyadı Üye Yrd. Doç. Dr. İsmail Hakkı ERTEN Akademik Ünvanı, Adı Soyadı (Danışman) Üye Yrd. Doç. Dr. Mehmet Ali YAVUZ Akademik Ünvanı, Adı Soyadı Üye Yrd. Doç. Dr. Hidayet TUNCAY Akademik Ünvanı, Adı Soyadı Üye Yrd. Doç. Dr. Ece ZEHİR TOPKAYA Akademik Ünvanı, Adı Soyadı i ABSTRACT This study was carried out to investigate the influence of cultural schema and reading activities on reading comprehension. The impact of cultural schema was tested nativizing short stories to make them culturally more familiar. The study was carried out at Çanakkale Onsekiz Mart University with 3rd grade students at the department of English Language Teaching through a 2X2 true- experimental research design where the participants were homogenously placed in different groups according to their Grade Point Averages (GPAs). The first group were given the original short story while the second group were given the nativized (Turkified) one. To find out the effect of reading activities on the comprehension of nativized and original short stories, the third group read the original short story with reading activities while the fourth group read the nativized short story with the same reading activities.
    [Show full text]
  • Reading and Assistive Technology: Why the Reader's Profile Matters
    Reading and Assistive Technology: Why the Reader’s Profile Matters by Karen Erickson ssistive technology (AT) can take many forms; however, as dyslexia is limited, and research to guide the selection of spe- Aa support for reading, electronic text is a core feature of AT. cific supports to meet individual student needs is even more Anderson-Inman and Horney offer a typology that describes limited. Yet, understanding the possible impact of AT on the the types of resources that can be used to vary the supportive- reading skills of students with dyslexia requires that we move ness of electronic text (Anderson-Inman & Horney, 1998; beyond questions regarding the use of AT generally or electron- Horney & Anderson-Inman, 1999). The typology focuses on ic texts specifically and focus on the impact that AT supports the function that each of the resources plays in the supported have on students with varying profiles of strength and weakness reading process (e.g., translational, explanatory, and illustra- in reading. tive) and is intended to serve as a conceptual framework that guides the selection of appropriate AT for students who struggle AT, Reading, and Dyslexia with reading. Table 1 shows a recent version of this typology as It is well understood that dyslexia is a specific learning dis- presented by Anderson-Inman and Horney (2007). ability that is neurological in origin and characterized by While the typography does fairly represent the range of sup- difficulties with accurate and fluent word recognition and ports that are possible given today’s technologies, there is little decoding abilities. AT, specifically text-to-speech with elec- research to guide its use with students with dyslexia.
    [Show full text]
  • Subvocalization in Reading L1japanese Texts and L2 English Texts
    Subvocalization in Reading L1Japanese Texts and L2 English Texts Takue OHNO ([email protected]) Jun-ichi ABE ([email protected]) Department of Psychology, Hokkaido University N10 W7, Kita-ku, Sapporo, 060-0810, JAPAN ‘Subvocalization’ means a subjective experience that is felt Discussion while reading silently by a reader as if he/she were hearing a The results suggested that both in L1 Japanese and L2 voice reading the text inside his/her head. As this English readings, subvocalization occurred, but that its psychological phenomenon is difficult to verify directly, degree in two languages could be different, implying that many of the studies start with the assumption that subvocalization would not necessarily occur in L2 English subvocalization does occur in reading, and then, by giving reading compared to L1Japanese reading. concurrent articulatory suppression tasks, they try to examine its occurrence (e.g. Levy, 1977). Following the Experiment 2 same line, this study aimed at reexamining psychological Method reality of subvocalization in reading. Moreover, this was Participants. Sixty-five Japanese high school students done with a comparison between L1 Japanese reading and aged 16 to 17 participated. L2 English reading, the latter of which will not be Materials. 12 Japanese texts and 12 English texts, and developed with as much phonological necessity as the their tests were newly prepared. The concept of the former. Two experiments were conducted, with the second materials was the same as in Experiment 1, but two more one modified, considering the participants’ tiredness and the texts and tests in each language were added for checking cognitive load on a part of the task in the first one.
    [Show full text]
  • Adult ESL Oral Reading Fluency and Silent Reading Comprehension Kristin Lems National Louis University
    National Louis University Digital Commons@NLU Dissertations 12-2003 Adult ESL Oral Reading Fluency and Silent Reading Comprehension Kristin Lems National Louis University Follow this and additional works at: https://digitalcommons.nl.edu/diss Part of the Bilingual, Multilingual, and Multicultural Education Commons, Curriculum and Instruction Commons, Higher Education and Teaching Commons, and the Other Teacher Education and Professional Development Commons Recommended Citation Lems, Kristin, "Adult ESL Oral Reading Fluency and Silent Reading Comprehension" (2003). Dissertations. 132. https://digitalcommons.nl.edu/diss/132 This Dissertation - Public Access is brought to you for free and open access by Digital Commons@NLU. It has been accepted for inclusion in Dissertations by an authorized administrator of Digital Commons@NLU. For more information, please contact [email protected]. ADULT ESL ORAL READING FLUENCY AND SILENT READING COMPREHENSION Kristin Lems Reading and Language Doctoral Program Submitted in partial fulfillment of the requirements of Doctor of Education in the Foster G. McGaw Graduate School National-Louis University August, 2003 Note: transfer of the files resulted in some repagination which could not be fully corrected in the table of contents ADULT ESL ORAL READING FLUENCY AND SILENT READING COMPREHENSION © 2004 Kristin Lems Abstract A descriptive study of second language adults studying ESL at the beginning, intermediate, and advanced levels in a post-secondary academic program revealed that their oral reading fluency had a significant, low-to-moderate correlation with scores on a measure of silent reading comprehension. The correlation was slightly stronger for measures of accuracy than speed, and strongest for miscue ratio. The correlation increased as proficiency level increased.
    [Show full text]
  • The Principles of Readability
    The Principles of Readability By William H. DuBay Impact Information, 126 E. 18th Street, #C204, Costa Mesa, CA 92627, (949) 631-3309 Copyright The Principles of Readability 25 August 2004 2004 William H. DuBay. All Rights Reserved. Abstract The Principles of Readability gives a short history of literacy studies in the U.S. and a short history of research in readability and the readability formulas. Readers' Comments Please send all comments and suggestions regarding this document to: William DuBay Impact Information 126 E. 18th Street, #C204 Costa Mesa, CA 92627 Phone: (949) 631-3309 Email: [email protected] Website: http://www.impact-information.com Copyright © 2004 William H. DuBay Page ii Contents Introduction ..........................................................................................................1 Guidelines For Readability..........................................................................2 The readability formulas..............................................................................2 Are the readability formulas a problem? .....................................................2 What is readability?.....................................................................................3 Content ........................................................................................................3 The Adult Literacy Studies...................................................................................4 Grading the reading skills of students..........................................................4
    [Show full text]