Japanese Vowel Recognition by Tracking Temporal Changes of Lip Shape

Total Page:16

File Type:pdf, Size:1020Kb

Japanese Vowel Recognition by Tracking Temporal Changes of Lip Shape Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Koshi Odagiri1, and Yoichi Muraoka1 1Graduate School of Fundamental/Computer Science and Engineering, Waseda University, Tokyo, Japan Abstract— In this paper, we propose a vision-based ap- vowels is important. There are two types of single sound proach to recognize Japanese vowels. Traditional researches recognition. Those are static lip image recognition and dealt with lip size, lip width and lip height, but our method tracking temporal changes of lip. deals with lip shape. Our method focus on temporal changes In this paper, we propose a method of letter recognition of lip shape, and we define new feature value to recognize focusing on temporal changes of lip shapes by model-based vowels. There are a lot of conventional studies, but those lip extraction for lip reading. studies ’datasets are captured in specific environment such as well-lighted room and using lipsticks. However, we use 2. Related works Active shape models to extract lip area and calculate fea- ture values. Therefore, our technique is not influenced by In this section, we discuss the previous related works and environment. And this paper describe the feature values are we show a direction of our method. robust. We experimented with our approach and about 80% Uchimura’s study[3] is letter recognition using static im- of average accuracy rate was obtained, and this rate is same age recognition. In their study, they use histograms of gray as vowels recognition of Japanese who use lip reading. We scale images to recognize lip area, and the letter recognition conclude that our method helps speech recognition. method use mouth size and mouth width. They use static image of lip, therefore specifying sections between letters Keywords: lip reading, vowel recognition, lip extraction is difficult and unsuitable to expand word recognition and sentence recognition. 1. Introduction Saitoh and Konishi’s study[4] uses one of the color-based Today, speech recognitions by audio are developed and method. And their method of letter recognition is to use those are used in game hardware, car navigation system temporal changes of lip size and lip aspect ratio. The results and cell phones, however, the systems cannot be used of their method was on average 93.8%. But the method is under noisy environment. Basically, speech recognition by not robust because of color-based method. impaired hearing people is based on sign language. But some people use lip reading. Therefore we can say that visual information improve performance of audio speech recognition under bad environment. To recognize mouth area is avery important for lip reading. We classify methods for recognizing into two types. One is color-based recognition such as snake algorithm[1], and two is model based recognition like Active shape models[2]. Color-based recognition is influenced by brightness of envi- ronment. On the other hand, model based recognition is not influenced by light, but need training datasets of face. Lip reading experiments are classified into four types. First is letter recognition, second is word recognition, third is sentence recognition and the last is semantic recognition. But Fig. 1: Lip area extraction by color-based method Japanese language has hiragana letters and unclear grammar. Therefore sentence recognition and semantic recognition are not robust and need a lot of learned datasets. And Japanese Figure 1 and figure 2 are results of lip area extraction pronunciations consist of some hiragana letters. Japanese using color-based method. We experimented lip extraction have differences between mouth shapes when they speak by RGB information of image. Figure 1 shows that this vowels. And almost all sound based on 5 vowels of /a/, method can get almost all lip area, but besides non-lip area. /i/, /u/, /e/ and /o/. Therefore single sound recognition of In figure 2, we changed the threshold of color-comparison. recognition of utterance by visual information using lip features. 3.1 Initialization First, we use 68 points for make active shape models learn faces, and use 19 features in those points. Figure 3 shows 68 points learned by active shape models. In this experiment, we define sections of utterance as one segment between mouth close and next mouth close. Experimentally, one section has about 30 frames to 70 frames. Therefore we adjust those sections to 50 frames. And To adjust the movement of mouth, we adjust mouth size and inclination by the width between features of both sides of the closed Fig. 2: Lip area extraction by different threshold mouth contour in the first frame. 3.2 Feature value The figures show that this color-based algorithm is clearly To tracking temporal changes, we use feature value from influenced by a background and regulation of thresholds. features of lip contours including the inside of mouth. The following figure 4 shows our definition of feature value in this experiments. Our feature value is defined the width between center point of contour and each points. These feature values mean where the features are. Fig. 3: Lip area extraction by active shape models On the other hand, we propose a lip extraction method of a model-based method. Figure 3 shows that a lip ex- Fig. 4: Features of lip area and Feature value traction by Active shape models using the same image as the above face. Clearly, the model-based method extract lip area correctly and also in detail. And our method deal Therefore feature values are formulated as with lip shapes more and more minutely. We mentioned the q above section, Japanese language consists of hiragana letters 2 2 V = (αx + αy) + (Cx + Cy) (1) on pronounces. And there are so small difference between consonants. Therefore Uchimura’s based on mouth size and where V is feature value, α is feature, and C is the center width and Saitoh’s method based on mouth size and aspect feature of mouth. ratio are unsuitable to expand to consonants recognition. 3.3 Relation between feature values We propose a robust method based on model-based lip extraction and tracking temporal changes of feature points In this paragraph, we explain relation between feature on lip shape to recognize vowels, and our method solve the values. The following figure 5 is comparison between feature above problems. values of 5 different people that calculated by the previous paragraph using the top feature of the mouth of /a/. Those features change largely at the vowel of /a/. 3. Method In addition, figure 6 is relation of temporal changes We use model-base method for lip area extraction in between vowels. We can see differences between vowels this experiments. In this section, we propose a method of from figure. 3. Evaluate values of each vowel by formula 3, and the vowel which the evaluated value is smallest is a matchable vowel for input. 4. Experiments In this section, we implement our method and experiment. And discuss the results of our system. 4.1 Setup We implemented the system which has the method we proposed. And the system was divided into the following 2 parts. Fig. 5: Feature values of /a/ by 5 people !"#$%& 0-+0$+-%!"1 '()*+ ,*-%$.* '-+$*/ ,*-%$.*/ +*-"!"1& .*0(1"!%!(" '-+$*/ +!#&-.*- 3!/4 *2%.-0%!(" Fig. 7: Chart of learning part of system Fig. 6: Feature values of vowels Figure 7 is a chart of learning part of our system. First we input a vowel and calculating feature values by our method. And learn those values to database. Considering previous two graphs, we can recognize vow- els by feature values which proposed by us and can be got !"#$%& 0-+0$+-%!"1 by formula 1. '()*+ ,*-%$.* '-+$*/ 3.4 Learning values Calculating average of previous feature values by formula 0(3#-.!"1 )!%4 1 for each vowel. And we use those values to recognize an ,*-%$.*/ +*-."*5& 5!/6 .*0(1"!%!(" input vowel. Therefore leaned datas are got by 5-%- P N V D = n=0 np (2) tvp N ($%#$% where Dtv is a learned feature value of a time of a vowel. +!#&-.*- */%!3-%*5 N is number of datasets, p is feature of lip area. V is value *2%.-0%!(" '()*+ got by formula 1. 3.5 Matching method Fig. 8: Chart of estimating part of system For recognition of vowels, we use following formula to calculate which vowel is most likely to the input. Figure 8 is a char of estimating part of our system. XT X19 Estimating part have the same processes as learning part j − j Sv = Xtvn Dtvn (3) by calculating feature values. But the next step is compar- t=0 n ing process. The comparing process is done by the above where Sv is evaluated value of a vowel, T is number of matching method of section 3 using learned database. Last, frames. And Xtvn is input vowel. D is calculated by formula we can get an estimated answer by the system. Table 1: Environment of experiments OS Windows 7 Professional 64bit edition CPU Intel Core 2 Extreme X9650 Memory 4GByte Camera Logicool 2-MP Webcam C600h Resolution of camera 640px x 480px FPS during capturing 30fps Our system was run the following table 1. We used web camera. And this means that this system was run by a camera more poor than a camera of iPhone 4. We captured 20 people speaking 5 vowels in front of Fig. 9: Comparison between two trained datasets camera and captured 3 times each. And we used 15 people of those data for valid dataset. Those valid dataset is defined not blurred and can recognize feature points by Active shape models. And our datasets were captured at various back- grounds such as laboratories, houses and meeting rooms.
Recommended publications
  • Speechreading for Information Gathering
    Speechreading for information gathering: A survey of scientific sources1 Ruth Campbell Ph.D, with Tara-Jane Ellis Mohammed, Ph.D Division of Psychology and Language Sciences University College London Spring 2010 1 Contents 1 Introduction 2 Questions (and answers) 3 Chronologically organised survey of tests of Speechreading (Tara Mohammed) 4 Further Sources 5 Biographical notes 6 References 2 1 Introduction 1.1 This report aims to clarify what is and is not possible in relation to speechreading, and to the development of speechreading skills. It has been designed to be used by agencies which may wish to make use of speechreading for a variety of reasons, but it focuses on requirements in relation to understanding silent speech for information gathering purposes. It provides the main evidence base for the report : Guidance for organizations planning to use lipreading for information gathering (Ruth Campbell) - a further outcome of this project. 1.2 The report is based on published, peer-reviewed findings wherever possible. There are many gaps in the evidence base. Research to date has focussed on watching a single talker’s speech actions. The skills of lipreaders have been scrutinised primarily to help improve communication between the lipreader (typically a deaf or deafened person) and the speaking hearing population. Tests have been developed to assess individual differences in speechreading skill. Many of these are tabulated below (section 3). It should be noted however that: There is no reliable scientific research data related to lipreading conversations between different talkers. There are no published studies of expert forensic lipreaders’ skills in relation to information gathering requirements (transcript preparation, accuracy and confidence).
    [Show full text]
  • Early Intervention: Communication and Language Services for Families of Deaf and Hard-Of-Hearing Children
    EARLY INTERVENTION: COMMUNICATION AND LANGUAGE SERVICES FOR FAMILIES OF DEAF AND HARD-OF-HEARING CHILDREN Our child has a hearing loss. What happens next? What is early intervention? What can we do to help our child learn to communicate with us? We have so many questions! You have just learned that your child has a hearing loss. You have many questions and you are not alone. Other parents of children with hearing loss have the same types of questions. All your questions are important. For many parents, there are new things to learn, questions to ask, and feelings to understand. It can be very confusing and stressful for many families. Many services and programs will be available to you soon after your child’s hearing loss is found. When a child’s hearing loss is identified soon after birth, families and professionals can make sure the child gets intervention services at an early age. Here, the term intervention services include any program, service, help, or information given to families whose children have a hearing loss. Such intervention services will help children with hearing loss develop communication and language skills. There are many types of intervention services to consider. We will talk about early intervention and about communication and language. Some of the services provided to children with hearing loss and their families focus on these topics. This booklet can answer many of your questions about the early intervention services and choices in communication and languages available for you and your child. Understanding Hearing Loss Timing: The age when a hearing loss has occurred is known as “age of onset.” You also might come across the terms prelingual and postlingual.
    [Show full text]
  • In This Issue: Speech Day 10
    Soundwave 2016 The Mary Hare Magazine June 2016 maryhare.org.ukmaryhare.org.uk In this issue: Speech Day 10 HRH Princess Royal visits 17 Sports Day 28 Ski Trip 44 Hare & Tortoise Walk 51 Head Boy & Head Girl 18 Primary News 46 SLT & Audiology 53 1 Soundwave 2016 The Mary Hare Magazine June 2016 maryhare.org.uk Acknowledgements Contents Editors, Gemma Pryor and Sammie Wilkinson Looking back and looking forward The Mary Hare Year 4–20 by Peter Gale Getting Active 21–28 Cole’s Diner 29–30 Welcome to this wonderful edition of Soundwave – Mr Peter Gale a real showcase of the breadth and diversity of experiences Arts News 31–33 which young people at Mary Hare get to enjoy. I hope you Helping Others 34–35 will enjoy reading it. People News 35–39 This has been a great year but joined us for our whole school under strict control and while Our Principal one with a real sadness at its sponsored walk/run and a they are substantial, they only Alumni 40–41 heart – the death of a member recent visit from Chelsea allow us to keep going – to of staff. Lesley White made a Goalkeeper Asmir Begovic who pay the wages and heat the Getting Around 42–45 huge contribution to Mary Hare presented us with a cheque school and to try to keep on and there is a tribute to her on for £10,000 means that the top of the maintenance of two Mary Hare Primary School 46–48 page 39. swimming pool Sink or Swim complex campuses.
    [Show full text]
  • Research and Evidence 40 Years On
    Cued Speech – research and evidence 40 years on In 2016 Cued Speech (CS) celebrates its 40 year anniversary with a conference in America which will be attended by academics and researchers from around the world. In the past 39 years a wide range of evidence has grown which demonstrates its effectiveness and yet there’s still confusion in the minds of many people in the UK as to what exactly CS is. The name ‘Cued Speech’ probably doesn’t help matters. Many people believe that the French name Langage Parlé Complété (LPC) or ‘completed spoken language’ paints a clearer picture and has helped to bring about the situation where every deaf child is offered the option of visual access to French through LPC. CS is a visual version of the spoken language in which it is used. Why then, the name Cued Speech? For hearing children the speech of parents / carers is both how they develop language and the first expression of language, then, when children start school, they have the language they need to learn to read, and then they learn yet more language through reading. For deaf children, CS does the job of speech; it is your speech made visible. When you use the 8 handshapes and 4 positons which are the ‘cues’ of CS, you turn the 44 phonemes of your speech into visible units which can, like sounds, be combined into words, sentences and, as a result, full language. Just as hearing children learn a full language thorough listening to speech, so deaf children can learn a full language through watching speech which is ‘cued’.
    [Show full text]
  • The Language Skills of Singaporean Deaf Children Using Total Communication Mandy Phua Su Yin National University of Singapore 20
    THE LANGUAGE SKILLS OF SINGAPOREAN DEAF CHILDREN USING TOTAL COMMUNICATION MANDY PHUA SU YIN NATIONAL UNIVERSITY OF SINGAPORE 2003 THE LANGUAGE SKILLS OF SINGAPOREAN DEAF CHILDREN USING TOTAL COMMUNICATION MANDY PHUA SU YIN (B.A.(Hons.), NUS) A THESIS SUBMITTED FOR THE DEGREE OF MASTER OF SOCIAL SCIENCE (PSYCHOLOGY) DEPARTMENT OF SOCIAL WORK AND PSYCHOLOGY NATIONAL UNIVERSITY OF SINGAPORE 2003 i Acknowledgements I would like to express my gratitude to: ❖ A/P Susan Rickard Liow, Department of Social Work and Psychology, National University of Singapore, for your advice and patient guidance. ❖ The Principal, Mrs Ang-Chang Kah Chai, staff and students of the Singapore School for the Deaf for participating in this study and for teaching me much about the Deaf community. ❖ A/P Low Wong Kein, Head, Department of Otolaryngology, Singapore General Hospital, and colleagues in the Listen and Talk Programme for always being quick to provide instrumental aid. ❖ Ms Wendy Tham and Mr Tracey Evans Chan for your helpful suggestions and comments on the thesis. ii Table of Contents Acknowledgements i Table of Contents ii List of Tables vi List of Figures vii Summary viii Chapter 1 Introduction 1 1.1. Deaf Education Worldwide 1 1.1.1. Definitions and Terminology 1 1.1.2. Language and Literacy 2 1.1.3. Approaches to Deaf Education and Programmes 3 1.1.3.1. Auditory-Verbal Approach 4 1.1.3.2. Bilingual-Bicultural Approach 4 1.1.3.3. Cued Speech 5 1.1.3.4. Oral Approach 5 1.1.3.5. Total Communication 5 1.2.
    [Show full text]
  • An Auditory Processing Disorder (APD) Refers to a Variety of Conditions That Affect the Way the Brain Processes Auditory Information
    Auditory Processing Disorders (APD) An Auditory Processing Disorder (APD) refers to a variety of conditions that affect the way the brain processes auditory information. APD is different from a hearing impairment, in that individuals with APD generally have normal hearing ability. Rather, an individual with APD cannot process the information they hear in the same way that others do. This can lead to difficulties in recognizing and interpreting sounds, especially the sounds involved in speech. Approximately 2-3% of children are affected with APD. Males are twice as likely as females to be affected by the disorder. The ultimate causes of APD are unknown. The Committee of UK Medical Professionals Steering the UK Auditory Processing Disorder Research Program have developed the following working definition of Auditory Processing Disorders: "APD results from impaired neural function and is characterized by poor recognition, discrimination, separation, grouping, localization, or ordering of non-speech sounds. It does not solely result from a deficit in general attention, language or other cognitive processes." APD can be difficult to diagnose in children. Often times, children who present with symptoms of APD are misdiagnosed as having ADD/ADHD, Asperger’s syndrome, or other forms of autism. Though it is different from these disorders, it shares some overlap and common characteristics with dyslexia and specific language impairment (SLI). When individuals with APD experience an inability to process verbal information, they do not process what is being said to them. Because people with APD are used to guessing to fill in the processing gaps, they may not even be aware that they have misunderstood something.
    [Show full text]
  • American Sign Language
    • HOME • INFORMATION & REFERRAL • AMERICAN SIGN LANGUAGE American Sign Language • What is American Sign Language? • Five (5) common misconceptions people have about ASL • Where can I take sign language (ASL) classes in Rhode Island? • Where can I find additional information about ASL? • For Parents: Where can I find information and resources for my deaf child? What Is American Sign Language (ASL)? ASL, short for American Sign Language, is the sign language most commonly used by the Deaf and Hard of Hearing people in the United States. Approximately more than a half-million people throughout the US (1) use ASL to communicate as their native language. ASL is the third most commonly used language in the United States, after English and Spanish. Contrary to popular belief, ASL is not representative of English nor is it some sort of imitation of spoken English that we use on a day-to-day basis. For many, it will come as a great surprise that ASL has more similarities to spoken Japanese and Navajo than to English. When we discuss ASL, or any other type of sign language, we are referring to what is called a visual-gestural language. The visual component refers to the use of body movements versus sound. Because “listeners” must use their eyes to “receive” the information, this language was specifically created to be easily recognized by the eyes. The “gestural” component refers to the body movements or “signs” that are performed to convey a message. A Brief History of ASL ASL is a relatively new language, which first appeared in the 1800s’ with the founding of the first successful American School for the Deaf by Thomas Hopkins Gallaudet and Laurent Clerc (first Deaf Teacher from France) in 1817.
    [Show full text]
  • Speech-Reading Intervention for Profoundly Deaf Child the Case of Hosanna School for the Deaf, Ethiopia
    IOSR Journal Of Humanities And Social Science (IOSR-JHSS) Volume 23, Issue 6, Ver. 7 (June. 2018) PP 26-42 e-ISSN: 2279-0837, p-ISSN: 2279-0845. www.iosrjournals.org Speech-reading Intervention for Profoundly Deaf Child The Case of Hosanna School for the Deaf, Ethiopia Dr. Tesfaye Basha Dilla University, Ethiopia Corresponding Author: Dr. Tesfaye Basha Abstract: The objective of this intervention was to develop functional speech reading skillsand to increase response of correct speech reading skill. To attain the objective of the study single subject experimental design is used. Four critical elements of methods such as selection of the target, establishment of a baseline, repeated measurement with positive reinforcement, and intervention are used in thestudy. The research design used the steps of the A-B-A-B design. The participant client is profoundly deaf. Intervention of speech reading therapy presented for six weeks. This particular intervention consists of five days in a week and half hour intervention sessions. Long experienced speech therapist teacher was selected for the intervention. Amharic vowels served as the stimulus materials and Amharic one syllable words are instrument materials used for intervention.The finding revealed that 91.77% of the one syllable words at the highest level of success in lip reading and 91.67% similar success of vowels. The client word sound levels appeared to be the best ranging from 83.66% to 100% of the time correct responses and 66.66% to 100% of the vowels sounds correct responses. The result of these words and vowels correct response improvement occurred through continuous practice under proper guidance and intervention.
    [Show full text]
  • Deep Learning for Lip Reading Using Audio-Visual Information for Urdu Language
    Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language Muhammad Faisal Sanaullah Manzoor Information Technology University Information Technology University Lahore Lahore [email protected] [email protected] Abstract But human lipreading performance is not Human lip-reading is a challenging task. It requires not precise consequently there is enormous need of only knowledge of underlying language but also visual automatic lip-reading system. It has many clues to predict spoken words. Experts need certain level practical applications such as dictating of experience and understanding of visual expressions learning to decode spoken words. Now-a-days, with the instructions or messages to a phone in a noisy help of deep learning it is possible to translate lip environment, transcribing and re-dubbing sequences into meaningful words. The speech recognition archival silent films, security, biometric in the noisy environments can be increased with the visual identification, resolving multi-talker information [1]. To demonstrate this, in this project, we simultaneous speech, and improving the have tried to train two different deep-learning models for lip-reading: first one for video sequences using spatio- performance of automated speech recognition in temporal convolution neural network, Bi-gated recurrent general [1, 2]. neural network and Connectionist Temporal Classification Usually in noisy environments speech Loss, and second for audio that inputs the MFCC features recognition systems fails or performs poorly, to a layer of LSTM cells and output the sequence. We have because of the extra noise signals. To overcome also collected a small audio-visual dataset to train and test our model.
    [Show full text]
  • Forensic Lip Reading by Tina Lannin Tina Is a Life-Long Lip Reader, She Is Totally Deaf and Is a Certified Lipreading Teacher
    Forensic Lip Reading by Tina Lannin Tina is a life-long lip reader, she is totally deaf and is a certified lipreading teacher. She has worked as a forensic lip reader for 20 years and heads up a forensic lip reading team at 121 Captions. Lip reading, or speech reading, is the skill of using pick words and work out what is seen and grasp the one sense that was meant for another. In place of context. It can take on average one hour to work heard speech, lip shapes are seen, decoded by one’s through one minute of video. brain and knowledge of language, and translated so Why does lip reading ability vary so much from that they make sense. A lip reader will watch a per- person to person? son’s lips, facial expressions, eyes, gestures, and body Speech is designed primarily to be heard, not viewed. language, and if possible they will use context to clue Many of the critical aspects of speech are hidden from themselves into the topic. Context is very important view. Most consonants are produced by actions of the in lipreading but this is often missing in forensic lip tongue inside the oral cavity (g,d,t,y,r,s,n,k,sh,s,j,z) reading. and not by visible actions of tongue, lips, teeth (m/p, Forensic lipreading is the practice of applying lip f/v, th). Lip shapes do not always reflect the speech reading skills to (typically silent) video footage where sounds being made, but can anticipate or follow no context is given since an airlock must be present, them: for example, the mouth shape for the final ‘th’ i.e.
    [Show full text]
  • Predicting the Ability to Lip-Read in Children Who Have a Hearing Loss Jeanne Breitmayer Flowers
    Washington University School of Medicine Digital Commons@Becker Program in Audiology and Communication Independent Studies and Capstones Sciences 2006 Predicting the ability to lip-read in children who have a hearing loss Jeanne Breitmayer Flowers Follow this and additional works at: http://digitalcommons.wustl.edu/pacs_capstones Part of the Medicine and Health Sciences Commons Recommended Citation Flowers, Jeanne Breitmayer, "Predicting the ability to lip-read in children who have a hearing loss" (2006). Independent Studies and Capstones. Paper 428. Program in Audiology and Communication Sciences, Washington University School of Medicine. http://digitalcommons.wustl.edu/pacs_capstones/428 This Thesis is brought to you for free and open access by the Program in Audiology and Communication Sciences at Digital Commons@Becker. It has been accepted for inclusion in Independent Studies and Capstones by an authorized administrator of Digital Commons@Becker. For more information, please contact [email protected]. Predicting the Ability to Lip-Read in Children who have a Hearing Loss by Jeanne Breitmayer Flowers An Independent Study submitted in partial fulfillment of the degree requirements for the degree of: Masters of Science in Deaf Education Washington University School of Medicine Program in Audiology and Communication Sciences May 19, 2006 Approved by: Nancy Tye-Murray, Ph.D., Independent Study Advisor Abstract: This study aims to discover if a variety of factors related to a child’s education and audiologic history predict a child’s ability to lip-read. 1 Introduction A variety of factors could potentially influence a child’s ability to lip-read, such as a child’s age, the child’s current school placement, or the child’s speech, language, and speech perception ability.
    [Show full text]
  • “Hear, Israel” the Involvement of Jews in Education of the Deaf (1850–1880)
    Jewish History (2009) 23: 41–56 © The Author(s) 2008. This article is published DOI: 10.1007/s10835-008-9070-y with open access at Springerlink.com “Hear, Israel” The involvement of Jews in education of the deaf (1850–1880) MARJOKE RIETVELD-VAN WINGERDEN AND WIM WESTERMAN Department of Theory and Research in Education, VU-University Amsterdam, Van der Boechorststraat 1, 1081BT Amsterdam, The Netherlands E-mail: [email protected]; [email protected] Abstract During the last two centuries there has been a methodological struggle over teaching the deaf. Do deaf people learn to communicate by means of gestures and signs (the “manual method”) or is it important for them to learn speech and lip-reading (the “oral method”)? In the second half of the nineteenth century, many schools for the deaf made the transition from the manual to the oral method, which the Milan conference of teachers of the deaf decided to promote in 1880. In this conversion, Jews played an important role. Yet there appears to be a clear link between their efforts and Jewish tradition, including its perception of the deaf. Introduction Jewish teachers have played a dominant role in the special education of the deaf. These teachers were also strong defenders of what is known as “the oral method” of deaf-instruction. Why, we would like to ask, were so many Jews eager to take on the role of teaching the deaf to speak and why in just this way? Moreover, what role, if any, does Jewish tradition have in this story? We believe that role exists.
    [Show full text]