Influence of Native and Non-Native Multitalker Babble on Speech Recognition in Noise Chandni Jain, Sreeraj Konadath, Bharathi M

Audiology Research 2014; volume 4:89

Influence of native and non-native multitalker babble on speech recognition in noise

Chandni Jain, Sreeraj Konadath, Bharathi M. Vimal, Vidhya Suresh
Department of Audiology, All India Institute of Speech and Hearing, Manasagangothri, Mysore, India

Abstract

The aim of the study was to assess speech recognition in noise using multitalker babble of native and non-native language at two different signal-to-noise ratios. Speech recognition in noise was assessed in 60 participants (18 to 30 years) with normal hearing sensitivity, having Malayalam or Kannada as their native language. For this purpose, six-talker and ten-talker babble were generated in Kannada and Malayalam. Speech recognition was assessed for native listeners of both languages in the presence of native and non-native multitalker babble. Results showed that speech recognition in noise was significantly higher at 0 dB signal-to-noise ratio (SNR) than at -3 dB SNR for both languages. Performance of Kannada listeners was significantly higher in the presence of native (Kannada) babble than in the presence of non-native (Malayalam) babble. However, this was not the case for the Malayalam listeners, who performed equally well with native (Malayalam) and non-native (Kannada) babble. The results of the present study highlight the importance of using native multitalker babble for Kannada listeners in lieu of non-native babble, and of considering each SNR when estimating speech recognition in noise scores. Further research is needed to assess speech recognition in Malayalam listeners in the presence of other non-native backgrounds of various types.

Correspondence: Chandni Jain, Department of Audiology, All India Institute of Speech and Hearing, Manasagangothri, Mysore 570 006, India.
E-mail: [email protected]

Key words: speech perception in noise, multitalker babble, signal to noise ratios.

Acknowledgements: the authors would like to express gratitude to Dr. S.R. Savithri, Director, AIISH and Dr. Animesh Barman, HOD-Audiology for permitting to carry out this research.

Contributions: CJ, study design, data analysis, final manuscript correction; SK, data analysis, final manuscript preparation; BMV, VS, data collection and data analysis.

Conflict of interests: the authors report no conflict of interests.

Received for publication: 29 October 2013.
Revision received: 6 March 2014.
Accepted for publication: 11 March 2014.

This work is licensed under a Creative Commons Attribution NonCommercial 3.0 License (CC BY-NC 3.0).

©Copyright C. Jain et al., 2014
Licensee PAGEPress, Italy
Audiology Research 2014;4:89
doi:10.4081/audiores.2014.89

Introduction

Perception of speech is the ability of the listener to perceive the acoustic waveform produced by a speaker as a string of meaningful words and ideas and attend to it.1 Comprehensive understanding of speech depends on various noise conditions, such as the signal-to-noise ratio (SNR) and the type of background noise. Speech recognition in noise is clinically used to assess auditory function in individuals with hearing sensitivity within normal limits and in individuals with hearing loss.

Speech recognition in noise is affected by two types of processes, namely signal driven and knowledge driven. The main purpose of signal driven processes is to analyze the different acoustic sources of the speech signal.2 Knowledge driven processes, however, rely mainly on the linguistic content of the speech.3 Most speech perception studies have focused on signal driven processes, and thus little is known about the influence of knowledge driven processes in speech perception.

The knowledge driven process in speech recognition in noise can be studied by comparing listeners with different native languages in the presence of different multitalker babble, i.e., multitalker babble of native as well as foreign languages. Speech perception in the presence of competing noise results in masking of the target speech. This could be due to energetic masking and informational masking. Energetic masking results from competition between target and masker at the periphery of the auditory system, and informational masking refers to everything that reduces intelligibility of speech once energetic masking is accounted for.4

In the past, a number of studies have examined the effects of babble on listeners' recognition of sentences in their native language. Studies have shown that the linguistic content of the masking speech signal can influence speech recognition.5,6 It is reported that the performance of the listener degrades when the competing speech is spoken in the listener's native language compared with a language that is unfamiliar to them.6,7

In another study,8 Calandruccio et al. investigated the effect of linguistically and phonetically close and distant maskers on speech recognition. They reported that speech recognition performance increased as the target-to-masker linguistic distance increased. Russo and Pichora-Fuller9 reported that word recognition performance was poorer in the presence of multitalker babble than in the presence of familiar music.

The objective of this research was to investigate the effect of native and non-native babble backgrounds on speech recognition performance. India is a multilingual country with twenty-three constitutionally recognized languages.10 Thus audiologists in the country face great difficulty in delivering services to a culturally and linguistically diverse client load. Also, as India is a bilingual mosaic,11 situations occur in which the native speakers of a specific language are exposed to multitalker babble of some other, non-native language. So, there is a need to know the influence of multitalker babble of native and non-native language on speech recognition in noise scores. Further, because SNR is a major determinant of speech intelligibility, there is a need to know the influence of SNR on speech recognition in noise scores with varying complexity of background noise.

The present study aimed to study speech recognition in noise using multitalker babble of native and non-native language at different SNRs. Two Dravidian languages (Kannada and Malayalam) were selected for the present study, and the origin of both languages is from Sanskrit. These two languages are spoken in the southern states of India, and native speakers of both languages interact with each other. Thus, it will be of great interest to study the influence of native and non-native babble of these languages on speech recognition in the listener's native language. The objectives of the study were: i) to compare the speech recognition of Kannada listeners in the presence of native and non-native multitalker babble; ii) to compare the speech recognition of Malayalam listeners in the presence of native and non-native multitalker babble; iii) to compare the effect of SNRs (0 and -3 dB) on speech recognition scores; iv) to compare the effect of the number of talkers in the multitalker babble (ten-talker and six-talker babble) on speech recognition in noise scores.

Materials and Methods

Participants

Sixty participants in the age range of 18 to 30 years with normal hearing sensitivity participated in the study. Thirty native speakers of Malayalam and thirty native speakers of Kannada were selected for the study. Participants in both groups did not know how to speak, read, or write the non-native language, and they had English as their second language. The listeners who participated had normal hearing, as indicated by their four-frequency (500 Hz, 1000 Hz, 2000 Hz, 4000 Hz) pure-tone average threshold of ≤15 dB HL, 'A' type tympanogram with [...]

[...] Malayalam. The speaker used for recording was a bilingual speaker, with Malayalam as her first language and English as a second language. She was instructed to read the words in the list as naturally as possible. The individual wave files of the words were stored on a computer hard disk and were normalized using Adobe Audition (Version 3.0) software.

To ensure that the developed lists were of equal intelligibility, speech recognition scores were obtained from 30 native Malayalam speakers, and the results showed that all the lists were of equal difficulty. Speech recognition scores of all the participants ranged from 95 to 100% across the lists. The word list in Kannada was adapted from Sreela and Devi.12 This list also consisted of 100 words distributed in four sublists of 25 words each. The Kannada word list was also phonetically balanced and consisted of verbs, and the intelligibility of this list was checked by the authors on 30 native Kannada speakers.

Generation of multitalker babble

For the purpose of assessing speech recognition in the presence of noise, multitalker babble was generated in Kannada and Malayalam. For the construction of the babble, speech was recorded from 20 individuals using different standardized passages in Kannada and Malayalam. The passages consisted of 304 words in Kannada and 307 words in Malayalam.14 Ten native speakers of Malayalam and ten native speakers of Kannada, with equal numbers of males and females, were selected to generate the ten-talker and six-talker babble. The same talkers were used for the generation of the ten-talker and six-talker babble, and all the talkers were native speakers of the language with English as their second language. Further, Kannada multitalker babble was mixed with Kannada and Malayalam words at 0 dB SNR and -3 dB SNR using MATLAB code. Similarly, Malayalam multitalker babble was mixed with Malayalam and Kannada words to meet the objectives of the study. The onset of the multitalker babble preceded the onset of each word by 600 ms, and the babble continued until 600 ms after the end of each word in the recorded list.

Procedure for speech recognition testing

Speech recognition was assessed using the test materials described above in 60 participants (18 to 30 years) with normal hearing sensitivity, having Malayalam or Kannada as their native language (30 in each language group).
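The mixing step described under Generation of multitalker babble (babble combined with each word at 0 dB or -3 dB SNR, starting 600 ms before the word and ending 600 ms after it) can be illustrated with a short sketch. The study reports using MATLAB code, which is not reproduced in the paper; the Python below is only an illustration under stated assumptions: SNR is taken as the ratio of the RMS level of the word to the RMS level of the babble segment, the babble rather than the word is rescaled, and the names `rms`, `mix_at_snr` and `pad_ms` are hypothetical.

```python
import math
import random


def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(word, babble, snr_db, sample_rate, pad_ms=600):
    """Embed `word` in `babble` at the requested SNR (in dB).

    Assumptions (not taken from the paper): SNR is defined on RMS levels,
    and the babble is scaled while the word is left unchanged.
    The babble starts `pad_ms` before the word and ends `pad_ms` after it.
    Returns the mixed signal and the gain applied to the babble.
    """
    pad = int(round(sample_rate * pad_ms / 1000))
    total = len(word) + 2 * pad
    if len(babble) < total:
        raise ValueError("babble is shorter than the word plus padding")
    segment = babble[:total]
    # Choose the gain so that 20*log10(rms(word) / rms(gain*segment)) == snr_db.
    gain = rms(word) / (rms(segment) * 10 ** (snr_db / 20))
    mixed = [gain * b for b in segment]
    # Add the word on top of the babble, starting after the leading pad.
    for i, w in enumerate(word):
        mixed[pad + i] += w
    return mixed, gain


if __name__ == "__main__":
    # Synthetic stand-ins for a recorded word and a babble track.
    fs = 8000
    word = [math.sin(2 * math.pi * 440 * n / fs) for n in range(fs // 2)]
    rng = random.Random(1)
    babble = [rng.gauss(0.0, 0.2) for _ in range(2 * fs)]
    for snr in (0, -3):
        mixed, gain = mix_at_snr(word, babble, snr, fs)
        print(snr, len(mixed), round(gain, 3))
```

Scaling the babble instead of the word keeps the presentation level of the target constant across SNR conditions; the opposite choice would be equally consistent with the description in the text.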
