<<

Sociology at Surrey University of Surrey

social researchUPDATE • The technology needed to make digital recordings of interviews and meetings for the purpose of qualitative research is described. • The advantages of using technology are outlined. • The technical background needed to make an informed choice of technology is summarised. • The Update concludes with brief evaluations of the types of audio recorder currently available.

Tools for Digital Audio Recording in Qualitative Research

Alan Stockdale

In a recent book Michael Patton writes, “As a naïveté, can heighten the sense of “being Dr. Stockdaleʼs training is in good hammer is essential to fine carpentry, there”. For discussion of the naturalization cultural anthropology. He is a a good is indispensable to of audio recordings in qualitative research, senior research associate at fine fieldwork” (Patton 2002: 380). He see Ashmore and Reed (2000). Education Development Center goes on to cite an example of transcribers in Boston, Massachusetts, at one university who estimated that 20% Why digital? of the tapes given to them “were so badly where he currently serves Audio Quality as an investigator on several recorded as to be impossible to transcribe The recording process used to make genetics education research accurately – or at all.” Surprisingly there analogue recordings using is remarkably little discussion of tools and introduces noise, particularly tape hiss. projects funded by the U.S. techniques for recording interviews in the Noise can drown out softly spoken words National Institutes of Health. qualitative research literature (but see, for and makes transcription of normal speech example, Modaff and Modaff 2000). difficult and tiring. Digital recorders This overview discusses the potential generally have a much higher to advantages of and noise ratio. Less noise reduces the risk of provides some technical background and lost data and results in faster, less expensive a checklist of features to consider when and more accurate transcription. buying a digital recorder. It concludes with Note that audio quality also depends on brief comments on the different types of using a suitable external or recorder currently available and the names properly positioned near of some of the leading manufacturers. As speakers in an environment with low levels the technology is changing rapidly and new of ambient noise. recorders are appearing constantly there is little point in recommending particular Digital Editing models as the recommendations would There are cheap, sophisticated audio editing rapidly become obsolete. programs (e.g. Syntrillium’s CoolEdit 2000) Detailed discussion of the methodological that can be helpful if they are used with issues associated with audio recording is care. These programs can be used to adjust beyond the scope of this Update. Let it be the recording level, fix recordings in which noted, however, that audio recording uses one speaker louder than another, ISSUE technology and technique to frame and reduce unwanted background noise, filter structure the representation of an event. unnecessary , silence personal It is important to keep this in mind given or identifying information to protect that the quality improvement of digital anonymity, and cut extraneous sections 38 technology, when coupled with technical from the beginning or end of audio files. social research UPDATE

Archiving Channels It is easy and inexpensive to backup and Single channel or mono recording often archive digital audio recordings. When works fine for interviews. Mono recording using a compressed digital format such as also doubles the available record time when MP3 it is possible to store an entire research using a digital recorder. However, stereo project on one or two CD-ROMs. However, recording may be an advantage in some because digital audio is readily copied and situations where the speakers are separated transmitted, additional steps may need to from each other or where there are several be taken to ensure that original recordings speakers. To take advantage of stereo are kept secure and research participants’ recording a microphone setup that allows confidentiality is adequately protected. each microphone element to be positioned next to a different speaker or set of speakers -based Transcription will be necessary. This will aid transcription Transcription can greatly improve by making it easier to ensure that a good the usability and usefulness of transcriptions audio recording level is obtained for all (Muhr 2000). This is primarily accomplished speakers and making it possible for the through the automatic insertion of tags that person doing the transcription to use the encode additional data during transcription. stereo separation to help identify speakers See for example DGA’s Transcriber and transcribe overlapping speech. software, which uses eXtended Markup Language (XML) tags. The most obvious Recording Level use of tags is synchronization of transcript The level of the – how files to their corresponding audio files. much the microphone signal is amplified This facilitates checking, correcting, or later – needs to be set properly to make a good referral as the researcher can go to any point recording. If the signal is too strong it will in the transcript and immediately play the be distorted; too weak and the speech one corresponding segment of audio. Hopefully, wishes to record may be swamped by noise these type of features will be integrated into and difficult to hear. The majority of cheap analysis software eventually. recording devices do not provide any visual display of the level and set the recording Technical background level automatically. This makes recording Response easy but automatic level control (ALC) The audible range of the human ear is can be problematic (Modaff and Modaff approximately 20 Hz to 20 kHz. The most 2000). ALC constantly adjusts the level to important frequencies for speech occur in any audio , even background noise the “mid range” between 250 Hz and 8 kHz. during pauses in speech. This may result The sensitivity of microphones and in the level being frequently, although recorders to audio frequencies varies. briefly, poorly adjusted to the speech being Microphones for recording speech often recorded. ALC also changes the overall are most sensitive to the range between 200 dynamics so that the difference between Hz and 10 kHz. Digital recorders also vary in loud and quiet speech is compressed. their sensitivity. A MiniDisc recorder, when Digital Audio matched with an appropriate microphone, is Digital audio is recorded by sampling a capable of recording frequencies between 20 wave and assigning each sample a Hz and 20 kHz. Some digital voice recorders value. The quality of the audio depends on when set to “long play” mode may only the sampling frequency and the resolution, encode frequencies between 300 Hz and 3 that is, the range of values that can be kHz. line frequencies are limited assigned to each sample. The sampling to those between 400 Hz and 3.4 kHz. A frequency is significant to the extent it that approximates the needs to be at least double the highest mid-range frequencies will result in the best frequency one wishes to record. Music CDs speech recordings. use a sample rate of 44.1 kHz – a rate more social research UPDATE

than adequate for encoding frequencies Factors to consider up to 20 kHz. For recording speech, a • Cost (including batteries and media sample rate of at least 16 kHz will ensure if applicable). It is a false economy to good quality. Audio is normally encoded purchase a cheap recorder if the audio in 8 bits, 16 bits, or in some cases higher quality is such that it increases the cost resolutions. The higher the bit depth, the and time of transcription. Transcription is greater the number of amplitude values time consuming and expensive so a good that can be represented for each sample. recorder costing hundreds of pounds will An 8 bit resolution may be adequate for quickly pay for itself. recording speech for some purposes, but • Audio quality. See discussion above. 16 bits is better. • Easy of use. Digital Audio and Compression • Microphones. Are external microphones CD quality digital audio corresponds to a supported and are they supported through sample rate of 44.1 kHz, encoded at 16 bits, line-in or mic-in jacks? Internal microphones on two channels. This works out to: are usually of low quality, may pick up noise 44.1k × 16 × 2 = 1411.2 kilobits/second from the recorder, and may be difficult 1411.2k / 8 = 176.4 kilobytes/second to position close to interviewees. An 176.4k × 3.6k = 635.04 megabytes/hour external , often expensive and To record at this rate consumes a cumbersome, may be needed to boost the considerable amount of storage space. microphone signal to if an external The same is true of other forms of Pulse microphone can only be attached through a Code (PCM) audio, which is line-in jack. the usual format for Windows WAV files • Portability and intrusiveness in an and Macintosh AIFF files. Even if the interview situation. encoding rate is reduced by using a 16 kHz • Ruggedness and reliability of recorder sample rate at eight bit resolution with one and media. channel (which for many purposes might • Audio formats. What type of recording be satisfactory for recording speech) the formats or compression schemes are recording will still consume 57.6 MB/hr. supported? How easy is it to use a given The solution to the space problem is format with data analysis, audio editing, and compression schemes or codecs that use transcription tools? psychoacoustic principles and other audio • Stereo recording. Is stereo recording features to reduce the bit rate in ways that supported? See discussion above. social research UPDATE is limit the perceived quality loss of the audio distributed without charge on stream. Common compression schemes • Computer transfer. How easy is it to include: Fraunhofer MPEG 1 Layer 3 (MP3), transfer recordings to a computer? Is USB request to social researchers Advanced Audio Coding (AAC), Adaptive or some other method that allows faster in the United Kingdom by Transform Acoustic Coding (ATRAC), and than real time upload supported? the Department of Sociology Windows Media Audio (WMA). MiniDisc • Record time. How long will media at the University of Surrey uses ATRAC, which in standard mode, like and batteries allow recording to continue CD audio, samples at 44.1KHz, in stereo, uninterrupted? as part of its commitment to and encodes in 16 bits, but saves the audio • Batteries. Can rechargeable batteries be supporting social research in 1/5 the space without perceptible loss used? training and development. of quality. Fraunhofer MP3 saves audio in • Control over the recording process. 1/11 the space. A Fraunhofer MP3 audio file Is the recording level displayed and is it Contributions to social encoded at 32 kbps (22.05 kHz sample rate, possible to manually adjust the recording research UPDATE that with 16 bit encoding, mono) will provide level? See discussion above. review current issues in social good voice recording for many purposes • Additional information display. Does and only takes up 14.4 MB/hr. Newer research and methodology the recorder display remaining battery in about 2,500 words are codecs such as WMA and AAC maintain power and remaining record time? perceived audio quality at even greater Copy protection. Is a copy protection welcome. All UPDATE compression ratios. • scheme implemented and, if so, what articles are peer-reviewed. social research UPDATE

limitations does it impose on the use of may be an issue with some of these devices. Personal Note audio recordings? They nearly always lack a microphone input My own practical experience is presently jack as well as other features that would make limited to the use of cassette tape, Types of digital recorder them good field recorders. That said, some of MiniDisc, and computer recording. I Many of the digital recorders designed these devices have great potential and future currently make recordings using either a specifically for recording interviews or developments are worth watching. MiniDisc recorder or a computer. In the meetings are expensive, complicated, and MiniDisc (Sharp, Sony) future, I plan to trade in my MiniDisc for geared to the needs of broadcast journalists. MiniDisc provides “near CD” quality audio a suitable solid-state recorder. To learn Other types of digital recorder that are recording, is very portable, has long record about my approach to using digital audio simpler and cheaper are often designed times, and is relatively cheap (although the in my own qualitative research see: http:// primarily as portable music players or cheapest recorders should be avoided if they www.edc.org/CAEPP/resources/audio.asp. for simple dictation and may have some lack a microphone jack). are often significant limitations when used to record used by broadcast journalists and others as References interviews and meetings. At the moment, Ashmore, Malcolm and Darren Reed 2000 a cheap alternative to more expensive field there are few devices that fall in the middle ‘Innocence and Nostalgia in Conversation recorders. Disadvantages include a poor ground, but new ones are constantly Analysis: The Dynamic Relations of Tape computer interface – upload of audio files is only appearing. and Transcript’, Forum Qualitative possible by real time re-recording. MiniDisc also Cassette Tape Recorder (Sony, Marantz) Sozialforschung / Forum: Qualitative needs to be used carefully to ensure directory Social Research 1(3). Available at: http:// While not digital, a cassette recorder can still be information is saved or recordings will be lost. qualitative-research.net/fqs/fqs-eng.htm. used to create digital audio files by re-recording Voice Recorders (Olympus, Sony, Panasonic) Modaff, J. V. and D. P. Modaff (2000). cassette tapes to a computer equipped with a These small solid-state devices are designed Technical notes on audio recording. soundcard. Disadvantages are a low signal-to- to record memos, dictated letters, and the Research on Language and Social noise ratio, limited recording time, and the need like. Some of the more expensive ones have Interaction. 33 (1), 101-118. for analogue to digital conversion. microphone jacks and interface well with Muhr, Thomas (2000) “Increasing the Handheld (Handspring, Palm, computers through a USB connection or Reusability of Qualitative Data with XML”, HP, Toshiba, NEC, Casio) removable cards. Most of these Forum Qualitative Sozialforschung / PocketPC and Palm devices can be used to record devices save audio in highly compressed Forum: Qualitative Social Research, 1(3). audio but very few of these devices support formats, with low sampling frequencies, and Available at: http://qualitative-research.net/ the use of external microphones. Handheld limited frequency sensitivity. These factors will fqs/fqs-eng.htm. computer devices may eventually appear with limit audio quality. Future models are likely to Patton, Michael 2002 Qualitative Research input jacks or add-ons that allow external support higher quality audio. and Evaluative Methods. Third Edition. microphones to be used. Thousand Oaks, CA: Sage Publications. Professional Solid State Recorders Desktop and Portable Computers (Marantz, Denon, Maycom, Mayah, Orban) Direct to computer recording may be the best These recorders are designed for and cheapest way to make digital recordings of of interviews by broadcast journalists. They are interviews done by phone when equipped with usually rugged and reliable, have sophisticated a good telephone coupler, soundcard or USB recording features, are generally larger than audio (e.g. Griffin Technology’s other portable recorders, interface well with social research UPDATE iMic), and recording software. A computer may computers, and are usually very expensive. (ISSN: 1360-7898) be cumbersome for field recordings. The latest CD-R/RW Recorders ultra subnotebooks (Sony, Toshiba, Fujitsu) are is published by quite small and light, but availability is often Marantz has recently started to sell a professional Department of Sociology portable CD-R/RW recorder (CDR300) designed limited outside Japan. This is an expensive University of Surrey option but most people either own a computer for recording meetings and interviews. It is already or need one for other tasks. expensive but audio quality should be excellent, Guildford GU2 7XH blank discs are cheap, and audio is easily Consumer Player/Recorders United Kingdom. transferred to computer. Some portable consumer devices that are Tel: 01483 300800 (DAT) Recorders (Sony, primarily designed for listening to music can be Tascam) Fax: 01483 689551 used to record speech. At the moment, these Edited by Nigel Gilbert devices are designed around either small hard DAT is primarily a professional recording (e-mail: [email protected]) drives (Creative Labs, Archos) or solid-state medium. It is expensive and is rapidly becoming memory storage (Pogo Products). Reliability obsolete. Autumn 2002 © University of Surrey