Production of Digital Internet Video Material for Streaming Applications

Session 3220

Z. Chambers, M. B. Taylor, J. Iannelli and A. J. Baker
University of Tennessee
Knoxville, TN 37996-2030

Abstract

The rapid growth of Internet-based teaching curricula has prompted a new direction for distance education - the streaming of live video lectures to remote student sites for on-demand education. The live material is of exceptional quality, and the post-processed static files are better than nearly all currently produced streaming video formats. The necessary compression software and computer hardware are readily available and surprisingly inexpensive. The development of quality video material, however, is a time-consuming process that requires both technical savvy and an artistic touch. This paper therefore provides a detailed recipe for creating digital video material for streaming applications.

Introduction

The emergence of the Internet, and in particular of high-performance communications, has allowed the traditional classroom-based educational environment to move to a multimedia, computer-driven venue, admitting global students whose only participation requirement is a modern computer with an Internet connection. The deployment of synchronized audio and video to support a developed static curriculum, i.e., HTML and PDF documents, gives the remote user a sense of “human-ness” while allowing an entire curriculum to be “taught” at any time.

The advent of streaming compression technology has removed the restrictive file-size limitations of previous video compression-decompression algorithms (codecs). A one-hour video lecture, which occupies nearly 10 gigabytes in uncompressed digital video form, can be compressed to under 6 megabytes and streamed to a remote user for real-time reception through a 28.8 Kbps modem.

Basic recipe directions are as follows. The video is first recorded with a Canon Optura digital camcorder.
Next, it is captured via an Osprey 100 video capture card to an 18 GB Quantum hard drive connected to a Pentium II 333 PC, and hand-edited in Adobe Premiere. The audio track is exported, noise-reduced, and amplified for optimum clarity with CoolEdit. The audio track is then recombined with the video and compressed using the Indeo 5.06 codec. After this first round of compression, the file is archived onto a compact disc for future use. The file is then further compressed, using RealMedia’s proprietary compression technology, for uploading to a Linux-powered video server. Students can access the videos using nothing more than a standard web browser and the free RealPlayer plugin.

Discussion and Results

Step 1: Recording the Video Lecture

To record the video lecture, three key elements are required: a high-quality camcorder, a lavaliere microphone, and a well-lit whiteboard. The choice of camcorder is primarily dictated by budget - a $500 8mm camcorder will generate acceptable entry-level video, while a $1500 digital camcorder will provide exceptional video quality. An important theorem is that final video quality is directly proportional to initial video quality; hence obtaining the highest-quality camcorder is of utmost importance. The Canon Optura digital camcorder was selected for its excellent features, competitive price (WB Hunt, $1500), and pure digital video output. After experimenting with the built-in wide-area microphone, we incorporated a lavaliere microphone (Radio Shack, $39.99) to better focus the audio on the instructor and to minimize room noise. The final element, a well-lit whiteboard, required the construction of an incandescent light rack (Home Depot, $100) over the whiteboard to eliminate shadows caused by the regular fluorescent room lighting.

The actual recording of the lecture should take full advantage of the optical zoom features of the camera.
The digitally enhanced zoom feature should not be used, as it introduces video “noise” that is unacceptable for clean compression. The purpose of the camera is to relay the classroom experience to the remote user, not to focus on the instructor’s head. Equations and key notes on the board take precedence, while the instructor should be the focus only for monologues longer than a minute. Zooming must be slow and fluid so as not to disorient the remote participant. To ensure that written text is fully legible, the character height must be approximately 1/4 the height of the viewfinder. Finally, 15 seconds of silence should be recorded at the beginning of each lecture. This allows the room noise to be filtered out later, allows the instructor to prepare for the “take,” and permits the camera to get up to full speed internally.

Step 2: Capturing the Video Lecture to the Hard Drive

Capturing the video to the hard drive has four requirements: a video capture card, a sound card, an audio/visual (AV) rated SCSI hard drive, and a CPU powerful enough to convert the torrent of digital video data into the Windows AVI format and write it to disk.

We first experimented with a Radius MotoDV capture card (Radius, $420) because it allowed for the transfer of pristine digital video data via its IEEE 1394 “Firewire” interface. Unfortunately, the 740x480 pixel window size of the captured video, while phenomenally clear, required nearly an hour per minute of video for the first round of compression. The Osprey 100 capture card (RealNetworks, $200) was tested next. Although the digital-to-analog-to-digital conversion process introduced noise into the video signal, the 160x120 pixel window size required only three minutes per minute of video for the initial compression. The Osprey card was therefore selected for this project.

The sound card is responsible for the capture of the audio component of the video track.
The SoundBlaster AWE64 ProGold card ($149) has performed excellently - do not settle for anything that cannot handle at least 16-bit sound at 44 kHz.

An AV-rated hard drive is one that is guaranteed to handle the rigors of audio-visual recording. Specifically, these hard drives do not pause to perform thermal recalibrations. Two Quantum Atlas III 18 GB hard drives (NetExpress, $1200 each) were selected to handle the projected 10 GB per hour lectures. Over the last three months, these drives have been reliably filled and erased on a near-daily basis.

Because digital video (DV) is not amenable to direct editing, it must be converted to either a Windows AVI or a Macintosh MOV file as it is captured. A Pentium II 333 PC with 384 MB RAM (Gateway, $3500) easily handled this chore. A critical obstacle in the acquisition of the video data was the 2 GB file-size limit inherent to the AVI file type. Because of this limit, only about 13 minutes of video could be recorded per file. This in turn required careful hand editing in Adobe Premiere 5.1 to splice the files together. A 15-second overlap between segments ensures a good splice point.

Step 3: Editing and Archiving the Video Data

As stated previously, Adobe Premiere 5.1 (UT Computer Store, $383) was used to splice the video segments together [1]. Once the video was reassembled, the audio track was exported as an AVI file for post-processing with CoolEdit 96 (CoolEdit, $50). The 15-second room-noise sample recorded at the start of the lecture was fed to an FFT-based noise-reduction algorithm to remove the offending frequencies from the lecture. In the authors’ opinion, this is the most valuable of all post-processing steps. The cleaned audio was then visually inspected for aberrations - noise spikes due to dropping a marker, clearing one’s throat, closing a door, etc. These aberrations were isolated and their volume reduced to match the average lecture volume. The entire track was then amplified to 95% of the soundcard’s internal cutoff.
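The noise-removal and amplification steps described above can be illustrated with a short sketch. This is not CoolEdit's algorithm, which the paper does not describe; it is a basic spectral-subtraction scheme chosen here for illustration, written with only the Python standard library. All function names (`noise_profile`, `reduce_noise`, `normalize_peak`) and the frame length of 256 samples are assumptions made for this example, samples are assumed to be floats with full scale at 1.0, and the manual attenuation of isolated noise spikes is not modeled.

```python
import cmath


def fft(values):
    """Recursive radix-2 FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return [complex(values[0])]
    even = fft(values[0::2])
    odd = fft(values[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def ifft(spectrum):
    """Inverse FFT computed with the conjugate trick."""
    n = len(spectrum)
    return [v.conjugate() / n for v in fft([s.conjugate() for s in spectrum])]


def split_frames(samples, frame):
    """Cut samples into frames of equal length, zero-padding the last one."""
    padded = list(samples) + [0.0] * ((-len(samples)) % frame)
    return [padded[i:i + frame] for i in range(0, len(padded), frame)]


def noise_profile(noise, frame=256):
    """Average magnitude spectrum of the room-noise lead-in (assumed frame size)."""
    spectra = [[abs(v) for v in fft(f)] for f in split_frames(noise, frame)]
    return [sum(column) / len(spectra) for column in zip(*spectra)]


def reduce_noise(samples, profile):
    """Subtract the noise magnitude from every frame, keeping the original phase."""
    cleaned = []
    for f in split_frames(samples, len(profile)):
        kept = [cmath.rect(max(abs(v) - p, 0.0), cmath.phase(v))
                for v, p in zip(fft(f), profile)]
        cleaned.extend(v.real for v in ifft(kept))
    return cleaned[:len(samples)]


def normalize_peak(samples, target=0.95):
    """Scale the track so its loudest sample sits at `target` of full scale."""
    peak = max(abs(s) for s in samples)
    return list(samples) if peak == 0 else [s * target / peak for s in samples]
```

A typical call sequence would build the profile from the silent lead-in, for example `profile = noise_profile(track[:lead_in_samples])`, then apply `normalize_peak(reduce_noise(track, profile))`. The sketch uses non-overlapping rectangular frames to stay short; practical noise reducers generally use overlapping windowed frames to avoid audible artifacts at frame boundaries.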
The post-processed audio track was then recombined with the video track in Premiere and the entire lecture was compressed using the Indeo 5.06 codec. Applying multiple rounds of compression is traditionally considered bad practice, because the losses incurred by compressing a video once are frequently revealed when it is compressed again. However, the first round of compression was necessary due to the 2 GB file-size limit. The Indeo 5.06 codec at 100% quality compressed the 10 GB file to approximately 500 MB with minimal losses. The advantage was twofold - the source file was small enough to handle with Windows, and it could be burnt onto a compact disc for archival purposes. Our CD burner, a Yamaha 4260 CDRW (NetExpress, $420), has proven quite reliable for this procedure.

Step 4: Final Compression

The final step is to compress the 500 MB file using RealMedia’s proprietary compression technology [2]. Here another theorem is appropriate: increasing the quality of the video increases the bandwidth requirement, i.e., the type of Internet connection needed, which decreases the potential viewing audience. Thus, a video encoded for reception through a 112K dual ISDN connection may look great but cannot be received by any remote viewer with a 56K dial-up connection. It was therefore advantageous to encode for several bandwidths - 28.8K modem, 56K modem, and 112K dual ISDN - and allow remote viewers to choose the stream that matches their connection*. We have found that setting the optimized frame rate to maximum sharpness and the audio to mono yields superb results.

*Note that the recently released Real G2 server automatically adjusts the signal bandwidth to accommodate the user’s Internet connection.

Step 5: Upload to Server

It is suggested that a separate machine be acquired to serve the RealMedia files. If live broadcast is desired, a dedicated server is required.
For this purpose we used a 333 MHz Pentium II computer running Debian Linux, with a 9 GB SCSI hard drive, 128 MB of memory, and a 10 MB/sec Ethernet connection.
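The file-size and bandwidth figures quoted throughout this paper can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check under stated assumptions: sizes use decimal units (1 MB = 10^6 bytes), the 2 GB AVI limit is read as 2^31 bytes, connection speeds are nominal line rates with no allowance for protocol overhead, and the function names are hypothetical helpers introduced for this example.

```python
def average_bitrate_kbps(size_bytes, duration_s):
    """Average bit rate, in kilobits per second, needed to stream a file in real time."""
    return size_bytes * 8 / duration_s / 1000


def compression_ratio(original_bytes, compressed_bytes):
    """How many times smaller the compressed file is than the original."""
    return original_bytes / compressed_bytes


def minutes_until_limit(limit_bytes, bytes_per_hour):
    """Minutes of capture that fit in one file before a size limit is reached."""
    return limit_bytes / bytes_per_hour * 60


def receivable_streams(connection_kbps, encoded_kbps=(28.8, 56.0, 112.0)):
    """Encoded streams whose nominal rate does not exceed the viewer's connection."""
    return [rate for rate in encoded_kbps if rate <= connection_kbps]


if __name__ == "__main__":
    GB, MB = 10**9, 10**6  # decimal units (assumption)
    # A one-hour lecture compressed to 6 MB averages about 13.3 kbps,
    # which fits within a 28.8 kbps modem link.
    print(average_bitrate_kbps(6 * MB, 3600))
    # Capture at 10 GB per hour reaches a 2**31-byte limit after about 12.9 minutes.
    print(minutes_until_limit(2**31, 10 * GB))
    # First pass (Indeo) and overall reduction.
    print(compression_ratio(10 * GB, 500 * MB))
    print(compression_ratio(10 * GB, 6 * MB))
    # A 56K dial-up viewer can choose the 28.8K or 56K stream, but not the 112K one.
    print(receivable_streams(56.0))
```

Under these assumptions the numbers agree with the text: the 6 MB lecture needs well under 28.8 kbps on average, and the roughly 13-minute capture segments follow directly from the 2 GB limit at 10 GB per hour.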
