<<

Index

Symbols Lingo script, 46 A-law, 148, 152, 166 object, 47 compander, 232 tweening, 44 µ-law, 148, 152, 166, 435 Adobe Flash, 47 compander, 232 animation, 50 2D mesh, 375 symbol, 49 geometry coding, 376 window, 48 motion coding, 378 Adobe Flash , 619 object coding, 375 Adobe Photoshop, 41 2D object animation, 379 alpha , 41 3D model-based coding, 381 magic wand tool, 42 3D percept, 130 Adobe Premiere, 39 3D polygon mesh, 383 timeline window, 39 3D video and TV, 130 ADPCM (adaptive differential pulse code 3G, 582, 583, 589 modulation), 165, 175, 435–437 G3G (Global 3G), 583 Affine transform, 380 4G, 584, 589 Alias, 142–144 Amazon web services (AWS), 14, 649, 657, A 659 AC (Alternate Current), 236 Amazon EBS, 652 Access network, 486, 489, 494, 564 Amazon EC2, 647, 650, 660 Access point (AP), 576, 586–588, 608 Amazon machine image (AMI), 651 Access time, 545 Amazon S3, 645, 647, 649, 660 Active , 118 AWS region, 649, 651 Active video line, 118 Cloudfront, 657 AD (analog-to-digital) converter, 149, 175 AMPS (advanced mobile phone system), 577 Adaptive compression algorithms, 196 Anaglyph 3D, 132 Adaptive , 196 Analog display interface, 126 Adobe Director, 42 Analog video, 115 3D Sprite, 47 Animation, 18 animation, 44 Autodesk 3ds Max, 19 control, 44 Autodesk Maya, 19 Imaging Lingo, 43 Autodesk Softimage, 19 Lingo, 43 DirectX, 18

Z.-N. Li et al., Fundamentals of Multimedia, 715 Texts in Computer Science, DOI: 10.1007/978-3-319-05290-8, © Springer International Publishing Switzerland 2014 716 Index

Java3D, 18 Bi-level standards, also see OpenGL, 18 JBIG, 309 Anti-, 143 Bilinear interpolation, 346, 355 filter, 143 Binary tree, 190 APC (adaptive predictive coding), 176 Bitmap, 58 Application-layer multicast, 506, 553 Bitplane, 59 end-system multicast (ESM), 555 Bitrate, 151, 195, 225, 267, 270, 317, 325, 332, multi-tree overlay, 556 430, 507, 509, 551, 590 single tree overlay, 555, 556 Block codes, 594 , 205, 372 Block-based coding, 359, 363 adaptive arithmetic coding, 215, 216 Blu-ray, 14 basic algorithm, 206 Bluetooth, 589 binary arithmetic coding, 214, 413, 415 BMP (bitmap), 75 integer implementation, 214 Broadcast, 503, 550 scaling and incremental coding, 210 Buffer management, 514 tag, 208, 210 peak bitrate, 515 Aspect ratio, 118, 123, 125 prefetch buffer, 515 ATM (asynchronous transfer mode), 496, 511 ATV (advanced TV), 348 C Audio compression standard C-BIRD, 680 G.711, 437, 523 search by , 680, 682 G.721, 436 search by illumination invariance, 685 G.722, 523 search by object model, 686 G.723, 436 search by texture, 683 G.723.1, 447, 449 CABAC (context-adaptive binary arithmetic G.726, 436, 437 coding), 396, 419 G.727, 436 Cable modem, 486, 492 G.728, 436, 450 Cable TV network, 489, 492 G.729, 436, 447, 450 CAE (context-based arithmetic encoding), 370 Audio filtering, 150 Camera system, 85 CAVLC (context-adaptive variable length Autostereoscopic display device, 135 coding), 396 lenticular lens, 135 CBIR (content-based image retrieval), 675, parallax barrier, 135 680, 703 AVC (, also see 3D shapes and objects, 677 H.264/AVC), 395 early CBIR systems, 678 AVCHD (advanced video coding high histogram intersection, 678 definition), 18 human activity, 700, 703 AVI (audio video interleave), 39 key technologies, 692 quality-aware, 706 B quantifying search results, 688 BAB (binary alpha block), 370 video retrieval, 679, 697 Bag of words (BoW), 694 CCIR (consultative committee for international Band-limited signal, 143 radio), 122 Band-limiting filter, 151 CCITT (international telegraph and telephone Band-pass filter, 151, 158, 166 consultative committee), 168, 325 Bandwidth, 151, 166, 489, 512, 531, 566, 577, CDMA (code division multiple access), 578, 618, 628, 645, 652, 662, 670 609 Base station (BS), 576, 579, 585, 608 cdma2000, 583 BCH (Bose-Chaudhuri-Hocquenghem) codes, WCDMA (wideband CDMA), 582, 609 595 CELP (code excited linear prediction), 444 Index 717

adaptive codebook, 445 spectra sensitivity of the eye, 83 LSF (line spectrum frequency), 448 visible light, 82 LSP (line spectrum pair), 447 Color-matching function, 88, 89 LTP (long time prediction), 445 Commission internationale de L’eclairage stochastic codebook, 448 (CIE), 89 STP (short-time prediction), 445 Compression Checksum, 498, 500, 591 lossless, 62, 74, 186 , 122, 407, 417, 419 lossy, 62, 74, 186, 282 Chromaticity, 90 ratio, 74, 186 diagram, 89 speech, 166 , 106 Compressor function, 231 CIF (common intermediate format), 123 Cones, 83 Circuit switching, 495 Constant rate (CBR), 501, 515, 546 Client/server, 531, 532, 540, 567 Content delivery network (CDN), also see Cloud computing, 645 content distribution network (CDN), cloud gaming, 667 539 computation offloading, 661 Content distribution network (CDN), 506, 539, infrastructure as a service (IaaS), 647 541, 542, 565, 629, 657 platform as a service (PaaS), 648 Akamai, 506, 542, 660 private cloud, 647 Context modeling, 305 public cloud, 647 Continuous (CFT), 247 software as a service(SaaS), 648 Continuous (CWT), 252, Cloud gaming, 665 256 Gaikai, 15, 668 Convolutional codes, 596 Onlive, 668 Coordinated live streaming and storage sharing Clustering, 67 (COOLS), 638 CMY, 102 CPU (central processing unit), 651, 652, 661, CMYK, 103 662, 665 , 186, 477 CRC (cyclic redundancy check), 488, 592 Coder mapping, 165 CRT (cathode ray tube), 86, 116 Codeword, 186 CSS (cascading style sheets), 11 Coding efficiency, 430 Color D camera-dependent, 100 DA (digital-to-analog) converter, 149 cycling, 65 Datagram, 498 density, 682 DB (decibel), 144 histogram, 64, 680 DC (Direct Current), 236 HSV, 101 Decoder mapping, 165 layout, 682 , 116 lookup table (LUT), 67 Delaunay mesh, 377 monitor specification, 93 Dictionary-based coding, 200 multi-ink printers, 104 Difference of Gaussian (DOG), 693 multisensor cameras, 100 Differential coding, 168, 218 palette, 65 Differentiated service (DiffServ), 513 palette animation, 65 diffServ code (DS), 513, 514 picker, 65 per-hop behavior (PHB), 513, 514 primaries, 88 Digital audio, 16 sRGB, 101 Adobe Audition, 16 subcarrier, 119 coding of, 165 Color science, 81 quantization and transmission, 164 light and spectra, 81 Sound Forge, 16 718 Index

Digital display interface, 128 E Digital library, 185 EBCOT (Embedded block coding with Digital subscriber line (DSL), 491 optimized truncation), 270, 295, ADSL (asymmetrical DSL), 486, 491, 493, 298, 301, 303 564 EDTV (enhanced definition TV), 126 Digitization of sound, 139 End-to-end argument, 503 Discrete Cosine Transform (DCT), 234, 236, Entropy, 186, 187 241 coding, 187, 189 1D, 235, 236 Epidemic model, 634 2D, 234, 235, 281 Error concealment, 500, 603 2D basis function, 244 Error detection, 590 2D matrix implementation, 245 Error-resilient coding, 597 2D separable basis, 244 Error-resilient entropy coding (EREC), 600 basis function, 236, 242 Ethernet, 488 comparison to DFT, 247 CSMA/CD (carrier sense multiple access DCT-matrix, 246, 400, 424 with collision detection), 489 Discrete Fourier Transform (DFT), 247 Euler’s formula, 247 (DST), 419, 425 Excited purity, 93 Discrete Wavelet Transform (DWT), 252, 259 EXIF (exchangeable image file), 76 Disparity, 131 Exp-Golomb code, 215, 409, 410, 413 gradient, 136 Expander function, 232 manipulation, 136 mapping, 136 Extended Huffman coding, 194 range, 136 EZW (Embedded Zerotree Wavelet), 270–273 sensitivity, 136 velocity, 136 F Dispersion, 82 F-score, 711 Distortion measure, 225 Facebook, 14, 617, 619 MSE (Mean Square Error), 226 Fax standards PSNR (Peak Signal-to-Noise Ratio), 226 G3, 309 SNR (Signal-to-Noise Ratio), 226 G4, 309 Dithering, 59, 66 FDMA (frequency division multiple access), dither matrix, 60 577, 578, 580 ordered dither, 61 Firewall, 486, 502, 503 DM (), 174 First person shooter (FPS) game, 666 adaptive, 175 FM (frequency modulation), 152, 154 uniform, 175 Forward error correction (FEC), 593 DMT (discrete multi-tone), 491 Fourier transform, 251 Domain name system (DNS), 496 Frame buffer, 59, 100 DPCM (differential pulse code modulation), 165, 168, 171, 289 Frame-based coding, 359, 363 Dropbox, 645 Free rider, 564 DV video (), 122 Frequency, 236 DVB (digital video broadcasting), 544 frequency response, 236, 244 DVB-MHP (multimedia home platform), frequency spectrum, 236 544 spatial frequency, 236, 244 DVD, 9, 348, 349, 475, 479, 658 Frequency hopping (FH), 589 Dynamic adaptive streaming over HTTP FTTH (fiber-to-the-home), 493 (DASH), 480, 565, 660 FTTN (fiber-to-the-neighborhood), 493 media presentation description (MPD), 565 FTTN (fiber-to-the-node), 493 , 168 Fundamental frequency, 140 Index 719

G H.263+, 336 , 86, 96, 100 H.263++, 336 , 94 H.264, 395, 620, 668 printer, 103 CABAC, 409, 413 Gaussian distribution, 277 CAVLC, 409, 411 Generalized Markup language (GML), 10 entropy coding, 409 GIF (graphics interchange format), 69 (GOP), 399 color map, 71 hierarchical prediction structure, 399 GIF87, 72 in-loop deblocking filtering, 407, 408 GIF89, 72 integer transform, 401 interlacing, 72 intra coding, 404 screen descriptor, 69 intra spatial prediction, 404 GIF(graphics interchange format) , 396 animation, 19 multiple reference frame, 399 Global motion compensation, 374 MVC (multiview video coding), 417 Gossip algorithm, 558, 560 profiles, 415 GPRS (general packet radio service), 580 baseline profile, 415 GPS (global positioning system), 589 extended profile, 416 GPU (), 652, 661, 666 high 10 profile, 417 Granular distortion, 229 high 4:2:2 profile, 417 Graphics, 17 high 4:4:4 predictive profile, 417 fireworks editing, 17 high profile, 417 freehand editing, 17 main profile, 416 Gray-level quantization, 402 image, 219 quarter-pixel precision, 397 intensity, 188 scaling, 403 Grayscale, 73 SVC (scalable video coding), 417 Group of pictures (GOP), 399 , 396, 399 GSM (global system for mobile communica- variable block-size motion compensation, tions), 578, 579 396 GSTN (general switched telephone network), H.264/AVC (or H.264/MPEG-4 AVC), 395 523 H.265, 418, 419, 567 CABAC, 428 H discrete sine transform (DST), 425 H.261, 317 entropy coding, 428 bitstream, 330 in-loop deblocking filtering, 419, 427 block layer, 332 integer transform, 424 encoder and decoder, 328 integer transform matrix, 424 formats supported, 326 intra coding, 425 GOB (group of blocks) layer, 331 motion compensation, 419 inter-frame (P-frame) coding, 327 profiles, 429 intra-frame (I-frame) coding, 326 main 10 profile, 429 layer, 332 main profile, 429 picture layer, 331 main still picture profile, 429 quantization, 328 quadtree prediction structure, 419 step size, 328 quantization, 425 H.262, 348 quarter-pixel precision, 421 H.263, 317, 332, 619 SAO (sample adaptive offset), 419, 427 motion compensation, 333 slice, 420, 598 optional coding mode, 334 special coding modes, 429 PB-frame, 335 I_PCM, 429 720 Index

lossless, 429 IETF (Internet Engineering Task Force), 494, transform skipping, 429 501, 504 tile, 420 IGMP (Internet Group Management Protocol), transform coding, 419, 420, 424 504 variable block-size motion compensation, Image 419 24-bit, 62 wavefront parallel processing (WPP), 420 8-bit, 63 H.26L, 395 data type, 62 Half-pixel precision, 334 descriptor, 680 Halftone printing, 60 fireworks editing, 17 Hamming codes, 594 formation, 84 Hamming distance, 594 high bit-depth, 62 Handoff (Handover), 577, 606, 608 histogram, 188 soft handoff, 609 hyperspectral, 63 vertical handoff, 610 monochrome, 57 Harmonic, 140 multispectral, 63 Hartley’s law, 186 Photoshop editing, 17 HDTV (high definition TV), 124, 126, 417, quality, 703 430 resolution, 58 HEVC (high efficiency video coding), 418 retrieval, 675 Hidden terminal problem, 586 Image processing, 64, 236 Hierarchical JPEG, 291, 513 theory, 186 Integer transform, 399, 401, 424 Homogeneous coordinate system, 380 Integral imaging, 135 Horizontal parallax, 131 Intellectual property management and Horizontal retrace, 116 protection (IPMP), 390 HPLabs media server, 621, 628 Interaction delay, 666, 670 HTML (HyperText Markup Language), 10, 52, Interactive TV (ITV), 544 69 Interlaced scanning, 115 HTTP (HyperText Transfer Protocol), 486, Internet, 13, 485, 494, 516 502, 516, 564, 649 Internet protocol TV (IPTV), 37 HTTP-based streaming, 563 Internet telephony, 522 Huffman coding, 192, 599 IP (Internet Protocol), 496, 516 optimality, 193 IP address, 496–498, 501, 503, 520, 521, prefix property, 192 540, 559, 606, 647, 668 procedure, 196 IPv4 (IP version 4), 496, 497, 503 tree, 198 IPv6 (IP version 6), 501, 503 Human vision, 83 packet fragmentation, 496 Hybrid coding, 318, 342, 395, 396, 419 IP multicast, 503, 551, 553 Hybrid excitation vocoder, 450 MBone, 505 MBE (multiband excitation), 450 ISDN (integrated services digital network), 579 MELP (multiband excitation linear ISP (Internet service provider), 486, 490, 542 predictive), 447, 452 ITU (international telecommunication union), Hypermedia, 9 325, 436, 437, 440, 523, 582, 583, 585, 598 I ITU-R (radiocommunication sector), 122 IDCT (Inverse Discrete Cosine Transform), ITU-T (telecommunication standardization 235 sector), 122, 325, 395, 418, 490 1D, 235 2D, 235 J 2D matrix implementation, 246 Jamming, 580 Index 721

JBIG (joint bi-level image experts group), 310 Median-cut algorithm, 67 JBIG2, 310 Medium access control (MAC), 488 Jitter, 508, 538, 652 Memory, 589, 594, 596, 651, 652, 661, 666 JPEG (joint photographic experts group), 9, Microcell, 577 73, 281, 283, 513 MIDI (musical instrument digital interface), baseline, 289 154 DCT, 283 banks, 157 entropy coding, 289 channel, 155 main steps, 281 channel messages, 160 mode, 290 channel mode, 162 zigzag scan, 288 channel pressure, 161 JPEG-LS, 305 conversion to WAV, 164 JPEG2000, 293 key pressure, 161 keyboard, 156 K MIDI machine control (MMC), 158 KLT (Karhunen-Loève transform), 234 patch, 157 sequencer, 156 L system messages, 160, 163 LAB color model, 97 tone modules, 156 LAN (local area network), 486, 489, 495 velocity, 157, 161 Latency, 508, 585 voice messages, 160 Layered multicast, 551 MiniDV, 119 multicast enhanced loss-delay based MMR (modified modified read) algorithm, 370 adaptation (MLDA), 553 Mobile IP, 606 receiver-driven layered congestion control access router (AR), 606 (RLC), 553 correspondent node (CN), 607 receiver-driven layered multicast (RLM), foreign agent (FA), 606 551 hierarchical mobile IP (HMIP), 608 Line-of-sight (LOS) communication, 573 home address (HoA), 606 LOCO-I (low complexity home agent (HA), 606 for images), 305 mobile node (MN), 606 Lookup table (LUT), 63, 65, 67 Mobile visual search, 703, 705 Lossless image compression, 218 Mobility management, 605 Lossless JPEG, 219, 293 global mobility, 606 encoder, 220 interdomain mobility, 606 predictor, 220 intradomain mobility, 606 Lossy image compression, 225 macromobility, 606 Low-pass filter, 151 micromobility, 606 LPC (), 442, 449 Model-based coding, 311 LPC-10, 442 MOS (mean opinion score), 431 LTE (long term evolution), 15, 585 Motion compensation (MC), 318, 363 Luminance, 106 backward prediction, 319 LZW (Lempel-Ziv-Welch), 69, 72, 76, 200 forward prediction, 319 , 318, 662 M Motion JPEG, 291 Macroblock, 318, 395, 396, 419, 597–600, Motion vector, 319, 604 602, 603 2D logarithmic search, 321 Macrocell, 577 hierarchical search, 322 Mean absolute difference, 320 sequential search, 320 Mean average precision (MAP), 690, 697 MPEG (moving picture experts group), 341, Media-on-demand (MoD), 543, 544 435 722 Index

MPEG audio compression, 457 level, 349 bark, 463 modes of prediction, 350 bit allocation, 471 profile, 348, 349 bit reservoir, 474 program stream, 358 critical band, 462 spatial scalability, 355 equal-loudness curves, 458 temporal scalability, 355 frequency masking, 458, 460 transport stream, 358 MDCT (modified discrete cosine transform), MPEG-2 AAC (advanced audio coding), 475 473 low complexity profile, 475 MNR (mask-to-noise ratio), 471 main profile, 475 MP3, 9, 466, 473, 474 PQF (polyphase quadrature filter) bank, 476 MPEG layers, 466 scalable sampling rate profile, 475 Layer 1, 466 TNS (temporal noise shaping) tool, 475 Layer 2, 466, 473 MPEG-4, 359, 362, 532, 567, 599, 668 Layer 3, 466, 473 audio and visual objects, 360 , 458 BAPs (body animation parameters), 383 scale factor band, 474 BDPs (body definition parameters), 383 SMR (signal-to-mask ratio), 471 BIFS (binary format for scenes), 361 SNR (signal-to-noise ratio), 471 binary shape, 370 temporal masking, 464 body object, 381, 382 threshold of hearing, 460 face object, 381 MPEG working model FAP (face animation parameter), 382 SM (simulation model), 385 FDP (face definition parameter), 382 TM (test model), 385 grayscale shape, 370 VM (verification model), 385 group of video object plane (GOV), 362 XM (experimentation Model), 385 level, 383 MPEG-1, 8, 341 media objects, 360 B-frame, 342 part 10, also see H.264, 360, 383, 395 bitstream, 346 part 2, 360, 383 block layer, 348 profile, 383 D-frame, 347 static texture coding, 373 differences from H.261, 344 synthetic object coding, 375 Group of pictures (GOPs) layer, 347 texture coding, 368 I-frame, 342 uniform mesh, 376 macroblock layer, 348 video object (VO), 362 motion compensation, 342 video object layer (VOL), 362 P-frame, 342 video object plane (VOP), 362 performance of, 346 video object-based coding, 362 picture layer, 347 video-object sequence (VS), 362 prediction, 342 video-object-based coding, 360 quantization, 344 wavelet coding, 373 sequence layer, 346 MPEG-4 AAC (advanced audio coding), 476 slice, 344 BSAC (bit-sliced arithmetic coding), 476 slice layer, 347 DAB+ digital radio, 476 transform coding, 342 digital multimedia broadcasting, 476 MPEG-2, 348 digital radio Mondiale, 476 alternate scan, 352 high definition advanced audio coding data partitioning, 358 (HD-AAC), 476 differences from MPEG-1, 358 high efficiency advanced audio coding hybrid scalability, 357 (HE-AAC), 476 , 349 perceptual coders, 476 Index 723

perceptual noise substitution, 476 N SAOL (structured audio orchestra language), Netflix, 15, 645, 658 477 Netpbm format, 76 SNHC (synthetic/natural hybrid coding), 477 NetTube, 631 structured coder, 477 Network TTS (text-to-speech), 477 application layer, 486 MPEG-7, 384 connection-oriented, 497 description definition language (DDL), 390 connectionless, 499 description scheme (DS), 387 data link layer, 486 Descriptor (D), 385 network layer, 486 MPEG-7 audio, 479 physical layer, 485 MPEG-J, 361 presentation layer, 486 MPEGlet, 361 session layer, 486 Multi-rate multicast, 550 transport layer, 486 Multicast, 503, 550 Network address translation (NAT), 501, 564 Multimedia, 3 NMT (nordic mobile telephony), 577 Non line-of-sight (LOS) communication, 575 history of, 5 NTSC (National System subjects, 20 Committee), 87, 93, 342 tools, 15 Nyquist Multimedia authoring, 19 frequency, 143 Director, 19 rate, 143, 166, 175 Dreamweaver, 20 theorem, 142 Flash, 19 tools, 39 O Multimedia cloud computing, 652 Octave, 140 Multimedia messaging service (MMS), 580 OFDM (orthogonal frequency division Multimedia presentation, 26 multiplexing), 584, 587 graphics style, 26 Online social network, 540, 618, 632 sprite animation, 29 free-rider, 634 video transition, 29 ordinary user, 634 Multimedia production, 35 spreader, 634 flowchart phase, 36 ONU (optical networkunit), 492 prototyping and testing, 36 Orthogonal, 242 storyboard, 35 Orthonormal, 243 Multipath fading, 574 basis, 259 Gilbert-Elliott model, 575 OSI (open systems interconnection), 485, 486, rake receiver, 576 494 Rayleigh fading model, 575 Out-of-gamut color, 94 Rician fading model, 575 Overlay network, 506, 554 Multiple input/multiple output (MIMO), 584, 585, 588 P Multiresolution analysis, 252, 253 Packet interleaving, 597 Multisampling, 158 Packet switching, 495 Munsell color naming system, 99 datagram, 495 MUSE (multiple sub-nyquist sampling virtual circuit, 495 encoding), 124 Padding, 365 Music sequencing, 16 extended padding, 367 Cakewalk Pro Audio, 16 horizontal repetitive padding, 366 Finale, 16 vertical repetitive padding, 366 Sibelius, 16 PAL (phase alternating line), 93, 121, 342 724 Index

Parity checking, 590, 593 PSNR (Peak Signal-to-Noise Ratio), 226, 305, Partial, 140 430 fundamental frequency, 140 PSTN (Public Switched Telephone Network), harmonic partial, 140 495, 523 overtone, 140 PTM (polynomial texture mapping), 77 Path loss, 573 PBM (portable bitmap), 76 Q PCM (pulse code modulation), 130, 165–167, QA paradigm, 696 435, 436, 513 QAM (quadrature amplitude modulation), 491, PDF (portable document format), 76 492 Peer-to-peer, 14, 506, 557, 631, 656, 657 QCIF (Quarter-CIF), 123 BitTorrent, 14, 557, 649 QP (quantization parameter), 402, 404, 408, buffer map, 560 425 chunkyspread, 562 QPSK (quadrature phase-shift keying), 492 CoolStreaming, 14, 558 Quadrature modulation, 119 data-driven overlay, 557 Quality hybrid tree and mesh overlay, 562 objective assessment, 430 mesh overlay, 557, 631 quality factor, 284 PPLive, 558 subjective assessment, 431 PPStream, 558 assessment, 430 PPTV, 657 Quality of service (QoS), 500, 506, 507, 509, tree overlay, 638 510, 513, 519, 531, 566, 662 Quality-of-experience (QoE), 666–668 UUSee, 558 Quantization, 142, 283 Perceptual nonuniformity, 149, 178, 462, 509 dead-zone, 228, 296, 404 Pervasive computing, 586 decision boundary, 165 PGM (portable graymap), 76 distortion, 172 Pitch, 140, 156 error, 145, 172 Pixel, 57 linear format, 147 Pixel clock, 119 noise, 145 PNG (portable network graphics), 74 nonuniform, 147 alpha-channel, 75 nonuniform quantizer, 231 interlacing, 75 companded quantizer, 231 Polyphony, 157 Lloyd–Max quantizer, 231 Post compression rate-distortion (PCRD), 300 reconstruction level, 165 Postscript, 76 uniform, 147 POTS (plain old telephone service), 522, 579 uniform scalar quantizer, 228 PPM (portable pixmap), 76 midrise, 228 Precision, 688 midtread, 228, 404 Predictive coding , 232, 694 lossless, 168 codebook, 232 lossy, 168 Quantizer Profile, 383 backward adaptive, 437 Progressive JPEG, 291, 513 Jayant, 438 Progressive scanning, 115 Lloyd-Max, 171 Proxy caching, 532, 629 Query expansion, 695 hit-ratio, 629 prefix caching, 535 R rate-split caching, 536 Random graph, 623 segment caching, 535 Rate-distortion, 226, 404 sliding-interval caching, 533 rate-distortion function, 226 Index 725

RCPC (rate compatible punctured convolu- binary shape coding, 370 tional) codes, 596 grayscale shape coding, 372 Real time strategy (RTS) game, 667 SIF (source input format), 342 Recall, 688 SIFT (scale invariant feature transform), 692 Receiver operating characteristic (ROC), 691, canonical orientation, 693 692 descriptor, 693 Reed–Solomon (RS) codes, 595 key point, 693 Reference frame, 319, 399, 599, 600, 603, 605, Signal processing, 151, 236 662 SIP (Session Initiation Protocol), 523 Region of interest (ROI), 294, 303 Skype, 14 Regular graph, 623 Small world, 623 Relevance feedback, 694 Smart TV, 544, 545 Retina, 83 SMPTE (society of motion picture and RFC (request for comments), 494 television engineers), 87, 93 RGB, 31, 100, 386 SMPTE-170M, 87 RGB to CMY, 102 SMS (short message service), 579 RLC (run-length coding), 189 SNR (signal-to-noise ratio), 144, 491 Rods, 83 Social media, 617, 619, 623, 628, 629 Role playing game (RPG), 666 Social network, 617, 618 Round trip time (RTT), 501, 553, 556 Sound RSVP (Resource ReSerVation Protocol), 511 card, 154 RTCP (RTP Control Protocol), 519, 521, 522, digitization, 140 531, 533, 564 wave, 139 RTP (Real-time Transport Protocol), 518, 522, Spatial 531, 533, 564 domain, 218 RTSP (Real-Time Streaming Protocol), 520, frequency, 73, 107 522, 531, 533, 564 redundancy, 282, 318 Run-length encoding (RLE), 75, 288 Spectral power distribution (SPD), also see RVLC (reversible variable-length code), 599 spectrum, 82 Spectrophotometer, 81 S Spectrum, 82 SA-DCT (shape-adaptive DCT), 368 locus, 91 Sampling, 140 SPIHT (Set Partitioning in Hierarchical Trees), alias frequency, 144 270, 277 fixed rate, 143 Spread spectrum, 580 folding frequency, 144 Sprite, 373 frequency, 141 coding, 373 nonuniform, 142 SQNR (signal to quantization noise ratio), 145 rate, 142 SRGB (standard RGB), 88 true frequency, 144 Standard Generalized Markup Language uniform, 142 (SGML), 10 SAP (Session Announcement Protocol), 524 Stereopsis, 131 SDP (Session Description Protocol), 524 Stream replication, 550 SDTV (standard definition TV), 125 Streaming media, 507, 531, 532, 540, 542 SECAM (systeme electronique couleur avec Structural similarity (SSIM), 431, 669, 707 memoire), 93, 121 Subband, 262 Sequencer, 159 Support Vector Machine (SVM), 696, 705, Sequential JPEG, 291 706, 708 Set-top box (STB), 126, 544 Surface spectral reflectance, 84 Shannon–Fano algorithm, 189 Susceptible-infectious-recovered (SIR) model, Shape coding, 370 634 726 Index

Sync skew, 508 V Synthesizer, 156, 159 Variable (VBR), 507, 515, 522, 536, Synthetic sound, 152 539 Variable-length coding (VLC), 185, 189, 335 Vergence-accommodation conflict, 133, 134 T Vertical retrace, 116 TACS (total access communication system), VESA (video electronics standards associa- 577 tion), 128–130 Target frame, 319 Video broadcasting, 545 TCP (Transmission control protocol), 497 batching, 545 acknowledgement (ACK), 498 harmonic broadcasting, 547 additive increase and multiplicative decrease pyramid broadcasting, 546 (AIMD), 499, 553 staggered broadcasting, 546 port number, 498, 501, 525, 564 stream merging, 549 retransmission timeout, 499 Video card, 59 window, 497 Video conferencing, 325, 332, 415, 504 TDMA (time division multiple access), 578 Video display interfaces, 126 Television systems Video editing, 17 NTSC (National Television System after effects, 18 Committee), 118, 127 final cut pro, 18 PAL (phase alternating line), 121 Premiere, 17 SECAM (systeme electronique couleur avec Video quality assessment (VQA), 430, 431 memoire), 121 Video signals, 126 Temporal redundancy, 318 Video transitions, 40 Texture Video-on-demand (VoD), 507, 543–545 analysis, 684 Vocoder, 439 layout, 683 CELP (code excited linear prediction), 444 TIFF (tagged image file format), 75 channel vocoder, 439 Timbre, 156 formant vocoder, 441 Transducer, 140 LPC (linear predictive coding), 442 Transform coding, 233, 399, 424 phase insensitivity, 439 Tristimulus values, 90 Voice-over-IP (VoIP), 493, 507, 522 Twitter, 15, 619 tweets, 619 W vine, 619 W3C (World Wide Web Consortium), 13, 390 Two-sided geometric distribution (TSGD), 309 WAN (wide area network), 495 Wave table, 152 data, 158 U file, 154 µ U-law (also see -law), 148, 149, 152 synthesis, 153 Ubiquitous computing, 573, 586 Wavelength dominant, 93 UDP (User Datagram Protocol), 499, 501, 502, Wavelet, 251 520, 525, 526, 531, 564, 668 admissibility condition, 257 port number, 500 analysis filter, 261 UHDTV (ultra high definition TV), 126, 419, basis, 259 429 biorthogonal, 261 Unicast, 503 compact support, 261 URI (Uniform Resource Identifier), 517 mother wavelet, 258 URL (Uniform Resource Locator), 496, 517, synthesis filter, 261 523, 540, 566, 580 Weber’s law, 147 User-generated content (UGC), 618 White point correction, 95 Index 727

WiMAX, 585 X Wireless LAN (WLAN), 488, 577, 586 XML (Extensible Markup Language), 12, 384, carrier sense multiple access with collision 390 avoidance (CSMA/CA), 586 XYZ to RGB, 96 IEEE 802.11, 586 IEEE 802.11a, 587 IEEE 802.11ac, 588 Y IEEE 802.11b, 587 YCbCr, 109, 122, 282, 386, 419 IEEE 802.11g, 587 YIQ, 105, 107 IEEE 802.11n, 588 YouTube, 14, 36, 551, 617, 618, 655 Wi-Fi, 488, 586, 610 insight analytics, 626 WMF (windows meta file), 75 partner program, 625 Words, 694 YUV, 105 visual words, 694 Work-ahead smoothing, 536 WWW (World Wide Web), 69, 359 Z Web 2.0, 14, 617 Zipf’s law, 633