International Organisation for Standardisation s5
Total Page:16
File Type:pdf, Size:1020Kb

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC 1/SC 29/WG 11 N9558 Antalya, TR – January 2008
Source: Leonardo Chiariglione Title: Report of 83rd meeting Status
Report of 83rd meeting
1 Opening The 83rd MPEG meeting was held on 14 – 18 January 2008 in Antalya, Turkey.
2 Roll call of participants Annex 1 provides the attendance list
3 Approval of agenda Annex 2 provides the adopted agenda
4 Allocation of contributions Annex 3 provides the list of input contributions.
5 Communications from Convenor The Convenor announced that Jörn Ostermann was appointed as Chairman of the Requirements group.
6 Report of previous meeting This was approved
7 Processing of NB Position Papers NB Position Papers were presented and discussed. Where relevant a response was provided.
1 8 Work plan management
8.1 Media coding
8.1.1 MPEG-2 Main Profile Level for 1080@50/60p The following documents were approved
9563 Request for 13818-2:2000/Amd.3 9564 Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080@50/60p
8.1.2 MPEG-4 Visual Simple Studio Profile Levels 5 and 6 The following document was approved
9565 Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio Profile Levels 5 and 6
8.1.3 AAC-ELD The following document was approved
9619 Workplan for AAC-ELD Verification Test
8.1.4 New Profiles for Professional Applications
8.1.5 Scalable Video Coding The following document was approved
9577 Report on SVC Verification Tests
8.1.6 Multiview Video Coding The following documents were approved
9575 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 9576 Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding 9578 Joint Multiview Video Model (JMVM) 7 9579 JMVM 7 Software 9580 Overview of Multiview Video Coding (MVC)
8.1.7 AFX The following document was approved
9649 WD2.0 of AFX 3rd Edition
8.1.8 Frame-based Animated Mesh Compression The following documents were approved
2 9647 DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh Compression) 9648 Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh Compression)
8.1.9 Low-complexity 3D mesh compression The following documents were approved
9650 Requirements for low-complexity 3D mesh compression 9651 CfP for low-complexity 3D mesh compression
8.1.10 Open Font Format The following documents were approved
9683 Request for 14496-22 2nd Edition 9684 Text of ISO/IEC CD 14496-22 2nd Edition
8.1.11 Codec Configuration Representation The following documents were approved
9584 Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation 9585 Reconfigurable Video Coding Requirements V 4.0 9586 Overview of Reconfigurable Video Coding (RVC) 9589 Description of Core Experiments in RVC 9590 RVC Simulation Model (RSM) V7.0 9591 RVC Work Plan and FU Development Status
8.1.12 Video Tool Library The following documents were approved
9587 Study Text of CD ISO/IEC 23002-4 Video Tool Library 9588 Extensions of Video Tool Library under consideration 9593 Description of Exploration Experiments in RVC 9594 Methodologies for Video Toolbox Extension V2.0
8.1.13 Spatial Audio Object Coding The following documents were approved
9636 Status and Workplan on SAOC Core Experiments 9637 WD on SAOC Text and Reference Software
8.1.14 Post Production Deliverable Formats The following documents were approved
9710 Requirements for MPEG Post Production Deliverable Formats 9711 Gap Analysis between Post Production Deliverable Requirements and Proposed Working
3 Draft 9712 Text of WD1.0 MPEG Post Production Deliverable Formats
8.1.15 Free Viewpoint TV coding The following documents were approved
9595 Call for Contributions on 3D Video Test Material (Update) 9596 Description of Exploration Experiments in 3D Video
8.1.16 Unified speech and audio coding The following documents were approved
9638 Evaluation Guidelines for Unified Speech and Audio Proposals 9639 Workplan on Speech and Audio Material Selection 9640 Draft Workplan on Subjective Testing of Unified Speech and Audio Coding Proposals
8.1.17 Media Value Chain Ontology The following document was approved
9658 Requirements for a Media Value Chain Ontology
8.1.18 Representation of Sensory Experience The following document was approved
9659 Requirements on RoSE Framework
8.2 Composition coding
8.2.1 Scene representation The following documents were approved
9675 WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture 9676 Request for Amendment of ISO/IEC 14496-11 9677 ISO/IEC 14496-11 PDAM6 Scene Partitioning
8.2.2 Presentation of Structured Information The following documents were approved
9715 Requirements for Presentation of Structured Information 9716 Preliminary WD of Presentation of Structured Information
4 8.3 Description coding
8.3.1 Visual Descriptions Extensions The following document was approved
9582 Description of Core Experiments for MPEG-7 New Visual Extensions
8.3.2 Visual Signature Tools The following document was approved
9581 Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools
8.4 IPMP The following documents were approved
9686 DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile 9687 Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile 9688 MPEG-21 REL Profiles Software Implementation Plan v.9
8.5 Transport and File formats
8.5.1 Carriage of SVC in MPEG-2 Systems The following documents were approved
9669 Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems 9670 Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC
8.5.2 ISO Base Media File Format The following documents were approved
9678 Text of ISO/IEC 14496-12 3rd Edition 9680 Updated Technology under Consideration for Part 12
8.5.3 AVC File Format extensions for SVC The following documents were approved
9681 DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension 9682 Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension
8.6 Multimedia architecture
8.6.1 3D Graphics Compression Models The following document was approved
9652 Study of CD of ISO/IEC 14496-25
5 8.6.2 WIM TV The following document was approved
9717 Requirements on WIM TV
8.6.3 MPEG eXtensible Middleware The following document was approved
9713 Requirements for MXM (MPEG eXtensible Middleware)
8.7 Application formats
8.7.1 MAF general The following documents were approved
9689 MAF Overview Document 9690 MAF Overview Presentation
8.7.2 Musical Slide Show Application Format The following documents were approved
9691 Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition 9692 Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref. Software
8.7.3 Media Streaming Application Format The following document was approved
9693 Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application Format
8.7.4 Professional Archival MAF The following documents were approved
9694 Requirements on Professional Archival Application Format 9696 Text of ISO/IEC CD 23000-6 Professional Archival Application Format
8.7.5 Open Release Application Format The following documents were approved
9697 DoC of ISO/IEC FCD 23000-7 Open Access Application Format 9698 Text of ISO/IEC FDIS 23000-7 Open Access Application Format 9699 Request of Amendment for ISO/IEC 23000-7
6 9700 Text of ISO/IEC PDAM1 23000-7 Conformance and Reference Software
8.7.6 Portable Video Player MAF The following documents were approved
9701 Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format 9702 Workplan for Portable Video Application Format Conformance and Ref. Soft.
8.7.7 Video Surveillance Application Format The following documents were approved
9705 DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format) 9706 Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format) 9708 Future Work on Surveillance AF's – collection of requirements
8.7.8 Video Stereoscopic Application Format The following document wa approved
9709 Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format)
8.8 Reference implementation
8.8.1 Symbolic Music Representation Reference Software The following documents were approved
9671 DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft. 9672 Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft.
8.8.2 BSAC Extensions Reference Software The following document was approved
9630 Study on ISO/IEC 14496-5:2001/FPDAM 20, Reference Software for MPEG-1/2 Audio in MPEG-4 and BSAC Extensions
8.8.3 AAC-ELD Reference Software The following document was approved
9629 ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw.
8.8.4 SVC Reference Software The following documents were approved
7 9572 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19 9573 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding
8.8.5 3D Graphics Compression Model Reference Software The following document was approved
9645 ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft)
8.8.6 LASeR Reference Software The following documents were approved
9673 DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft. 9674 Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft.
8.8.7 DMB AF Reference Sofware The following document was approved
9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software
8.8.8 Video Surveillance Application Format Reference Sofware The following document was approved
9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
8.8.9 MPEG Surround Reference Software The following documents were approved
9634 DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference Sw. 9635 ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw.
8.9 Conformance
8.9.1 MPEG-2 Main Profile Level for 1080@50/60p Conformance The following documents were approved
9583 Request for 13818-4:2004/Amd.3 9618 Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p Conformance Testing
8.9.2 MPEG-4 Visual Simple Profile Level 6 Conformance The following document was approved
9567 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio Profile Levels 5 and 6
8 Conformance Testing
8.9.3 Scalable Video Coding Conformance The following documents were approved
9568 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31 9569 Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video Coding
8.9.4 Symbolic Music Representation Conformance The following documents were approved
9625 DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance 9626 ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance
8.9.5 Audio Scalable to Lossless Conformance The following documents were approved
9620 DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance 9621 ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance
8.9.6 AAC-ELD, OAFI and additional AAC Conformance The following document was approved
9624 ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and additional AAC Conformance
8.9.7 Frame-based Animated Mesh Compression Conformance The following document was approved
9642 Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC Conformance)
8.9.8 Multiresolution Profile Conformance The following document was approved
9643 Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution Profile Conformance)
8.9.9 3D Graphics Compression Model Conformance The following document was approved
9644 ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance)
9 8.9.10 DMB AF Conformance The following document was approved
9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software
8.9.11 Video Surveillance Application Format Conformance The following document was approved
9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software
8.9.12 MPEG Surround Conformance The following documents were approved
9631 DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround Conformance 9632 ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance 9633 Workplan on further issues for MPEG Surround Conformance
8.9.13 Video Tool Library Conformance The following document was approved
9592 RVC Conformance Testing Working Draft V4.0
8.9.14 MPEG-4 Audio Conformance Rollup The following document was approved
9627 MPEG-4 Audio Conformance Rollup
8.10 Maintenance
8.10.1 Systems coding standards The following document was approved
9679 WD1.0 of Corrigendum on ISO/IEC 14496-12
8.10.2 Video coding standards The following documents were approved
9566 Study Text of ISO/IEC 14496-2:2004/DCOR3 9570 Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR 1 9571 Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1 9574 Text of ISO/IEC 14496-10:200X/DCOR 1
10 8.10.3 Audio coding standards The following documents were approved
9622 ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo 9623 ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS 9628 ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS
8.10.4 3DG coding standards The following document was approved
9646 Study of ISO/IEC 14496-16:2006/AMD1/DCOR1
8.10.5 MPEG-21 standards The following document was approved
9685 Items for consideration for Corrigendum or Amendment of MPEG-21 DIA
8.10.6 MAF standards The following document was approved
9703 Text of ISO/IEC 23000-9/DCOR1 (DMB Application Format)
9 Organisation of this meeting
9.1 Tasks for subgroups The following tasks were assigned
Requirements Carriage of AVS on MPEG-2 Systems IPTV Media value chain ontologies Framework for representation of sensory effects information ? Information exchange with virtual worlds Systems 2 13 Carriage of SVC 26 Open Font Format Conformance 27 Laser v.2 conformance 514 Open Font Format Reference Software 16 Symbolic Music Representation Reference Software 17 Laser Reference Software 151 File Format 7 12 Query Format Schemas 21 5 REL amendment OAC
11 6 Media value chain ontologies 81 Reference software Schemas 91 Mime type registration 151 Security in Event Reporting A 41 Musical Slide Show MAF conformance & RS 2 Protected Musical Slide Show MAF 51 Media Streaming MAF conformance & RS 6 Professional Archival MAF 7 OA MAF 81 Portable Video Player MAF conformance & RS 9 DMB MAF conformance & RS 10 Video Surveillance MAF 11 Stereoscopic MAF E 8 M3W Reference Software and Conformance X Post production delivery format U MPEG eXtensible Middleware V Information exchange with virtual worlds W Framework for representation of sensory effects information Y Joint management of content description and presentation IPTV Video 2 2 New levels for 1080/60 P support 4 25 Studio Profile level 5 and 6 4 435 Studio Profile level 5 and 6 Conformance 7 33 Image Signature Tools 7 34 Video Signature Tools 3 Video augmentation by metadata A 31 Photo Player Reference Software 2 Photo Player Conformance B 4 Codec Configuration Description C 4 Video Tool Library 41 Video Tool Library Conformance FTV JVT 4 10 New AVC Profiles for Professional Applications Conformance 10 New AVC Profiles for Professional Applications Reference SW 10 Scalable Video Coding Conformance 10 Scalable Video Coding Reference SW 101 Multi-View Video Coding 4 103 verification tests Audio 4 420 SLS conformance 29 SMR Conformance 516 SMR Reference Software
12 20 MPEG-1/-2 on MPEG-4 reference software D 11 MPEG Surround Reference Software 2 MPEG Surround Conformance 2 Spatial Audio Object Coding X Unified Speech and Audio Coding 3DG 421 Geometry and shadow Conformance 4 4 32 FAMC (Frame based Animated Mesh Compress.) Conformance 33 Multiresolution profile conformance 3DG Compression Model Conformance 513 Geometry and shadow Reference Software 21 FAMC (Frame based Animated Mesh Compress.) Reference software 21 Multiresolution profile Reference Software 3DG Compression Model Reference Software 162 Frame-based animated mesh compression 3 3D Multiresolution profile 4 Space partitioning 25 3D Graphics Compression model ? Metaverse
9.2 Joint meetings The following joint meetings were held
Groups What Where Day Time Sys, 3DG Scene partitioning Systems Mon 17:00-18:00 Req, Sys IPTV Systems Tue 09:00-09:00 Req, Sys, 3DG Metaverse Systems Tue 09:30-10:00 Req, Sys Rose Systems Tue 10:00-11:00 Req, Sys, Vid AVS Systems Tue 11:00-12:00 Vid, JVT, Req MVC, bit depth reqs, FTV JVT Wed 16:00-17:30
10 WG management
10.1 Terms of reference The following document was approved
9600 Terms of reference
10.2 Officers Jörn Ostermann was appointed as Chairman of the Requirements Group
10.3 Editors The following document was approved
13 9604 Editors of MPEG standards
10.4 Liaisons The following liaison statements were issued
9714 Liaison to JPEG on ISO Base Format 9718 Response to DVB on File Format 9719 Response to DVB on Carriage and Storage of SVC 9720 Response to JPEG on Query Format 9721 Liaison to JTC1/SWG-ARM on PA Application Format 9722 Liaison to SMPTE on PA Application Format 9723 Liaison to TC20/SC13 on PA Application Format 9724 Liaison to JPEG on PA Application Format 9725 Response to JTC1/SC34 9726 Liaison to ITU-T SG16 on IPTV 9727 Liaison to Creative Common on Open Access Application Format 9728 Liaison to SMPTE on Post-Production Deliverables 9729 Liaison to NAB on Post-Production Deliverables 9730 Liaison to ATSC on Post-Production Deliverables 9731 Liaison to MPAA on Post-Production Deliverables 9732 Liaison to EBU on Post-Production Deliverables 9733 Liaison to IEC TC100 TA6 on Post-Production Deliverables 9734 Liaison to IFPI on Post-Production Deliverables 9735 Liaison to DMP on Presentation of Structured Information 9736 Liaison to ITU-T TC 9 WG43 on Video Surveillance AF 9614 Liaison Statement to SMPTE re RVC 9615 Liaison Statement to ITU-T SG 9 re FTV 9616 Liaison Statement to ITU-T SG 9 re Bitstream Splicing 9617 Liaison Statement template for various organizations re SVC verification testing report 9641 Liaison Statement to ETSI TC DECT 9660 Liaison Statement to ITU-T SG 16
10.5 Work item assignment
10.6 Ad hoc groups The following ad hoc groups were established w9664 Ad Hoc Group on Application Format w9663 Ad Hoc Group on MPEG File Formats w9665 Ad Hoc Group on Presentation of Structured Information w9662 Ad Hoc Group on Scene Representation
14 w9661 AHG on 3DG documents and software maintenance w9653 AHG on Audio Standards Maintenance w9668 AHG on Font Format Representation w9613 AHG on FTV w9655 AHG on Information Exchange with Virtual Worlds w9597 AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance w9667 AHG on MPEG Query Format w9599 AHG on MPEG-7 Visual w9598 AHG on Reconfigurable Video Coding w9657 AHG on Requirements for Media Value Chain Ontology w9666 AHG on Requirements for MPEG Post Production Deliverable Formats w9656 AHG on the RoSE Framework w9654 AHG on Unified Speech and Audio Coding and SAOC and AAC-ELD
10.7 Asset management The following documents were approved
9605 Schema assets 9606 Software assets 9607 Conformance assets 9608 Content assets 9609 URI assets
10.8 IPR management The following document was approved
15 9610 Standards under development for which a call for patent statements is issued
10.9 Work plan The following documents were approved
9601 MPEG Standards 9602 Table of unpublished FDISs 9603 Work plan and time line
11 Administrative matters
11.1 Schedule of future MPEG meetings The following schedule was approved
# City Country yy mm dd-dd 83 Antalya TR 08 01 14-18 84 Archamps FR 08 04-05 28-02 85 Hannover DE 08 07 21-25 86 Busan KR 08 10 13-17 87 Archamps FR 09 01-02 26-30 88 ? US? 09 04 20-24 89 London? UK? 09 06-07 29-03 90 Xian CN 09 10 26-30
11.2 Promotional activities The following document was approved
9561 Antalya press release
12 Resolutions of this meeting These were approved
13 A.O.B
14 Closing
16 Annex A – Attendance list
First name Last name Affiliation NB Christian Timmerer Klagenfurt University AT Dan Cernea Vrije Universiteit Brussel BE Jan De Cock Ghent University BE Rik Van de Walle Ghent University - IBBT BE Kenneth Vermeirsch Ghent University BE Touradj Ebrahimi EPFL CH Marco Mattavelli EPFL CH weizhong Chen Huawei technologies CO.,LTD CN Dandan Ding Zhejiang University CN Sixin Lin Haitao Yang Huawei Tech. Ltd Co. CN Tiejun Huang Peking University CN Shan Gao Lianhuan Xiong Huawei Technologies Co. Ltd. CN Yingjia Liu Huawei CN Siwei Ma Peking University CN Honggang Qi Institute of Computing Technology CN xuemin Wang Huawei technologies CO., LTD CN Lianhuan Xiong Huawei CN Xiaozhong Xu Tsinghua University CN Haitao Yang Xidian University CN Lijing Xu Yingjia Liu HUAWEI Technologies Co., Ltd. CN Lu Yu Zhejiang University CN Gang Zhu Tsinghua University CN Peter Amon Siemens AG DE Gero Bäse Siemens DE Johannes Boehm Thomson DE Stefan Doehla Fraunhofer IIS DE Ralf Geiger Fraunhofer IIS DE Sebastian Gerke Fraunhofer HHI DE Bernhard Grill Fraunhofer IIS DE Oliver Hellmuth Fraunhofer IIS DE Tilman Liebchen LG Electronics DE Markus Multrus Fraunhofer IIS DE Karsten Müller Fraunhofer HHI DE Matthias Narroschke Panasonic DE Tobias Oelbaum Technische Universität München DE Jens-Rainer Ohm RWTH Aachen DE Joern Ostermann Leibniz Universität Hannover DE Thomas Schierl Fraunhofer HHI DE Andreas Schneider Dolby Germany GmbH DE Markus Schnell Fraunhofer IIS DE Florian Schreiner Technische Universität München DE Heiko Schwarz Fraunhofer HHI DE Aljoscha Smolic Aljoscha Smolic DE Ralph Sperschneider Fraunhofer IIS DE Leonid Terentiev Fraunhofer IIS DE
17 Herbert Thoma Fraunhofer IIS DE Thomas Wiegand Fraunhofer HHI DE Steffen Wittmann Panasonic DE Pablo Carballeira López Universidad Politécnica de Madrid ES Jaime Delgado DMAG-UPC ES Marc Gauvin sDae ES Leonardo Lizcano Telefonica R&D ES Francisco Morán Burgos Universidad Politécnica de Madrid ES Ying Chen Tampere University of Technology FI Miska Hannuksela Nokia Corporation FI Jani Lainema Nokia FI Justin Ridge Nokia FI Kemal Ugur Nokia FI Juha Vartiainen The Finnish Standards Association, SFS FI xianglin wang Nokia Inc. FI Bertrand BERTHELOT France Telecom FR Yann Bodo Joost Technologies BV FR Vincent Bottreau Thomson FR Sebastien Brangoulo Joost Technologies BV FR Alice de Casanove Actimagine FR Jean-Claude Dufourd Streamezzo FR Patrick GIOIA Orange Labs FR Marc GUEZ VUCHER FRANCE FR Joel Jung Orange - France Telecom R&D FR Jean Francois Nezan IETR / INSA FR avaro olivier Streamezzo FR Stephane Pateux Orange Labs FR Pierrick Philippe France Telecom R&D FR Marius Preda Institut Telecom FR Françoise PRETEUX Institut TELECOM FR Mickael RAULET IETR / INSA FR Jerome vieron Thomson R&D France FR David Virette France Télécom FR Pierfrancesco Bellini University of Florence IT sabina brufani SISVEL IT Leonardo Chiariglione Cedeo.net IT Giovanni Cordara Telecom Italia Lab IT Kohtaro Asai Mitsubishi Electric JP Yukihiro Bando NTT JP Takeshi Chujoh Toshiba Corporation JP Toshiaki Fujii Nagoya University JP Noboru Harada NTT JP Takashi Ito Fujitsu Laboratories Ltd. JP Kota Iwamoto NEC Corporation JP Hideaki Kimata NTT JP NAOKI KOBAYASHI NTT JP Takuyo Kogure Matsushita Electric Ind. Co. Ltd JP Takehiro Moriya NTT JP Tokumichi Murakami Mitsubishi Electric JP
18 Hiroya Nakamura JVC (Victor Company of Japan, Limited) JP Takahiro Nishi Matsushita Electric (Panasonic) JP Toshiyuki Nomura NEC JP Takeshi Norimatsu Panasonic JP Yukiko Ogura IPSJ/ITSCJ JP Shun-ichi Sekiguchi Mitsubishi Electric Corporation JP Takanori Senoh National Institute of Info & Comm Tech JP Osamu Shimada NEC Corporation JP Shinya Shimizu NTT JP Akihiko Sugiyama NEC Corporation JP Teruhiko Suzuki Sony Corp. JP Masashi Takahashi Hitachi Ltd JP TK Tan NTT DoCoMo, Inc. JP Masayuki Tanimoto Nagoya University JP Akiyuki Tanizawa Toshiba Corporation JP Yasuhiro Toguri Sony Corporation JP Yoshihisa Yamada Mitsubishi Electric JP TOMOO YAMAKAGE TOSHIBA Corporation JP Tomoyuki Yamamoto Sharp JP Takahiro Yamasaki Oki Electric Industry Co., Ltd. JP Tomonobu Yoshino KDDI JP Hyouk Jean Cha LG Electronics KR Jihun Cha ETRI KR Suhee Cho ETRI KR Yoonsik Choe Yonsei University KR bumsuk choi ETRI KR Haechul Choi ETRI KR Jin Soo Choi ETRI KR Miran Choi ETRI KR WOONG IL CHOI Samsung KR Yungho Choi Konkuk University KR Dong-Hoon Han Sejong University KR Jong-Ki Han Sejong University KR Ki-Hun Han Sejong University KR YO-SUNG HO GIST KR SeoYoung Hwang Samsung Electronics CO., LTD KR Byeong-Moon Jeon LG Electronics KR Byeungwoo Jeon SKKU KR Yong-Joon Jeon LG Electronics KR Dong-Seok Jeong Inha University KR jechang jeong hanyang university KR Seyoon Jeong ETRI KR Jie Jia Sejong University KR Sanghyun Joo ETRI KR Yang-Won Jung LG Electronics KR Jung Won Kang ETRI KR Chan=Young Kim VARO VISION KR DAEYEON KIM Sejong university KR Daiyong Kim Hanyang University KR
19 Dong Soo Kim LG Electronics KR Ha Yoon Kim SK Telecom KR Hae Kwang KIM Sejong University KR Hansang Kim Samsung Electronics KR Hui Yong Kim ETRI KR Hyungyu Kim Hanyang University KR Inkwon Kim VARO VISION CO., LTD KR Jae-Gon Kim Korea Aerospace University KR Jong Lak Kim DSP Group KR JONGYOUN KIM Net&TV KR JungHoe Kim Samsung AIT KR Kyuheon Kim Kyung Hee Univ. KR Miyoung Kim Samsung AIT KR Munchurl Kim Information and Communications University KR Seong-wan Kim Yonsei Univ. KR Sikyung Kim Hangyang Univ. KR Yong-Goo Kim Yonsei Univ. KR Yong Han Kim University of Seoul KR Yong Tae Kim Samsung Electronics KR Seung Ryong Kook Kyunghee University KR Eunkyung Kwak HUMAX KR Alex Lee Humax Co., Ltd KR Hyobin Lee Yonsei Univ. KR Jangwon Lee Kyung Hee Univ. KR SANG HOON LEE DSPG KR Sinwook Lee Hanyang University KR Yoonjin Lee Kyunghee UNIV. KR Yung-Ki Lee Sejong University KR Yung Lyul Lee Sejong University KR JungEun Lim LG Electronics KR Taebeom Lim KETI KR Young-Kwon Lim net&tv Inc. KR Joo Hee Moon Sejong University KR Henney Oh LG Electronics KR Kwan-Jung Oh GIST KR Weon Geun Oh ETRI KR Gwang-Hoon Park Kyung Hee University KR HyoungMee Park Sejong University KR Jiho Park KETI KR Jongtae Park Kyunghee Univ, KR Joonyoung Park LG Electronics KR JUKYUNG PARK Net&TV KR Min Cheol Park Sejong University KR Min Woo Park Kyunghee University KR Seung-Wook Park LG Electronics KR Muhammad Syah Houari Sabirin Information and Communications University KR Jeongil Seo ETRI KR SeungYong Shim Sejong Univ. KR Dong-Gyu Sim Kwangwoon University KR
20 hyung sik(sean) suh LG ELECTRONICS Inc. KR Lim Sung-Chang Sejong University KR Hendry Tan Information and Communications University KR Gi-Mun Um ETRI KR Kwanghyun Won Sungkyunkwan University KR SHIM WOO SUNG Samsung electronics KR Jungyoup Yang Sungkyunkwan University KR JEONG-JU YOO ETRI KR YoungJoe Yoo Sejong Univ. KR Daeil Yoon Sejong University KR Kyoungro Yoon Konkuk University KR Sungyong Yoon LG Electronics KR Kug Jin Yun ETRI KR Fons Bruls Philips NL Wiebe de Haan Philips NL Jean H.A. Gelissen Philips Research NL Werner oomen Philips Applied Technologies NL Gisle Bjontegaard Tandberg NO Arild Fuldseth Tandberg NO Marek Domanski Poznan University of Technology PL Krzysztof Klimaszewski Poznań Univ. of Technology PL Marian Muczko Telekomunikacja Polska S.A. PL Lukasz Pikula Telekomunikacja Polska S.A. PL Kenneth Andersson Ericsson AB SE Per Fröjdh Ericsson SE Kristofer Kjörling Coding Technologies AB SE Heiko Purnhagen Coding Technologies AB SE Anisse Taleb Ericsson AB SE Lekha Chaisorn Institute for Infocom Research SG Ti Eu Chan Institute For Infocomm Research (A*STAR) SG Farzam Farbiz A*STAR Insitute for Infocomm Research SG Wei Siong Lee Institute for Infocomm Research SG Chong Soon Lim Panasonic Singapore Labs SG Corey Manders A*STAR Institute for Infocomm Research SG Wei Yao Institute for Infocomm Research SG Yongwei Zhu Institute for Infocomm Research SG James Annesley Kingston University UK Tanya Beech QintiQ UK Miroslaw Bober Mitsubishi Electric ITE-VIL UK Paul Brasnett Mitsubishi Electric ITE-VIL UK Leszek Cieplinski Mitsubishi Electric UK Catherine Grant Nine Tiles UK Mike Nilsson BT UK Madhukar Budagavi Texas Instruments Inc. US Yi-Jen Chiu Intel Corp. US Oscar Divorra Escoda Thomson US Alex Eleftheriadis Vidyo, Inc. US Cristina Gomila Thomson US Onur Guleryuz DoCoMo USA Labs US
21 Michael Horowitz Vidyo, Inc. US Shih-Ta Hsiang Motorola, Inc. US Yi Hu Conexant Systems US Walt Husak Dolby / SMPTE US Faisal Ishtiaq Motorola US Marta Karczewicz Qualcomm US Gwo Giun (Chris) Lee National Cheng Kung University US Vladimir Levantovsky Monotype Imaging Inc. US He-Yuan Lin National Cheng Kung University US Julie Lofton Hot Potato, Inc. US Ajay Luthra Motorola US Kyle McAdoo Conexant Systems US Sam Narasimhan Motorola US Purvin Pandit Thomson US Schuyler Quackenbush Audio Research Labs US Arturo Rodriguez Scientific Atlanta, a Cisco Company US Jesus Sampedro Polycom US Andrew Segall Sharp Labs of America US Suman Sharma Intel Corporation US David Singer Apple Inc., USA US Gary Sullivan Microsoft Corp. US Huifang Sun Mitsubishi Electric Research Labs US Pankaj Topiwala FastVDO US Anthony Vetro Mitsubishi Electric US Xin Wang ContentGuard, Inc. US Yong Yu Broadcom Corp US
22 Annex B – Agenda
Item 1 Opening 2 Roll call of participants 3 Approval of agenda 4 Allocation of contributions 5 Communications from Convenor 6 Report of previous meeting 7 Processing of NB Position Papers 8 Work plan management 1 1 Media coding 2 MPEG-4 Visual Simple Profile Level 6 3 AAC-ELD 4 New Profiles for Professional Applications 5 Scalable Video Coding 6 Multiview Video Coding 7 Geometry and Shadow 8 Binary Format for XML (Prefixes and Wild Card extensions) 9 Bitstream Syntax Description Language 1 Fixed point implementation of DCT/IDCT 0 1 Video Tool Library 1 1 Spatial Audio Object Coding 2 1 Free Viewpoint TV coding 3 1 Audio and speech coding 4 1 Ontology 5 16Video coding exploration 2 Composition coding 1 Lightweight Scene Representation 3 Description coding 1 Schema definition 2 Visual Descriptions Extensions 3 Visual Signature Tools 4 Technologies for digital photo management using MPEG-7 visual tools 5 Improvements to Geographic Descriptor 6 MPEG-7 Query Format 4 Systems support 1 Fragment Request Unit 5 IPMP
23 1 REL MAM (Mobile And optical Media) Profile 2 REL DAC (Dissemination And Capture) Profile 3 REL ORC (Open Release Content) Profile 4 IPMP XML Messages 6 Digital Item 1 Schema files for MPEG-21 standards 2 Security in Event Reporting 3 Review of DI 7 Transport and File formats 1 Carriage of SVC in MPEG-2 Systems 2 Transport of MPEG Surround data in AAC 3 MP4FF box for Original Audio File Information 4 File Format extensions for Description of Timed Metadata 5 Flute Hint Track 6 AVC File Format extensions for FRExt 7 AVC File Format extensions for SVC 8 AVC File Format extensions for MVC 9 Digital Item File Format 1 Digital Item Streaming 0 8 Multimedia architecture 1 Codec Configuration Representation 2 3D Graphics Compression Models 3 Media Streaming MAF Protocols 4 IPTV 5 Extensible Multimedia Platform 6 Metaverse 9 Application formats 1 Musical Slide Show Application Format 2 Media Streaming Application Format 3 Professional Archival MAF 4 Open Release Application Format 5 Portable Video Player MAF 6 Digital Multimedia Broadcasting Application Format 7 Video Surveillance MAF 8 Stereoscopic MAF 9 Cross media interactive presentation 1 Reference implementation 0 1 Symbolic Music Representation Reference Software 2 MPEG-1 and -2 on MPEG-4 Reference Software 3 BSAC Extensions Reference Software 4 Reference Hardware Description 5 New Profiles for Professional Applications Reference Software 6 SVC Reference Software 7 File Format Reference Software
24 8 Geometry and Shadow Reference Software 9 Frame-based Animated Mesh Compression Reference Software 1 MPEG-J GFX Reference Software 0 1 LASeR Reference Software 1 1 Open Font Format Reference Software 2 1 MPEG-7 Systems Reference Software 3 1 MPEG-21 REL Reference Software 4 1 Photo Player MAF Reference Software 5 1 Musical Slide Show MAF Reference Software 6 1 Binary MPEG format for XML Reference Software 7 1 Prefixes and wild card extensions Reference Software 8 1 MPEG Surround Reference Software 9 2 M3W Reference Software 0 1 Conformance 1 1 Audio BIFS v3 Conformance 2 Symbolic Music Representation Conformance 3 MPEG-4 Visual Simple Profile Level 6 Conformance 4 New Profiles for Professional Applications Conformance 5 SVC Profiles Conformance 6 MPEG-1 and -2 Audio in MPEG-4 Conformance 7 BSAC Conformance 8 1-bit Oversampled Audio Conformance 9 Audio Scalable to Lossless Conformance 1 File Format Conformance 0 1 Geometry & Shadow Conformance 1 1 Frame-based Animated Mesh Compression Conformance 2 1 MultiResolution Profile Conformance 3 1 Synthesized Texture Conformance 4 1 MPEG-J GFX Conformance 5 1 Laser Conformance 6 1 Open Font Format Conformance
25 7 1 Perceptual 3D Shape Conformance 8 1 Improvements to Geographic Descriptor Conformance 9 2 Binary MPEG format for XML Conformance 0 2 MPEG Surround Conformance 1 2 M3W Conformance 2 2 Video Tool Library Conformance 3 1 Maintenance 2 1 Systems coding standards 2 Video coding standards 3 Audio coding standards 4 Visual description coding standards 5 Audio description coding standards 6 MDS standards 9 Organisation of this meeting 1 Tasks for subgroups 2 Joint meetings 1 WG management 0 1 Terms of reference 2 Officers 3 Editors 4 Liaisons 5 Work item assignment 6 Ad hoc groups 7 Asset management 1 Reference software 2 Conformance 3 Test material 4 URI 8 IPR management 9 Work plan 1 Administrative matters 1 1 Responses to National Bodies 2 Schedule of future MPEG meetings 3 Promotional activities 1 Resolutions of this meeting 2 1 A.O.B 3
26 1 Closing 4
27 Annex C– Input contributions
Number Source Title m15029 Webmaster Antalya document register Noboru Harada Ad Hoc Group on Professional Archival Application m15030 Hendry Format Jaime Delgado Ad Hoc Group on Requirements of Media Value Chains m15031 Xin Wang Ontologies Ad Hoc Group on Maintenance of MPEG-4 Visual Yi-Shin Tung m15032 related Documents, Reference Software and Teruhiko Suzuki Conformance Euee S. Jang m15033 Marco Mattavelli Ad Hoc Group on Reconfigurable Video Coding Yoshihisa Yamada Miroslaw Bober m15034 Ryoma Oami Ad Hoc Group on MPEG-7 Visual Robert O'Callaghan Miroslaw Bober m15035 Ad Hoc Group on Video Augmentation by Metadata Thomas Wiegand Hideaki Kimata m15036 Ad Hoc Group on Free-Viewpoint Television Karsten Müller Young-Kwon Lim m15037 Ad Hoc Group on Scene Representation Jean Lefeuvre David Singer m15038 Ad Hoc Group on MPEG File Formats Visharam Mohammed Kyuheon Kim m15039 Hui Yong Kim Ad Hoc Group on Application Format Jean Cha Young-Kwon Lim m15040 Jihun Cha Ad Hoc Group on Digital Item Presentation Jean-Claude Dufourd Tobias Oelbaum m15041 Ad Hoc Group on SVC Verification Test Mathias Wien Jeong-Hwan Ahn Ad Hoc Group on 3DG documents, experiments and m15042 Nikolce Stefanoski software maintenance m15043 R. Sperschneider Ad Hoc Group on Audio Standards Maintenance S. Quackenbush Ad Hoc Group on Unified Speech and Audio Coding and m15044 Eunmi Oh SAOC
28 Ad Hoc Group on Requirements for MPEG Post m15045 Julie Lofton Production Deliverable Formats Kyoungro Yoon m15046 Ad Hoc Group on MPEG Query Format Mario Doeller Taka Senoh [email protected] m15047 [email protected] Consideration of Depth Format [email protected] [email protected] m15048 SC 29 Secretariat Liaison Statement from SMPTE [SC 29 N 8899] m15049 SC 29 Secretariat Liaison Statement from DVB [SC 29 N 8901] Table of Replies on ISO/IEC 23001-1:2006/FDAM 1 m15050 SC 29 Secretariat [SC 29 N 8902] Table of Replies on ISO/IEC 14496-4:2004/FDAM 14 m15051 SC 29 Secretariat [SC 29 N 8909] Table of Replies on ISO/IEC 14496-4:2004/FDAM 18 m15052 SC 29 Secretariat [SC 29 N 8910] Table of Replies on ISO/IEC 14496-4:2004/FDAM 19 m15053 SC 29 Secretariat [SC 29 N 8911] Table of Replies on ISO/IEC 14496-1:2004/FDAM 3 m15054 SC 29 Secretariat [SC 29 N 8912] Table of Replies on ISO/IEC 21000-4:2006/FDAM 1 m15055 SC 29 Secretariat [SC 29 N 8913] m15056 SC 29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 8919] Summary of Voting on ISO/IEC 14496-16:2006/PDAM m15057 SC 29 Secretariat 2 [SC 29 N 8920] Summary of Voting on ISO/IEC 14496-4:2004/FPDAM m15058 SC 29 Secretariat 29 [SC 29 N 8925] Summary of Voting on ISO/IEC 14496-5:2001/FPDAM m15059 SC 29 Secretariat 16 [SC 29 N 8926] Summary of Voting on ISO/IEC 14496-5:2001/FPDAM m15060 SC 29 Secretariat 17 [SC 29 N 8927] Summary of Voting on ISO/IEC 14496- m15061 SC 29 Secretariat 5:2001/Amd.1:2002/DCOR 1 [SC 29 N 8938] m15062 SC 29 Secretariat IEC CDV 62360 [SC 29 N 8941] m15063 SC 29 Secretariat Liaison Statement from SC 29/WG 1 [SC 29 N 8956] m15064 SC 29 Secretariat Liaison Statement from SC 29/WG 1 [SC 29 N 8957] m15065 SC 29 Secretariat Table of Replies on ISO/IEC 14496-5:2001/FDAM 11
29 [SC 29 N 8960] Summary of Voting on ISO/IEC 14496-15:2004/FPDAM m15066 SC 29 Secretariat 2 [SC 29 N 8961] Table of Replies on ISO/IEC FDIS 21000-14 [SC 29 N m15067 SC 29 Secretariat 8980] m15068 SC 29 Secretariat ISO/IEC FCD 19776-1 2nd Edition [SC 29 N 8985] m15069 SC 29 Secretariat ISO/IEC FCD 19776-3 2nd Edition [SC 29 N 8986] Table of Replies on ISO/IEC FDIS 23001-2 [SC 29 N m15070 SC 29 Secretariat 9001] m15071 SC 29 Secretariat ISO/IEC CD 19775-2 2nd Edition [SC 29 N 9002] m15072 SC 29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 9004] Table of Replies on ISO/IEC 14496-12:2005/FDAM 2 m15073 SC 29 Secretariat [SC 29 N 9006] Table of Replies on ISO/IEC FDIS 14496-23 [SC 29 N m15074 SC 29 Secretariat 9007] Table of Replies on ISO/IEC FDIS 23000-2 (2nd m15075 SC 29 Secretariat Edition) [SC 29 N 9033] Liaison Statement from ITU-T IPTV Focus Group (FG m15076 SC 29 Secretariat IPTV) [SC 29 N 9034] Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N m15077 SC 29 Secretariat 9035] Pierfrancesco Bellini Paolo Nesi Editors Study on ISO/IEC 14496-4:2004/FPDAM 29, m15078 Giorgio Zoia SMR Conformance Maurizio Capanai Pierfrancesco Bellini Paolo Nesi Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 m15079 Giorgio Zoia Symbolic Music Representation reference software Maurizio Campanai Gwo Giun Lee Functional units of AVC inter-prediction for adaptive m15080 He-Yuan Lin interlace coding Ming-Jiun Wang USNB Contribution: Proposed amendment to ISO/IEC m15081 Andy Tescher for the USNB 14496-22 Simon Daniels The proposal for amendment of ISO/IEC 14496-22 (in m15082 Michelle Hill support of USNB comment m15081) Vladimir Levantovsky Sanghyun Joo m15083 Requirements on RoSE Framework Bumsuk Choi
30 Munchurl Kim Benoit Le Bonhomme Online platform for 3D graphics compression m15084 Marius Preda benchmarking Françoise Preteux Blagica Jovanova m15085 Marius Preda Software Implementation for P25 Francoise Preteux Blagica Jovanova m15086 Marius Preda Conformance dataset for P25 Francoise Preteux Ivica Arsov m15087 Marius Preda MPEG-4 3D graphics player for N93 and N95 Francoise Preteux Masayuki Tanimoto m15088 Toshiaki Fujii Available Technologies for FTV Kazuyoshi Suzuki Masayuki Tanimoto Toshiaki Fujii m15089 Contribution of Nagoya University on FTV Test Material Kazuyoshi Suzuki Norishige Fukushima Masayuki Tanimoto Improvement of Depth Map Estimation and View m15090 Toshiaki Fujii Synthesis Kazuyoshi Suzuki Jean Gelissen m15091 Requirements on Framework for RoSE Mark Verberkt [email protected]. Marc Gauvin [email protected]. m15092 A Common Core IP Model Jaime Delgado [email protected]. Victor Rodriguez Per Fröjdh m15093 Proposed re-structured ISO Base Media File Format David Singer m15094 S. Quackenbush 82nd MPEG Audio Report m15095 S. Quackenbush Collected Set of Possible Evaluation Guidelines m15096 S. Quackenbush Draft Workplan for Testing of SA Proposals Teruhiko Suzuki Nick Saunders m15097 Proposal for MPEG-4 visual studio profile level 5 and 6 John Stone Paul Gardiner m15098 Gang Zhu Inter-View Skip Mode for FTV using Depth Information
31 Xiaozhong Xu Ping Yang Yun He Proposal for MPEG-4 visual studio profile conformance m15099 Teruhiko Suzuki testing Teruhiko Suzuki Proposal of new level to support 1080@50p/60p for m15100 Ajay Luthra MPEG-2 video Yi-Jen Chiu Aljoscha Smolic Heribert Brust Corrected Camera Parameters for N9468 ?Call for m15101 Karsten Mueller Contributions on FTV Test Material? Marcus Mueller Thomas Wiegand Ingo Feldmann Marcus Mueller Frederik Zilly Ralf Tanger m15102 Progress Report on 3DTV Video Acquisition Karsten Mueller Aljoscha Smolic Peter Kauff Thomas Wiegand Sangki Kim Hyobin Lee Seongwan Kim m15103 Sangyoun Lee CE Report for VCE-5 Myungil Gil Jang Hyun Ki Kim Jeong Heo jeong Kota Iwamoto m15104 Text/Logo Mask Image Generation Software for VCE-7 Ryoma Oami Julie Lofton m15105 MPEG-M under the MPEG 21 Reflector Jeff Steele Weon Genu Oh Daeil Yoon Contribution of video test material for MPEG-7 video m15106 Jie Jia signature CE Hae Kwang Kim Kenji Otoi m15107 Yoshihisa Yamada Proposed text of the RVC FUs for MPEG-2 Kohtaro Asai m15108 [email protected] Subjective results for the SVC Verification Test Ruben Tous Proposal of Reference Software for MPQF. Validation of m15109 Jaime Delgado embedded XQuery expressions. m15110 Osamu Shimada A core experiment proposal for an additional SAOC
32 Toshiyuki Nomura functionality of separating real-environment signals into Akihiko Sugiyama multiple objects Osamu Hoshuyama Yang-Won Jung m15111 A proposed CE on object parameter estimation in SAOC Henney Oh Henney Oh m15112 Comments on SAOC applications and architectures Yang-Won Jung Sinwook Lee Jaebum Jun m15113 Byeongjun Kim The results of RVC CE 1.2 Chungku Yie Euee S. Jang Ju-Kyong Jin Weon-Geun Oh Dong-Jin Seo m15114 Proposal on Frame-Reduction video clip format Sang-il Na Jae-Hyun Huh Dong-Seok Jeong Late UKNB comments on the Study of CD for the Video m15115 James Annesley Surveillance Application Format R.1 Sang-Beom Lee m15116 -based Multi-view Depth Map Estimation for FTV Kwan-Jung Oh Byeongjun Kim Jaebum Jun m15117 Hyungyu Kim Study of Application Requirements Related to RVC Chungku Yie Euee S. Jang Miyoung Kim Comments on Unified Speech and Audio CfP Evaluation m15118 Eunmi Oh Guidelines JungHoe Kim Yo-Sung Ho Segment-based Multi-view Depth Map Estimation for m15119 Sang-Beom Lee FTV Kwan-Jung Oh Yo-Sung Ho m15120 Sang-Tae Na Virtual View Synthesis for FTV Kwan-Jung Oh m15121 Tilman Liebchen Update of ALS Conformance Errors and Corrections for MPEG-7: Part 3 - Visual m15122 James Annesley Reference Software Oliver Hellmuth Information and Verification Results for CE on m15123 Johannes Hilpert Karaoke/solo System Improving Performance of MPEG Andreas Hölzer SAOC RM0
33 Leonid Terentiev Cornelia Falch Houari Sabirin Use cases for content protection in Musical slide show m15124 Munchurl Kim Application Format 2nd Edition Hyungyu Kim Sikyung Kim m15125 Myungjoong Lee The results of RVC CE 1.1 Chungku Yie Euee S. Jang Hendry m15126 Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0 Munchurl Kim Hyouk-Jean Cha Editor's study text of ISO/IEC 23000-4/PDAM1 Musical m15127 Tae Hyeon Kim slide show application format Jisoo Hong Hendry Proposal for Pre-Processing Tool Location Reference in m15128 Munchurl Kim Professional Archival Application Format Hendry Set of MPEG-7 Tools for Professional Archival m15129 Houari Sabirin Applications Format Munchurl Kim Weon-Geun Oh Ayoung Cho Won-Keun Yang Experiment Results of Image Signature for Complex m15130 Ik-Hwan Cho Conditions Ju-Kyong Jin Jun-Woo Lee Dong-Seok Jeong Weon-Geun Oh. Won-Keun Yang. The Extra Experiment Result to Verify the Method of m15131 Ayoung Cho. Performance Measure on MPEG-7 VCE-6 Dong-Seok Jeong m15132 Mathias Wien Verification of new SVC Verification Test Streams Jani Peltotalo m15133 Proposed corrections to ALC/FLUTE server file format Miska M. Hannuksela Jani Peltotalo m15134 Proposed additions to ALC/FLUTE server file format Miska M. Hannuksela m15135 Christian Timmerer MPEG-21 schema assets update Study Text of ISO/IEC FCD 23000-7 Open access m15136 Florian Schreiner application format Min-Jeong Lee m15137 Cross verification result for ETRI VCE-6 proposal Heung-Kyu Lee m15138 Ingo Kofler Multiple MPEG-21 DIA AdaptationQoS Descriptions
34 Christian Timmerer within a Digital Item Hermann Hellwagner [email protected] Zheng Huang [email protected] Tiejun Video Signature based on Inter-frame Correlation m15139 Huang Coefficients yhtian@ @pku.edu.cn Yonghong Tian [email protected] m15140 [email protected] Visual Signature based on Waston Perceptual Model [email protected] Hyouk-Jean Cha Editor's study text of ISO/IEC 23000-8/FCD Portable m15141 Tae Hyeon Kim video application format Herbert Thoma Jihun Cha Injae Lee Considerations on Integrating LASeR and DID m15142 Young-Kwon Lim Technologies for WIM TV KyungAe Moon Jinwoo Hong Jeongil Seo Seungkwon Beack CE on efficient decoding of a controllable object and an m15143 Kwang-ki Kim MBO Kyoungok Kang Jeongil Seo Seungkwon Beack Consideration on enhanced Karaoke processing for stereo m15144 Kwang-ki Kim FGO Kyeoungok Kang Oliver Wuebbolt m15145 Thoughts on Speech and Audio Evaluation Guidelines Johannes Boehm Stefan Döhla m15146 MPEG2-TS and RTP reception hint tracks Miska M. Hannuksela Stefan Döhla Extended sample grouping mechanism for the ISO Base m15147 Miska M. Hannuksela Media File Format Jani Peltotalo Proposed conformance files for ALC/FLUTE server file m15148 Miska M. Hannuksela format Khaled Mamou m15149 Titus Zaharia FAMC decoder conformance Françoise Prêteux Khaled Mamou Titus Zaharia m15150 FAMC integration into the MPEG-4 RefSoft Marius Preda Françoise Prêteux m15151 Markus Schnell Update on AAC-ELD Verification Test
35 Ralf Geiger Study Text of ISO/IEC 23000-10/CD Video Surveillance m15152 Gero Bäse Application Format Khaled Mamou Titus Zaharia m15153 Low-complexity approach for static mesh compression Marius Preda Françoise Prêteux m15154 Andreas Schneider Update on MPEG Surround Conformance Werner Oomen Evaluation criteria and test items for unified speech and m15155 Erik Schuijers audio coding Dandan Ding Marco Mattavelli Update of Classification of Tokens for FUs of MPEG-4 m15156 Christophe Lucarz SP and MPEG-4/AVC in RVC Framework Lu Yu James Annesley Video Surveillance Application Format: Reference m15157 James Orwell Software Kristofer Kjörling Homework according to the joint speech and audio m15158 Heiko Purnhagen workplan Christophe Lucarz Dandan Ding BSDL Description of MPEG-4 SP and AVC BP m15159 Jianjun Li Bitstream Syntax for RVC Framework Marco Mattavelli Kristofer Kjörling Thoughts on evaluation criteria for joint speech and m15160 Heiko Purnhagen audio workitem Andreas Schneider Proposed correction to PS conformance and reference m15161 Heiko Purnhagen software Cross Verification of SAOC CE on Karaoke m15162 Jonas Engdegard enhancement Christophe Lucarz Jianjun Li Auto-generation of RVC Parser from BSDL Syntax m15163 Marco Mattavelli Description: Variable Length Decoding Dandan Ding Christophe Lucarz Jianjun Li Functional Units for RVC Toolbox: Variable Length m15164 Marco Mattavelli Decoding Dandan Ding Ralf Geiger m15165 Markus Multrus Comments on Speech and Audio Evaluation Guidelines Bernhard Grill Dandan Ding Function Units for Conversion from Syntax to Sequence m15166 Christophe Lucarz of Tokens: BTYPE
36 Marco Mattavelli Lu Yu M. Raulet G. Roquier m15167 M. Wipliez Update of CAL2C code generation JF. Nezan O. Deforges m15168 Florian Schreiner Open Access Application Format: Reference Software Paul Brasnett m15169 Correction to Image Signature XM Software Miroslaw Bober Paul Brasnett Performance Evaluation of Image Signature on Extended m15170 Miroslaw Bober Database GENB comments on the Study of the MPEG-21 REL m15171 Florian Schreiner Open Access Profile FPDAM Paul Brasnett Extending the Trace Transform Image Signature to m15172 Miroslaw Bober Complex Conditions Dave Singer Editors' Input to ISO/IEC 14496-15/FPDAM 2 (SVC File m15173 Ye-Kui Wang Format) Thomas Rathgen Olgierd Stankiewicz. m15174 Depth Map Estimation Software Krzysztof Wegner. Olgierd Stankiewicz. m15175 Depth Map Estimation Software Krzysztof Wegner. Masanori Sano m15176 Hideki Sumiyoshi Paging function in MPEG Query Format Nobuyuki Yagi Masanori Sano Interpretation Consistency for SpatialQuery and m15177 Hideki Sumiyoshi TemporalQuery Nobuyuki Yagi m15178 David Singer Codec-independent color information in part 12 files m15179 DW Singer Backwards-compatibility for alternate groups Manuela Schinn m15180 WD on Audio part of MPEG-4 Conformance Ralph Sperschneider m15181 Karol Wnukowicz Cross verification result of Image Signature (VCE-6) Noboru Harada Updated requirements on Professional Archival m15182 Takehiro Moriya Application Format Yutaka Kamamoto Noboru Harada Proposed update of MPEG-4 ALS reference software for m15183 Takehiro Moriya OAFI Yutaka Kamamoto
37 Proposed workplan for Portable video application format m15184 Hyouk-Jean Cha conformance Shun-ichi Sekiguchi Kenji Otoi 4:4:4 video coding performance with adaptive motion m15185 Yoshihisa Yamada vector coding Kohtaro Asai Tokumichi Murakami m15186 KNB KNB Comments on RVC Hui Yong Kim Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: m15187 Houari Sabirin Conformance and Reference software Munchurl Kim Hui Yong Kim MyungSeok Ki Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: m15188 Gun Bang timescale of TS Yong Han Kim Hui Yong Kim Gun Bang Proposed WD on 14496-12 ISO-FF Amendment: MPEG- m15189 MyungSeok Ki 2 TS storage Han-Kyu Lee Yong Han Kim m15190 SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23001-5 Gi-Mun Um Taeone Kim Segment-based Disparity Estimation using Foreground m15191 Namho Hur Separation Jinwoong Kim m15192 SC 29 Secretariat Table of Replies on ISO/IEC 14496-20:2006/FDAM 1 Summary of Voting on ISO/IEC 14496-4:2004/PDAM m15193 SC 29 Secretariat 31 Summary of Voting on ISO/IEC 14496-5:2001/PDAM m15194 SC 29 Secretariat 19 Xiaozhong Xu Transport of GB 20090.2 video data over ITU-T Rec. m15195 Xilin Chen H.222.0 | ISO/IEC 13818-1 Tiejun Huang Hideaki Kimata Proposal on Profiles for MVC (Multi-view Video m15196 Hiroya Nakamura Coding) Takashi Itoh [email protected]. m15197 Proposal for Video Signature [email protected]. Jeong-Hwan Ahn. KNB Comment on 14496-16:2006/AMD1.Corr1 (3D m15198 Daiyong Kim. Mesh Coding Extension Correction) Euee S. Jang
38 Hyungyu Kim Giseok Son Byeongjun Kim Proposed text of CCR CD: A section for DD m15199 Sinwook Lee transmission Chungku Yie Euee S. Jang Dandan Ding Lu Yu BSDL Description of AVS Bitstream Syntax for RVC m15200 Honggang Qi Framework Tiejun Huang Wen Gao Nikolce Stefanoski GNB comments on ISO/IEC 14496-16:2006/PDAM 2 m15201 Jörn Ostermann (FAMC) Honggang Qi Tiejun Huang m15202 Wen Gao Text Description for Bitstream Parser FU of AVS Dandan Ding Lu Yu Updated WD 23000-11 for Stereoscopic Video m15203 Kyuheon Kim Application Format KNB Response to Clause 3.2.2 of 82nd MPEG Shenzhen m15204 National Body of KOREA Meeting Resolution Proposed Working Draft of ISO/IEC 23000-5 2nd m15205 Filippo Chiariglione Edition L. Chiariglione Philip Merrill m15206 Luntian Mou WIM TV Trial at Beijing Olympics Olivier Avaro Xin Wang L. Chiariglione m15207 Requirements for Digital Item Presentation Olivier Avaro m15208 L. Chiariglione Requirements for MPEG eXtensible Middleware (MXM) Response to sc29n8883 Liaison from JVT on potential m15209 Wendy Aylsworth extension of SVC China NB Comments on Transport of GB 20090.2 video m15210 China National Body (CNNB) data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1 m15211 DVB via SC 29 Secretariat Liaison Statement from DVB [SC 29 N 9045] m15212 KNB KNB Comments on ISO/IEC 23000-4 2nd Edition FCD Zheng Huang m15213 Tiejun Huang Video Signature based on Mutual Infomation Yonghong Tian
39 ITU-T SG 16 via SC 29 m15214 Liaison Statement from ITU-T SG 16 Secretariat the DVD Forum via SC 29 m15215 Liaison Statement from the DVD Forum Secretariat Julie Lofton Working draft for proposed MPEG-M Production m15216 Jeff Steele Deliverables standard Paul Brasnett Updated Results on Extended Trace Transform Image m15217 Miroslaw Bober Signature Information exchange with Virtual Worlds (Metaverse1) m15218 Jean H.A. Gelissen Presentation Sikyung Kim m15219 Table of 3D models in the MPEG 3DGC repository Euee S. Jang m15220 Thomas Schierl for the GNB GNB comment on ISO/IEC 13818-1:2007/FDAM3 Jean-Claude Dufourd m15221 Joint LASeR/BIFS scene representation Olivier Avaro mxxxx webmaster Input contribution template Ohm r1000 Video Subgroup report for Friday Plenary Sullivan Sullivan r1001 JVT Report - Friday Plenary Ohm Sullivan r1002 JVT Report - Wednesday Plenary Ohm r1003 RVC RVC Report - Friday Plenary r1004 Tobias Oelbaum SVC Verification Report - Friday Plenary Ohm r1005 Video Subgroup Report - Wednesday Plenary Sullivan r1006 Miroslaw Bober MPEG-7 Visual Report - Friday Plenary List of Documents from the 83rd Meeting in Antalya, w9555 Convener Turkey w9556 Convener Resolutions of the 83rd Meeting in Antalya, Turkey List of AHGs Established at the 83rd Meeting in Antalya, w9557 Convener Turkey w9558 Convener Report of the 83rd Meeting in Antalya, Turkey Guidelines for Electronic Distribution of MPEG M and N w9559 Convener Documents w9560 Convener Press Release of the 83rd Meeting in Antalya, Turkey w9561 Convener Meeting Notice of the 84th Meeting in Archamps,
40 Switzerland w9562 HoD Guide for WG 11 Meeting Hosts w9563 video Request for 13818-2:2000/Amd.3 Text of ISO/IEC 13818-2:2000/PDAM 3 Level for w9564 Video 1080p/60 Support Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple w9565 Video Studio Profile Levels 5 and 6 w9566 Video Study Text of ISO/IEC 14496-2:2004/DCOR3 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple w9567 Video Studio Profile Levels 5 and 6 Conformance Testing Disposition of Comments on ISO/IEC 14496- w9568 Video 4:2004/PDAM 31 Text of ISO/IEC 14496-4:2004/FPDAM 31 w9569 Video Conformance Testing for Scalable Video Coding Disposition of Comments on ISO/IEC 14496- w9570 Video 5:2001/Amd.1:2002/DCOR 1 w9571 Video Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1 Disposition of Comments on ISO/IEC 14496- w9572 Video 5:2001/PDAM 19 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference w9573 Video Software for Scalable Video Coding w9574 Video Text of ISO/IEC 14496-10:200X/DCOR 1 Disposition of Comments on ISO/IEC 14496- w9575 Video 10:200X/PDAM 1 Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview w9576 Video Video Coding w9577 Video Report on SVC Verification Tests w9578 Video Joint Multiview Video Model (JMVM) 7 w9579 Video JMVM 7 Software w9580 Video Overview of Multiview Video Coding (MVC) Text of ISO/IEC 15938-3:2001/PDAM 3 Image w9581 Video Signature Tools Description of Core Experiments for MPEG-7 New w9582 Video Visual Extensions w9583 Video Request for 13818-4:2004/Amd.3 w9584 Video Study Text of ISO/IEC CD 23001-4 Codec Configuration
41 Representation w9585 Video Reconfigurable Video Coding Requirements V 4.0 w9586 Video Overview of Reconfigurable Video Coding (RVC) w9587 Video Study Text of CD ISO/IEC 23002-4 Video Tool Library w9588 Video Extensions of Video Tool Library under consideration w9589 Video Description of Core Experiments in RVC w9590 Video RVC Simulation Model (RSM) V7.0 w9591 Video RVC Work Plan and FU Development Status w9592 Video RVC Conformance Testing Working Draft V4.0 w9593 Video Description of Exploration Experiments in RVC w9594 Video Methodologies for Video Toolbox Extension V2.0 Call for Contributions on 3D Video Test Material w9595 Video (Update) w9596 Video Description of Exploration Experiments in 3D Video AHG on Maintenance of MPEG-4 Visual related w9597 Convener Documents, Reference Software and Conformance w9598 Convener AHG on Reconfigurable Video Coding w9599 Convener AHG on MPEG-7 Visual w9600 Convener Terms of reference w9601 Convener MPEG Standards w9602 Convener Table of unpublished FDISs w9603 Convener Work plan and time line w9604 Convener Editors of MPEG standards w9605 Convener Schema assets w9606 Convener Software assets w9607 Convener Conformance assets w9608 Convener Content assets w9609 Convener URI assets Standards under development for which a call for patent w9610 Convener statements is issued List of Organisations with which MPEG entertains w9611 Convener liaisons w9612 DELETED DELETED
42 w9613 Convener AHG on FTV w9614 Convener Liaison Statement to SMPTE re RVC w9615 Convener Liaison Statement to ITU-T SG 9 re FTV w9616 Convener Liaison Statement to ITU-T SG 9 re Bitstream Splicing Liaison Statement template for various organizations re w9617 Convener SVC verification testing report Text of ISO/IEC 13818-4:2004/PDAM 3 Level for w9618 Video 1080@50/60p Conformance Testing w9619 Audio Workplan for AAC-ELD Verification Test DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS w9620 Audio Conformance w9621 Audio ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric w9622 Audio Stereo w9623 Audio ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, w9624 Audio OAFI and additional AAC Conformance DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR w9625 Audio Conformance w9626 Audio ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance w9627 Audio MPEG-4 Audio Conformance Rollup w9628 Audio ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD w9629 Audio Reference Sw. Study on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 w9630 Audio and -2 Audio in MPEG-4 and BSAC Extensions DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG w9631 Audio Surround Conformance ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround w9632 Audio Conformance Workplan on further issues for MPEG Surround w9633 Audio Conformance DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG w9634 Audio Surround Reference Sw. ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround w9635 Audio Reference Sw.
43 w9636 Audio Status and Workplan on SAOC Core Experiments w9637 Audio WD on SAOC Text and Reference Software Evaluation Guidelines for Unified Speech and Audio w9638 Audio Proposals w9639 Audio Workplan on Speech and Audio Material Selection w9640 Audio Draft Workplan Evaluation Subjective Tests w9641 Convener Liaison Statement to ETSI TC DECT Study on PDAM of ISO/IEC 14496-4:2004 AMD32 w9642 3DGC (FAMC Conformance) Study on PDAM of ISO/IEC 14496-4:2004 AMD33 w9643 3DGC (MultiResolution Profile Conformance) ISO/IEC 14496-4:2004 PDAM 34 (3DGCM w9644 3DGC Conformance) w9645 3DGC ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft) w9646 3DGC Study of ISO/IEC 14496-16:2006/AMD1/DCOR1 DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based w9647 3DGC Animated Mesh Compression) Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based w9648 3DGC Animated Mesh Compression) w9649 3DGC WD2.0 of AFX 3rd Edition w9650 3DGC Requirements for low-complexity 3D mesh compression w9651 3DGC CfP for low-complexity 3D mesh compression w9652 3DGC Study of CD of ISO/IEC 14496-25 w9653 Convener AHG on Audio Standards Maintenance AHG on Unified Speech and Audio Coding and SAOC w9654 Convener and AAC-ELD w9655 Convener AHG on Information Exchange with Virtual Worlds w9656 Convener AHG on the RoSE Framework w9657 Convener AHG on Requirements for Media Value Chain Ontology w9658 Requirements Requirements for a Media Value Chain Ontology w9659 Requirements Requirements on RoSE Framework w9660 Convener Liaison Statement to ITU-T SG 16 w9661 Convener AHG on 3DG documents and software maintenance w9662 Convener Ad Hoc Group on Scene Representation
44 w9663 Convener Ad Hoc Group on MPEG File Formats w9664 Convener Ad Hoc Group on Application Format w9665 Convener Ad Hoc Group on Presentation of Structured Information AHG on Requirements for MPEG Post Production w9666 Convener Deliverable Formats w9667 Convener AHG on MPEG Query Format w9668 Convener AHG on Font Format Representation Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of w9669 Systems SVC in MPEG-2 Systems Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to w9670 Systems the carriage of AVC DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music w9671 Systems Representation Ref. Soft. Text of ISO/IEC 14496-5/FDAM16 Symbolic Music w9672 Systems Representation Ref. Soft. w9673 Systems DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft. w9674 Systems Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft. WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 w9675 Systems Systems Architecture w9676 Systems Request for Amendment of ISO/IEC 14496-11 w9677 Systems ISO/IEC 14496-11 PDAM6 Scene Partitionning w9678 Systems Text of ISO/IEC 14496-12 3rd Edition w9679 Systems WD1.0 of Corrigendum on ISO/IEC 14496-12 w9680 Systems Updated Technology under Consideration for Part 12 DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format w9681 Systems Extension Text of ISO/IEC 14496-15/FDAM2 SVC File Format w9682 Systems Extension w9683 Systems Request for 14496-22 2nd Edition w9684 Systems Text of CD ISO/IEC 14496-22 2nd Edition Items for consideration for Corrigendum or Amendment w9685 Systems of MPEG-21 DIA DoC on ISO/IEC 21000-5/FPDAM3 Open Access w9686 Systems Content Profile w9687 Systems Text of ISO/IEC 21000-5/FDAM3 Open Access Content
45 Profile MPEG-21 REL Profiles Software Implementation Plan w9688 Systems v.9 w9689 Systems MAF Overview Document w9690 Systems MAF Overview Presentation Study Text of ISO/IEC FCD 23000-4 Musical Slide w9691 Systems Show 2nd Edition Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS w9692 Systems Application Format Conf. and Ref. Software Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media w9693 Systems Streaming Application Format Requirements on Professional Archival Application w9694 Systems Format w9695 DELETED DELETED Text of ISO/IEC CD 23000-6 Professional Archival w9696 Systems Application Format DoC of ISO/IEC FCD 23000-7 Open Access Application w9697 Systems Format Text of ISO/IEC FDIS 23000-7 Open Access Application w9698 Systems Format w9699 Systems Request of Amendment for ISO/IEC 23000-7 Text of ISO/IEC PDAM1 23000-7 Conformance and w9700 Systems Reference Software Study Text of ISO/IEC 23000-8/FCD Portable Video w9701 Systems Application Format Workplan for Portable Video Application Format w9702 Systems Conformance and Ref. Soft. Text of ISO/IEC 23000-9/DCOR1 (MAF Application w9703 Systems Format) Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance w9704 Systems and Reference Software DoC on ISO/IEC CD 23000-10 (Video Surveillance w9705 Systems Application Format) Text of ISO/IEC FCD 23000-10 (Video Surveillance w9706 Systems Application Format) Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance w9707 Systems and Reference Software
46 Future Work on Surveillance AF's - collection of w9708 Systems requirements Text of ISO/IEC CD 23000-11 (Stereoscopic Video w9709 Systems Application Format) Requirements for MPEG Post Production Deliverable w9710 Systems Formats Gap Analysis between Post Production Deliverable w9711 Systems Requirements and Proposed Working Draft Text of WD1.0 MPEG Post Production Deliverable w9712 Systems Formats w9713 Systems Requirements for MXM (MPEG eXtensible Middleware) w9714 Convener Liaison to JPEG on ISO Base Format w9715 Systems Requirements for Presentation of Structured Information Preliminary WD of Presentation of Structured w9716 Systems Information w9717 Systems Requirements on WIM TV w9718 Convener Response to DVB on File Format w9719 Convener Response to DVB on Carriage and Storage of SVC w9720 Convener Response to JPEG on Query Format w9721 Convener Liaison to JTC1/SWG-ARM on PA Application Format w9722 Convener Liaison to SMPTE on PA Application Format w9723 Convener Liaison to TC20/SC13 on PA Application Format w9724 Convener Liaison to JPEG on PA Application Format w9725 Convener Response to JTC1/SC34 w9726 Convener Liaison to ITU-T SG16 on IPTV Liaison to Creative Common on Open Access w9727 Convener Application Format w9728 Convener Liaison to SMPTE on Post-Production Deliverables w9729 Convener Liaison to NAB on Post-Production Deliverables w9730 Convener Liaison to ATSC on Post-Production Deliverables w9731 Convener Liaison to MPAA on Post-Production Deliverables
47 Annex D– Output documents
Number Source Title w9555 Convener List of Documents from the 83rd Meeting in Antalya, Turkey w9556 Convener Resolutions of the 83rd Meeting in Antalya, Turkey w9557 Convener List of AHGs Established at the 83rd Meeting in Antalya, Turkey w9558 Convener Report of the 83rd Meeting in Antalya, Turkey w9559 Convener Guidelines for Electronic Distribution of MPEG M and N Documents w9560 Convener Press Release of the 83rd Meeting in Antalya, Turkey w9561 Convener Meeting Notice of the 84th Meeting in Archamps, Switzerland w9562 HoD Guide for WG 11 Meeting Hosts w9563 video Request for 13818-2:2000/Amd.3 w9564 Video Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080p/60 Support Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio Profile w9565 Video Levels 5 and 6 w9566 Video Study Text of ISO/IEC 14496-2:2004/DCOR3 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio Profile w9567 Video Levels 5 and 6 Conformance Testing w9568 Video Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31 Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for w9569 Video Scalable Video Coding Disposition of Comments on ISO/IEC 14496-5:2001/Amd.1:2002/DCOR w9570 Video 1 w9571 Video Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1 w9572 Video Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for w9573 Video Scalable Video Coding w9574 Video Text of ISO/IEC 14496-10:200X/DCOR 1 w9575 Video Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 w9576 Video Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding w9577 Video Report on SVC Verification Tests w9578 Video Joint Multiview Video Model (JMVM) 7
48 w9579 Video JMVM 7 Software w9580 Video Overview of Multiview Video Coding (MVC) w9581 Video Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools w9582 Video Description of Core Experiments for MPEG-7 New Visual Extensions w9583 Video Request for 13818-4:2004/Amd.3 w9584 Video Study Text of ISO/IEC CD 23001-4 Codec Configuration Representation w9585 Video Reconfigurable Video Coding Requirements V 4.0 w9586 Video Overview of Reconfigurable Video Coding (RVC) w9587 Video Study Text of CD ISO/IEC 23002-4 Video Tool Library w9588 Video Extensions of Video Tool Library under consideration w9589 Video Description of Core Experiments in RVC w9590 Video RVC Simulation Model (RSM) V7.0 w9591 Video RVC Work Plan and FU Development Status w9592 Video RVC Conformance Testing Working Draft V4.0 w9593 Video Description of Exploration Experiments in RVC w9594 Video Methodologies for Video Toolbox Extension V2.0 w9595 Video Call for Contributions on 3D Video Test Material (Update) w9596 Video Description of Exploration Experiments in 3D Video AHG on Maintenance of MPEG-4 Visual related Documents, Reference w9597 Convener Software and Conformance w9598 Convener AHG on Reconfigurable Video Coding w9599 Convener AHG on MPEG-7 Visual w9600 Convener Terms of reference w9601 Convener MPEG Standards w9602 Convener Table of unpublished FDISs w9603 Convener Work plan and time line w9604 Convener Editors of MPEG standards w9605 Convener Schema assets w9606 Convener Software assets w9607 Convener Conformance assets w9608 Convener Content assets w9609 Convener URI assets
49 Standards under development for which a call for patent statements is w9610 Convener issued w9611 Convener List of Organisations with which MPEG entertains liaisons w9612 DELETED DELETED w9613 Convener AHG on FTV w9614 Convener Liaison Statement to SMPTE re RVC w9615 Convener Liaison Statement to ITU-T SG 9 re FTV w9616 Convener Liaison Statement to ITU-T SG 9 re Bitstream Splicing Liaison Statement template for various organizations re SVC verification w9617 Convener testing report Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p w9618 Video Conformance Testing w9619 Audio Workplan for AAC-ELD Verification Test w9620 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance w9621 Audio ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance w9622 Audio ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo w9623 Audio ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and w9624 Audio additional AAC Conformance w9625 Audio DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance w9626 Audio ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance w9627 Audio MPEG-4 Audio Conformance Rollup w9628 Audio ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS w9629 Audio ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw. Study on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 Audio in w9630 Audio MPEG-4 and BSAC Extensions DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround w9631 Audio Conformance w9632 Audio ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance w9633 Audio Workplan on further issues for MPEG Surround Conformance DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround Reference w9634 Audio Sw. w9635 Audio ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw. w9636 Audio Status and Workplan on SAOC Core Experiments
50 w9637 Audio WD on SAOC Text and Reference Software w9638 Audio Evaluation Guidelines for Unified Speech and Audio Proposals w9639 Audio Workplan on Speech and Audio Material Selection w9640 Audio Draft Workplan Evaluation Subjective Tests w9641 Convener Liaison Statement to ETSI TC DECT Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC w9642 3DGC Conformance) Study on PDAM of ISO/IEC 14496-4:2004 AMD33 (MultiResolution w9643 3DGC Profile Conformance) w9644 3DGC ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance) w9645 3DGC ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft) w9646 3DGC Study of ISO/IEC 14496-16:2006/AMD1/DCOR1 DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated Mesh w9647 3DGC Compression) Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Mesh w9648 3DGC Compression) w9649 3DGC WD2.0 of AFX 3rd Edition w9650 3DGC Requirements for low-complexity 3D mesh compression w9651 3DGC CfP for low-complexity 3D mesh compression w9652 3DGC Study of CD of ISO/IEC 14496-25 w9653 Convener AHG on Audio Standards Maintenance w9654 Convener AHG on Unified Speech and Audio Coding and SAOC and AAC-ELD w9655 Convener AHG on Information Exchange with Virtual Worlds w9656 Convener AHG on the RoSE Framework w9657 Convener AHG on Requirements for Media Value Chain Ontology w9658 Requirements Requirements for a Media Value Chain Ontology w9659 Requirements Requirements on RoSE Framework w9660 Convener Liaison Statement to ITU-T SG 16 w9661 Convener AHG on 3DG documents and software maintenance w9662 Convener Ad Hoc Group on Scene Representation w9663 Convener Ad Hoc Group on MPEG File Formats w9664 Convener Ad Hoc Group on Application Format w9665 Convener Ad Hoc Group on Presentation of Structured Information
51 w9666 Convener AHG on Requirements for MPEG Post Production Deliverable Formats w9667 Convener AHG on MPEG Query Format w9668 Convener AHG on Font Format Representation Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 w9669 Systems Systems Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of w9670 Systems AVC DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation w9671 Systems Ref. Soft. Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. w9672 Systems Soft. w9673 Systems DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft. w9674 Systems Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft. WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems w9675 Systems Architecture w9676 Systems Request for Amendment of ISO/IEC 14496-11 w9677 Systems ISO/IEC 14496-11 PDAM6 Scene Partitionning w9678 Systems Text of ISO/IEC 14496-12 3rd Edition w9679 Systems WD1.0 of Corrigendum on ISO/IEC 14496-12 w9680 Systems Updated Technology under Consideration for Part 12 w9681 Systems DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension w9682 Systems Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension w9683 Systems Request for 14496-22 2nd Edition w9684 Systems Text of CD ISO/IEC 14496-22 2nd Edition w9685 Systems Items for consideration for Corrigendum or Amendment of MPEG-21 DIA w9686 Systems DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile w9687 Systems Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile w9688 Systems MPEG-21 REL Profiles Software Implementation Plan v.9 w9689 Systems MAF Overview Document w9690 Systems MAF Overview Presentation w9691 Systems Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format w9692 Systems Conf. and Ref. Software w9693 Systems Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming
52 Application Format w9694 Systems Requirements on Professional Archival Application Format w9695 DELETED DELETED w9696 Systems Text of ISO/IEC CD 23000-6 Professional Archival Application Format w9697 Systems DoC of ISO/IEC FCD 23000-7 Open Access Application Format w9698 Systems Text of ISO/IEC FDIS 23000-7 Open Access Application Format w9699 Systems Request of Amendment for ISO/IEC 23000-7 w9700 Systems Text of ISO/IEC PDAM1 23000-7 Conformance and Reference Software w9701 Systems Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format Workplan for Portable Video Application Format Conformance and Ref. w9702 Systems Soft. w9703 Systems Text of ISO/IEC 23000-9/DCOR1 (MAF Application Format) Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference w9704 Systems Software w9705 Systems DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format) w9706 Systems Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format) Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference w9707 Systems Software w9708 Systems Future Work on Surveillance AF's - collection of requirements w9709 Systems Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format) w9710 Systems Requirements for MPEG Post Production Deliverable Formats Gap Analysis between Post Production Deliverable Requirements and w9711 Systems Proposed Working Draft w9712 Systems Text of WD1.0 MPEG Post Production Deliverable Formats w9713 Systems Requirements for MXM (MPEG eXtensible Middleware) w9714 Convener Liaison to JPEG on ISO Base Format w9715 Systems Requirements for Presentation of Structured Information w9716 Systems Preliminary WD of Presentation of Structured Information w9717 Systems Requirements on WIM TV w9718 Convener Response to DVB on File Format w9719 Convener Response to DVB on Carriage and Storage of SVC w9720 Convener Response to JPEG on Query Format w9721 Convener Liaison to JTC1/SWG-ARM on PA Application Format
53 w9722 Convener Liaison to SMPTE on PA Application Format w9723 Convener Liaison to TC20/SC13 on PA Application Format w9724 Convener Liaison to JPEG on PA Application Format w9725 Convener Response to JTC1/SC34 w9726 Convener Liaison to ITU-T SG16 on IPTV w9727 Convener Liaison to Creative Common on Open Access Application Format w9728 Convener Liaison to SMPTE on Post-Production Deliverables w9729 Convener Liaison to NAB on Post-Production Deliverables w9730 Convener Liaison to ATSC on Post-Production Deliverables w9731 Convener Liaison to MPAA on Post-Production Deliverables w9732 Convener Liaison to EBU on Post-Production Deliverables w9733 Convener Liaison to IEC TC100 TA6 on Post-Production Deliverables w9734 Convener Liaison to IFPI on Post-Production Deliverables w9735 Convener Liaison to DMP on Presentation of Structured Information w9736 Convener Liaison to IEC TC 9/WG 43 on Video Surveillance AF
54 Annex E – Requirements report
Source: Jörn Ostermann (Leibniz Universität Hannover)
15 Requirements documents approved at this meeting w9658 Requirements for a Media Value Chain Ontology w9659 Requirements on RoSE Framework
16 MPEG-V: Information exchange with virtual worlds
Several use cases were discussed. Until the next meeting, these use cases have to be verified. A call for proposals and evaluation methods for the proposals have to be prepared for the next meeting such that a call for proposals can be issued. The tentative work plan set is WD in July or October 2008 and CD in January 2009.
17 Explorations
17.1 IPTV Requirements No input on this topic was brought to the meeting. Since IPTV is an important area of work for MPEG, a joint meeting with Systems was held clarifying the contributions MPEG (codecs, IPMP, streaming) could make in this field. As a result, liaison from SG16 on Meta data, M3W, and WimTV are requested.
The IPTV Requirements document (N9167) has not been updated.
17.2 Rose
Requirements for Rose, the representation of sensory effects, were discussed. At the next meeting, the group plans to issue a Call for Proposals.
Input documents: m15083 Requirements on RoSE Framework m15091 Requirements on Framework for RoSE
17.3 Media Value Chain Ontology This topic has been discussed within MPEG for several meetings. It is now time to clarify a time line for standardization. Therefore, it was decided to prepare a requirements document and a Call for Proposals until the next meeting. At the next meeting, the decision on the documents will determine whether MPEG is going forward with this activity.
55 Input documents: m15031 Ad Hoc Group on Requirements of Media Value Chains Ontologies m15092 A Common Core IP Model
17.4 Future Work Items A review of the work areas of MPEG was started. Currently, MPEG focuses on the consumer market. Professional profiles of MPEG are provided such that consumer content can be produced efficiently. The standards define only the decoder, they do not define rendering. MPEG does not require or profile the use of error resilience features since the transport channel is assumed transparent.
In order to widen the usage of MPEG standards, MPEG members are requested to bring proposals for new work items to the next meetings. Possible items identified at the meeting are: Tools for consumer content creation, manipulation, annotation, distribution and privacy High efficiency video coding considering new colour spaces High efficiency audio coding Control of display setting from contents Accessibility including e-inclusion and alternative output devices for disabled Capture and presentation of smell, touch, vibrations and emotions
At the next meeting, potential work items will be discussed further.
56 Annex F – Systems report
Source: Systems Chair and Break-out group Chairs Editor: Olivier Avaro (Streamezzo) Contributors: David Singer (Apple), Young-Kwon Lim (Net&TV), Jean Gelissen (Philips), Gero Baese (Siemens)
18 Overview The main outputs of the meeting from the Systems Sub-group perspective are:
No. Title X 13818-1 MPEG-2 Systems 9669 Text ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems 9670 Text of ISO/IEC 13818-1:2007/Cor.2 WD2.0 related to the carriage of AVC X 14496-5 Reference Software 9671 DoC on ISO/IEC 14496-5/FPDAM16 Symbolic Music Representation Ref. Soft. 9672 Text of ISO/IEC 14496-5/FDAM16 Symbolic Music Representation Ref. Soft. 9673 DoC on ISO/IEC 14496-5/FPDAM17 LASeR Ref. Soft. 9674 Text of ISO/IEC 14496-5/FDAM17 LASeR Ref. Soft. X 14496-11 Scene Representation 9675 WD1.0 of Use of LASeR jointly with BIFS in MPEG-4 Systems Architecture 9676 Request for Amendment of ISO/IEC 14496-11 9677 ISO/IEC 14496-11 PDAM6 Scene Partitionning X 14496-12 ISO Base Media File Format 9678 Text of ISO/IEC 14496-12 3rd Edition 9679 WD1.0 of Corrigendum on ISO/IEC 14496-12 9680 Updated Technology under Consideration for Part 12 X 14496-15 AVC File Format 9681 DoC on ISO/IEC 14496-15/FPDAM2 SVC File Format Extension 9682 Text of ISO/IEC 14496-15/FDAM2 SVC File Format Extension X 14496-22 Open Font Format 9683 Request for 14496-22 2nd Edition 9684 Text of ISO/IEC 2nd Edition 14496-22 X 21000 General 9685 Items for consideration for Corrigendum or Amendment of MPEG-21 DIA X 21000-5 Rights Expression Language 9686 DoC on ISO/IEC 21000-5/FPDAM3 Open Access Content Profile 9687 Text of ISO/IEC 21000-5/FDAM3 Open Access Content Profile 9688 MPEG-21 REL Profiles Software Implementation Plan v.9 X 23000 General 9689 MAF Overview Document 9690 MAF Overview Presentation X 23000-4 Musical Slide Show Application Format 9691 Study Text of ISO/IEC FCD 23000-4 Musical Slide Show 2nd Edition 9692 Study Text of ISO/IEC 23000-4:200x/PDAM1 MSS Application Format Conf. and Ref. Software X 23000-5 Media Streaming Application Format Text of ISO/IEC 23000-5 2nd Edition WD1.0 Media Streaming Application 9693 Format X 23000-6 Professional Archival Application Format 9694 Requirements on Professional Archival Application Format 9695 Request for ISO/IEC 23000-6 Professional Archival Application Format
57 9696 Text of ISO/IEC CD 23000-6 Professional Archival Application Format X 23000-7 Open Access Application Format 9697 DoC of ISO/IEC FCD 23000-7 Open Access Application Format 9698 Text of ISO/IEC FDIS 23000-7 Open Access Application Format 9699 Request of Amendment for ISO/IEC 23000-7 9700 Text of ISO/IEC PDAM1 23000-7 Conformance and Reference Software X 23000-8 Portable Video Application Format 9701 Study Text of ISO/IEC 23000-8/FCD Portable Video Application Format 9702 Workplan for Portable Video Application Format Conformance and Ref. Soft. X 23000-9 Digital Multimedia Broadcasting Application Format 9703 Text of ISO/IEC 23000-9/DCOR1 (DMB Application Format) 9704 Text of ISO/IEC 23000-9/AMD1 WD1.0 Conformance and Reference Software X 23000-10 Video Surveillance Application Format 9705 DoC on ISO/IEC CD 23000-10 (Video Surveillance Application Format) 9706 Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format) 9707 Text of ISO/IEC 23000-10/AMD1 WD1.0 Conformance and Reference Software 9708 Future Work on Surveillance AF's – collection of requirements X 23000-11 Stereoscopic Video Application Format 9709 Text of ISO/IEC CD 23000-11 (Stereoscopic Video Application Format) X XXX Post Production Deliverable Formats 9710 Requirements for MPEG Post Production Deliverable Formats 9711 Gap Analysis between PPD Requirements and Proposed Working Draft 9712 Text of WD1.0 MPEG Post Production Deliverable Formats X Exploration 9713 Requirements for MXM (MPEG eXtensible Middleware) 9715 Requirements for Presentation of Structured Information 9716 Preliminary WD of Presentation of Structured Information 9717 Requirements on WIM TV X Assets and Standing Documents 9605 MPEG Schema Assets Updates X Liaison 9718 Response to DVB on File Format 9719 Response to DVB on Carriage and Storage of SVC 9720 Response to JPEG on Query Format 9721 Liaison to JTC1/SWG-ARM on PA Application Format 9722 Liaison to SMPTE on PA Application Format 9723 Liaison to TC20/SC13 on PA Application Format 9724 Liaison to JPEG on PA Application Format 9725 Response to JTC1/SC34 9726 Liaison to ITU-T SG16 on IPTV 9727 Liaison to Creative Common on Open Access Application Format 9728 Liaison to SMPTE on Post-Production Deliverables 9729 Liaison to NAB on Post-Production Deliverables 9730 Liaison to ATSC on Post-Production Deliverables 9731 Liaison to MPAA on Post-Production Deliverables 9732 Liaison to EBU on Post-Production Deliverables 9733 Liaison to IEC TC100 TA6 on Post-Production Deliverables 9734 Liaison to IFPI on Post-Production Deliverables 9735 Liaison to DMP on Presentation of Structured Information 9736 Liaison to ITU-T TC 9 WG43 on Video Surveillance AF 9714 Liaison to JPEG on ISO Base Format
58 19 General issues
19.1 General The meeting report from Shenzhen has been approved. The following demonstrations have been made: None.
19.2 List of standards under development P Pt Edit. Project Description CfP WD CD FCD FDIS
2 1 2006 Amd.3 SVC in MPEG-2 07/07 08/01 08/07 Systems 2 1 2006 Cor.2 Transport of AVC 08/04 08/07 Specification 4 1 200x Amd.4 Registration Authority 07/10 08/04 08/07 4 4 2007 Amd.26 Open Font Format Conf. 07/04 07/10 08/04 4 4 2007 Amd.27 LASeR Amd.1 06/10 07/07 07/10 08/04 Conformance 4 4 2007 Amd.xx SVC File Format Conf. TBS 4 5 2007 Amd.14 Open Font Format Ref. 07/10 08/04 08/10 09/01 Soft 4 5 2007 Amd.xx AVC File Format Ref. TBS Soft 4 5 2007 Amd.xx SVC File Format Ref. TBS Soft 4 5 2007 Amd.xx Synthesized Texture 08/04 08/07 09/01 Ref. Soft 4 11 2005 Amd.6 Scene Partitionning 08/01 08/04 08/10 7 5 2008 Amd.4. Improvements to 08/04 geographic descriptor 7 7 2008 Amd.3. Improvements to 08/04 geographic descriptor conformance 7 12 2008 1st Ed. MPEG Query Format 07/10 08/04 21 8 200x Amd.1 Minor Enhancement 07/10 08/04 08/07 21 9 200x Amd.1 MP21 Mime Type 07/04 07/10 08/04 21 15 200x Amd.1 Security in Event 08/04 Reporting A 4 200x 2nd Ed. Protected MSS AF 07/04 07/07 07/10 08/04 A 4 200x Amd.1 MSS AF Conf. and Soft 07/07 07/10 08/04 08/07 A 5 200x 2nd Ed. MS AF 08/01 08/04 08/10 09/04 A 6 200x 1st Ed. Professional Archival 07/10 08/01 08/04 08/10 AF A 7 200x Amd.1 OA AF Ref. Soft and 08/01 08/04 08/10 Conf. A 8 200x 1st Ed. Portable Video Player 06/10 07/04 07/10 08/04 AF A 8 200x Amd.1 PVP AF Ref. Soft. And Conf.
59 A 9 200x Amd.1 DMB AF Ref .Soft. And Conf. A 10 200x 1st Ed. Video Surveillance AF 07/04 07/07 08/01 08/07 A 10 200x Amd.1 Video Surveillance AF A 11 200x 1st Ed. Stereoscopic Video AF 07/04 08/01 08/04 08/10 A 11 200x 1st Ed. SV AF Ref. Soft. And 08/07 08/10 09/04 09/07 Conf. B 2 200x Amd.1 Fragment Request Unit Ref. Soft. And Conf. E 8 200x 1st Ed. Ref. Soft. and 07/01 07/07 08/04 08/07 Conformance V 1 200x 1st Ed. Interface with Virtual 08/07 08/10 09/01 09/04 09/10 World
60 19.3 Standing Documents
Pr Pt Documents No. Meeting 1 1 MPEG-1 White Paper – Multiplex Format N7675 05/07 Nice 1 1 MPEG-1 White Paper – Terminal Architecture N7676 05/07 Nice 1 1 MPEG-1 White Paper – Multiplexing and N7677 05/07 Nice Synchronization 2 1 MPEG-2 White Paper – Multiplex Format N7678 05/07 Nice 2 1 MPEG-2 White Paper – Terminal Architecture N7679 05/07 Nice 2 1 MPEG-2 White Paper – Multiplexing and N7680 05/07 Nice Synchronization 2 11 MPEG-2 White Paper – MPEG-2 IPMP N7503 05/07 Poznan 4 1 MPEG-4 White Paper – MPEG-4 Systems N7504 05/07 Poznan 4 1 MPEG-4 White Paper – Terminal Architecture N7610 05/10 Nice 4 1 MPEG-4 White Paper – M4MuX N7921 06/01 Bangkok 4 1 MPEG-4 White Paper – OCI N8148 06/04 Montreux 4 6 MPEG-4 White Paper – DMIF N8149 06/04 Montreux 4 11 MPEG-4 White Paper – BIFS N7608 05/10 Nice 4 12 MPEG-4 White Paper – ISO File Format N8150 06/04 Montreux 4 14 MPEG-4 White Paper – MP4 File Format N7923 06/01 Bangkok 4 15 MPEG-4 White Paper – AVC FF N7924 06/01 Bangkok 4 13 White Paper on MPEG-4 IPMP N7505 05/07 Poznan 4 13 MPEG IPMP Extensions Overview N6338 04/03 München 4 17 White Paper on Streaming Text N7515 05/07 Poznan 4 18 White Paper on Font Compression and Streaming N7508 05/07 Poznan 4 20 Presentation Material on LASER N6969 05/01 Hong- Kong 4 20 White Paper on LASeR N7507 05/07 Poznan 4 22 White Paper on Open Font Format N7519 05/07 Poznan 7 1 MPEG-7 White Paper - MPEG-7 Systems N7509 05/07 Poznan 7 1 MPEG-7 White Paper – Terminal Architecture N8151 06/04 Montreux 21 9 MPEG-21 White Paper – MPEG-21 File Format N7925 06/01 Bangkok A X MPEG Application Format Overview N9421 07/10 Shenzhen A X MAF Overview Document N9691 08/01 Antalya A X MAF Overview Presentation N9690 08/01 Antalya B X MPEG-B White Paper – BinXML N7922 06/01 Bangkok E X MPEG Multimedia Middleware Context and N6335 04/03 München Objectives E X 1rst M3W White paper N7510 05/07 Poznan E X 2nd M3W White Paper : Architecture N8152 06/04 Montreux E X Tutorial on M3W N8153 06/04 Monreux E X M3W White Paper : Multimedia Middleware N8687 06/10 Hanzhou Architecture E X M3W White Paper : Multimedia API N8688 06/10 Hanzhou E X M3W White Paper : Component Model N8689 06/10 Hanzhou E X M3W White Paper : Resource and Quality N8690 06/10 Hanzhou Management E X M3W White Paper : Component Download N8691 06/10 Hanzhou E X M3W White Paper : Fault Management N8692 06/10 Hanzhou
61 E X M3W White Paper : System Integrity N8693 06/10 Hanzhou Management
62 19.4 Mailing Lists Reminder
Kindly Managed Topic Information by Liste Reflector : [email protected] List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/gen-sys General mailto:[email protected]? University of Systems subject=subscribe Klagenfurt List List-Archive: http://lists.uni-klu.ac.at/pipermail/gen-sys List-Help: mailto:[email protected]? subject=help Liste Reflector : [email protected] List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeg7-sys mailto:[email protected]? University of BiM subject=subscribe Klagenfurt List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg7-sys List-Help: mailto:[email protected]? subject=help Liste Reflector : [email protected] List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sys File mailto:[email protected]? University of Format subject=subscribe Klagenfurt List-Archive: http://lists.uni-klu.ac.at/pipermail/mp4-sys List-Help: mailto:[email protected]? subject=help Liste Reflector : [email protected] List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeg-laser mailto:[email protected]? University of LASeR subject=subscribe Klagenfurt List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laser List-Help: mailto:[email protected]? subject=help Liste Reflector : [email protected] List-Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/maf-sys University of MAF mailto:[email protected]?subject=subscribe Klagenfurt List-Archive: http://lists.uni-klu.ac.at/pipermail/maf-sys List-Help: mailto:[email protected]? subject=help MPEG-2 #1: Please subscribe via http://lists.uni- University of on File klu.ac.at/mailman/listinfo/isoff-transport. Please use only that Klagenfurt Format email address for posting messages with which you're subscribed. Otherwise the email won't be delivered. #2: The email address for posting messages is: isoff- [email protected]
63 #3: The archive is accessible via http://lists.uni- klu.ac.at/mailman/private/isoff-transport/ for list members only.
19.5 FAQ The FAQ were updated as needed.
19.6 AOB None.
64 20 MPEG-2 Systems (13818-1)
20.1 General M15210: China NB Comments on Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1. The Systems sub-group thanks the China NB for their input contribution on the carriage of AVS in MPEG-2 Systems. The Systems sub-group recommends to use mechanisms already in place to carry data in formats defines by organization external to MPEG (i.e. using the SMPTE registration authority). To the knowledge of the Systems sub-group, these mechanisms will fully satisfy the requirements of the China NB. In addition, the Systems sub-group noted the interest of the carriage of RVC on MPEG-2 Systems and welcome contributions in this area.
M15195: Transport of GB 20090.2 video data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1. Noted.
20.2 13818-1:2005 Amd.3 Carriage of SVC 20.2.1 Topics 1. Transport of Scalable Video Coding 20.2.2 Contributions M15220: Late GNB on ISO/IEC 13818-1:2007/FDAM3. Joint work was done to produce Study text of the Carriage of SVC. Technical Work in Progress.
20.3 13818-1:2005 DCOR.2 20.3.1 Topics 1. Coorigendum on the carriage of AVC 20.3.2 Contributions None. Technical Work in Progress.
21 MPEG-4 Conformance (14496-4)
21.1 14496-4 Amd.26 Open Font Format Conformance 21.1.1 Topics 1. Open Font Format Conformance 21.1.2 Contributions None. Technical Work in Progress.
65 21.2 14496-4 Amd.27 LASeR V2 Conformance 21.2.1 Topics 1. LASeR V2 Conformance 21.2.2 Contributions None. Technical Work in Progress.
22 MPEG-4 Reference Software (14496-5)
22.1 14496-5 Amd.14 22.1.1 Topics 1. Open Font Format Reference Software 22.1.2 Contributions None. Technical Work In Progress.
22.2 14496-5 Amd.16 22.2.1 Topics 1. Symbolic Music Representation Reference Software 22.2.2 Contributions M15059: Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 16 [SC 29 N 8926]. See DoC. M15079: Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 Symbolic Music Representation reference software. Taken as input to produce final text.
Technical Work Completed.
22.3 14496-5 Amd.17 22.3.1 Topics 1. LASeR Reference Software 22.3.2 Contributions M15060: Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 17 [SC 29 N 8927]. Technical Work Completed.
66 23 MPEG-4 ISO Base File Format (14496-12)
23.1 14496-12 ISO Base Media File format General 23.1.1 15093 editor's re-structure part 12 Thank you. Please publish this ASAP with the help of the secretariat. Two fixes needed to movie fragments, and a sentence fragment “A track” needs removing in track selection. 23.1.2 15147 extended sample groups This attacks a number of problems; having compact group ‘definition’ in-line in the mapping box, a different way to compress the mapping (absolute sample numbers), multiple group definitions for the same type (with separated mapping tables), and extended, possibly variable-length in-line ‘values’ in the mapping box. All except the last seem fairly straightforward. The absolute sample numbers was a stylistic point (no other sample table has absolute numbers). This design is also somewhat complicated by a desire to use the same definition material for both sample groups and timed meta-data. We’d like a resolution asking for input on improving the sample group design, overall, taking into account these (and maybe other) issues and opportunities. 23.1.3 15211 indexing This explains the timed meta-data use of the same structures. 23.1.4 15178 color info To systems plenary, please. 23.1.5 15179 alt. groups and backwards compatibility Interesting, but we have some compatibility issues here, and we ought to define better what the track header flags mean and what their required behavior is, as well. It may be time to lift the ‘track is disabled if it’s a hint track’ rule, also. Hold this to the next meeting also (since we’re holding the possible Corr.). 23.1.6 w9379 Deriving from Part 12 updated Please add this to the TuC (or amendment if it becomes one). 23.1.7 Other amendment material We agree to include the information on RTP recording hint tracks (including the RTCP format), from the DVB liaison.
23.2 14496-12:2008/Amd2 23.2.1 Topics 1. New Edition & Amd.2 23.2.2 Contributions M15093 : Proposed re-structured ISO Base Media File Format. Adopted as starting point for new edition.
23.2.3 15073 Part 12 FDAM 2 replies Thank you for 100% approval.
67 23.2.4 15133 corr. to alc/flute Actually this is a Corr. to part 12 in general. We probably want to wait to issue this as a Corr. to the 2008 edition of part 12. 23.2.5 15134 Additions to alc/flute The first item seems needed; interleaving to construct source symbols from source files is very awkward right now. The second may be (?) a mis-understanding; every hint sample is attached to a sample entry that provides this data. This goes into the TuC for the upcoming amendment. 23.2.6 15148 Alc/flute conformance Thank you. We’re not sure about the file extension, but also not sure what is best. The spreadsheet needs updating, and we need to find out how to handle a ‘large’ (12MB) conformance file. Then we can open an amendment to the conformance part. We’ll do that in Archamps, with all the other Corr.s and Amd.s.
23.3 General consensus on the MPEG-2 TS approach 23.3.1 What is a sequence? multiple program, of which single is a special case 23.3.2 Hint track overall timescale recommended at 90kHz, or an integer division or multiple thereof. 23.3.3 What is a sample? 23.3.3.1 either multi-program, single TS packet per sample 23.3.3.2 or single-program, multi TS packet per sample In case (a), sync sample table is present but empty. Sample groups may be used to mark the sync points of the programs. In case (b), samples that contain GOP boundaries should have a GOP boundary at the start of a sample. The sync sample table marks the samples which start GOPs, and if the sync sample table is absent, all the samples are at the start of GOPs. If the sync sample table is present but empty, the GOP positions are unknown and may be not at the start of samples. Case (b) covers the (unusual) case of one sample for the sequence. 23.3.3.3 what about preceding and trailing bytes? (TS and FEC) as examples 23.3.3.4 how do you tell the difference? sample-size/(188+prec+trail) = N 23.3.3.5 are PMT, PAT, OD etc. also still in-stream? yes, probably. the sample entry documents the initial OD/PAT/PMT conditions for all samples associated with it. If these change, a new sample entry is needed for the first sample at or after the change. If they are not in the sample entry, then they are dynamic and the stream must be scanned.
68 23.3.4 What is in a sample entry? 23.3.4.1 0 or more PMTs 23.3.4.2 0 or 1 PAT 23.3.4.3 0 or more OD 23.3.4.4 indication for whether sample times are exactly PCR times 23.3.4.5 Transport offset there is an issue that this only applies to one sample, and isn’t safe under say editing (or random access). we should warn that this field may need updating after e.g. editing. 23.3.4.6 optional boxes for format of preceding and trailing bytes (not defined here) 23.3.5 What are the hint track timestamps (stts)? They may be reception/transmission times or PCR times. But there is a recommendation that the PCR times be used, as these are piece-wise linear and the stts table then compacts sensibly. The big question is, do we have a packet structure that allows the documentation of a reception/transmission offset from the PCR time? 23.3.6 Special issues for recording support? de-hinting issues (error concealment) 23.3.7 Track references to associated media tracks (“de-hinting”, linking at authoring time) 23.3.8 Constructors that use track refs yes 23.3.9 Other matters to correct/amd in part 12 definition of a hint tracks
23.4 Part 12 MPEG-2 TS Storage 23.4.1 15146 MPEG-2 TS Hint Tracks Thank you for the introduction. We do need to think about de-hinting and error concealment. 23.4.2 15211 DVB MPEG-2 TS, (Indexing?) We need to decide whether to reply to the liaison. 23.4.3 15189 Proposed MPEG-2 TS storage Thank you for this initial start on the combined specification, to all on the reflector and particularly those in the Sunday pre-meeting. The editing team will take the notes developed in the meeting and make an input to the next meeting of the proposed amendment. The editors should prepare their best effort by Feb 8th. The editing team is Hui Yong Kim, Stefan Döhla, David Singer.
24 MPEG-4 AVC File Format (14496-15)
24.1 14496-15:2004/Amd.2 24.1.1 Topics
1. SVC File Format Extensions
69 24.1.2 15066 Part 15 FPDAM replies Processed, thank you. 24.1.3 15173 editor's SVC FF draft Accepted, according to the Finnish request, as the basis of future work.
Technical Work Completed.
24.2 14496-15:2004/Amd.3? MVC FF 24.2.1 Topics 1. MVC File Format Extensions
24.2.2 Contributions Lacking contributions, we’ll ask again, otherwise take things slowly. Technical Work in Progress.
70 25 LASeR (14496-20)
25.1 14496-20/Amd.xxx 25.1.1 Topics 1. LASeR Extensions
25.1.2 Contributions None.
Technical Work in Progress.
26 LASeR (14496-22) Open Font Format
26.1 14496-22/Amd.1 26.1.1 Topics
2. Open Font Format Extension
26.1.2 Contributions M15081: USNB Contribution: Proposed amendment to ISO/IEC 14496-22. And M15082 : The proposal for amendment of ISO/IEC 14496-22 (in support of USNB comment m15081). Decision to start a new work item following USNB request. Decision to take text of M15082 as a basis to produce CD text of new edition of the Open Font Format Specification.
M15077: Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N 9035]. Request to update reference in our specification. Accepted and integrated in the text of 2nd Edition CD.
Technical Work in Progress.
27 15938-12 MPEG-7: Query Format
27.1 General Discussion on the location of MPEG Query Format Specification. Decision to keep it as Part 12 of MPEG-7 AND to integrate conformance and reference software as Amendment of Part 12.
M15176, M1577 : Nobody to present these contributions. Postponned to next meeting. M15109 : Not enough people to progress this specification. Postponned to next meeting.
71 28 21000 MPEG-21
28.1 General M15135: MPEG-21 schema assets update. Taken as a basis to produce the related output document on Schema Assets. M15138: Multiple MPEG-21 DIA AdaptationQoS Descriptions within a Digital Item. Used to produce Items for consideration for Corrigendum or Amendment of MPEG-21 DIA.
28.2 MPEG-21 File Format Amendment 28.2.1 Topics 1. Mime Type 28.2.2 Contributions None.
29 MPEG-A MAF (23000)
29.1 23000-4 Musical Slide Show MAF 29.1.1 Topics 1. Protected Musical Slide Show MAF 29.1.2 Contributions M15212: KNB Comments on ISO/IEC 23000-4 2nd Edition FCD. See DoC. M15124: Use cases for content protection in Musical slide show Application Format 2nd Edition. Integrated in study text of 2nd Edition. M15127: Editor's study text of ISO/IEC 23000-4/PDAM1 Musical slide show application format. Use as the basis to produce Study text. Technical Work in Progress.
29.2 23000-5 Media Streaming MA 29.2.1 Topics 1. Media Streaming MAF 29.2.2 Contributions M15205: Proposed Working Draft of ISO/IEC 23000-5 2nd Edition. Proposed text for the: a. Reference Software (using Chillout) b. Conformance Testing c. Informative Annex Taken as a basis to produced output document. Decision to keep the informative annex on example Technical Work in Progress.
29.3 23000-6 Professional Archival AF 29.3.1 Topics 1. Professional AF
72 29.3.2 Contributions M15182: Updated requirements on Professional Archival Application Format. Taken as basis for producing requirements document. M15126: Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0. Approved. Take as a basis to produce CD text. M15128: Proposal for Pre-Processing Tool Location Reference in Professional Archival Application Format. And M15129 : Set of MPEG-7 Tools for Professional Archival Applications Format. Accepted for introduction in FCD.
Technical Work in Progress.
29.4 23000-7 Open Access Application Format 29.4.1 Topics 1. Open Access Application Format 29.4.2 Contributions M15168: Open Access Application Format: Reference Software. Taken as basis to produce WD. M15171: GENB comments on the Study of the MPEG-21 REL Open Access Profile FPDAM. All comments where disposed of. See DoC. M15136: Study Text of ISO/IEC FCD 23000-7 Open access application format. Used to produce FDIS text.
Technical Work Completed.
29.5 23000-8 Portable Video Player MAF 29.5.1 Topics 1. Portable Video Player MAF 29.5.2 Contributions M15141: Editor's study text of ISO/IEC 23000-8/FCD Portable video application format. Taken as input for producing study text. M15184: Proposed workplan for Portable video application format conformance. Taken as input for producing output workplan. Technical Work in Progress.
29.6 23000-9 DMB AF 29.6.1 Topics 1. DMB MAF 29.6.2 Contributions M15187: Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: Conformance and Reference software. Use as basis to produce WD. M15188: Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: timescale of TS.
Technical Work in Progress.
73 29.7 23000-10 Video Surveillance MAF 29.7.1 Topics 1. Video Surveillance MAF 1st Edition 29.7.2 Contributions M15157: Video Surveillance Application Format: Reference Software. Taken as basis for producing WD of reference software. See DoC. M15152: Study Text of ISO/IEC 23000-10/CD Video Surveillance Application Format. Taken as basis to produce text of FCD. M15115: Early UKNB comments on the Study of CD for the Video Surveillance Application Format.
Technical Work in Progress.
29.8 23000-11 Stereoscopic Video AF 29.8.1 Topics 1. Vide 29.8.2 Contributions M15203: Updated WD 23000-11 for Stereoscopic Video Application Format. Taken as input to produce text of the CD.
Technical Work in Progress.
30 MPEG-E Multimedia Middleware (23004)
30.1 Multimedia Middleware 30.1.1 Topics 1. MPEG Multimedia Middleware 30.1.2 Contributions None. Technical Work in Progress.
74 31 Requirements and Exploration
31.1 Standing Documents None.
31.2 New Proposals 31.2.1 WIM TV IPTV : Informal report from MPEG experts who contributed to IPTV Focus Group. We will continue to liaise with ITU-SG16 and provide various MPEG specifications that would be relevant to the group.
M15206: WIM TV Trial at Beijing Olympics. Use case noted and will be use as a basis to drive requirements for WIM TV.
31.2.2 Proposal for Standardization of MPEG eXtensible Middleware M15208: Taken as a basis for the production of the requirements for MXM. Decision on a timeline (cf. Requirements document).
31.2.3 Proposal for Standardization of ROSE M15083: Requirements on RoSE Framework. M15091: Requirements on Framework for RoSE. Both contributions taken as input to produce updated requirement document.
31.2.4 Proposal for Standardization of Content Deliverables for Professionally Produced Film, TV, Video M15216, M15105 : Working draft for proposed MPEG-M Production Deliverables standard & MPEG-M under the MPEG 21 Reflector.
The Systems sub-group, noting that : 1. it has generated a Post-production Deliverable (PPD) requirements document at the 82nd WG11 meeting that was not made publicly available 2. it has reviewed the results of the Ad hoc group on Requirements for MPEG Post Production Deliverable Formats (N9549) 3. it has developed a Post-production Deliverable (PPD) working draft (WD) at the 83rd WG 11 meeting 4. it has made a gap analysis between the current requirements and the PPD WD concluding that the WD provides a broad coverage of PPD requirements, but that a. Some requirement are not yet formulated in a way that allows the development of a technical solution; b. Some requirement were insufficiently documented to be able to perform the gap analysis; c. Some requirements were not fully satisfied recommends making the PPD Requirement document as well as the WD publicly available and to liaise with relevant SDOs and trade organizations that may have an interest in this activity inviting them to comment on the two documents and to join in the development of the PPD standard.
The Systems sub-group recommends promoting the proposed PPD specification to WD and progressing the PPD standard with the following time line: - CD : 2008/04
75 - FCD : 2008/10 - FDIS : 2009/04 The WD will be progressed to CD pending the successful completion of the gap analysis between the current requirements and the CD.
31.2.5 Proposal for Standardization of Interfaces with Virtual Worlds None.
31.2.6 Presentation of Structured Information M15207: Requirements for Digital Item Presentation. Taken as input for the production of the requirement for this activity.
M15142: Considerations on Integrating LASeR and DID Technologies for WIM TV. Taken as input for the production of the WD on the Presentation of Structured Information.
31.2.7 Carriage of RVC Joint meeting with Video. Review requirements on the carriage of RVC on MPEG-2 Systems. The activity will be started at the next meeting. Technical inputs are welcomed. Known open issues : Updatability of descriptors in MPEG-2 Systems.
31.2.8 Scene Partitionning Joint meeting with 3DGC. Decision to integrate the Scene Partitionning specification in Part 11 and start a committee draft in Antalya.
32 Liaison Cf. Liaison output.
76 33 Latest References and Publication Status
Pr Pt Standard No. Issue Status Doc. With Purpose ISO Award 2 1 ISO/IEC 13818-1/Amd.7 Published 2000/12 Done 2 1 ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition) 00/12 Published 2000/12 Proposed 2 1 ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.) N3844 01/01 Pisa Published 2002/03 N/A 2 1 ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor) N4404 01/12 Pattaya Published 2002/12 N/A 2 1 ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1 N5867 03/07 Published 2003/12 Proposed Trondheim 2 1 ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2) N5604 03/03 Pattaya Published 2004/03 N/A 2 1 ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2) N5771 03/07 Published XXXX Proposed Trondheim 2 1 ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP) N6847 04/10 Palma FDAM ITTF to be published N/A 2 1 ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.) N6585 04/07 FDAM ITTF to be published N/A Redmond 2 1 ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture) N6845 04/10 Palma COR ITTF to be published N/A 2 1 ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point) N7469 05/07 Poznan COR ITTF to be published N/A 2 1 ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.) N7895 06/01 COR ITTF to be published N/A Bangkok 2 1 ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition) 06/xx Published ITTF TBP 2 1 ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text) N8369 06/07 FDAM ITTF to be published TBP Klagenfurt 2 1 ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxialiry Video Data) N8798 07/01 FDAM ITTF to be published TBP Marrakech 2 1 ISO/IEC 13818-1:2006/Cor.1.2 (Reference to AVC Specification) N9365 07/10 FDAM ITTF to be published TBP Shenzhen 2 11 ISO/IEC 13818-1:2003 (IPMP on 2) N5607 03/03 Pattaya Published 2003/12 Proposed 4 1 ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.) N2501 98/10 Atl. City Published 1999/12 Done
77 4 1 ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J) N3054 99/12 Hawaii Published 2001/11 Done 4 1 ISO/IEC 14496-1/Cor.1 N3278 00/03 Published 2001/11 N/A Noordwijk. 4 1 ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.) N3850 01/01 Pisa Published 2001/11 N/A 4 1 ISO/IEC 14496-1:2001/Amd.1 (Flextime) Published 2002/10 Done 4 1 ISO/IEC 14496-1:2001/Cor.1 N4264 01/07 Sydney COR ITTF N/A 4 1 ISO/IEC 14496-1:2001/Cor.2 N5275 02/10 Shangai COR ITTF N/A 4 1 ISO/IEC 14496-1:2001/Cor.3 N6587 04/07 COR ITTF N/A Redmond 4 1 ISO/IEC 14496-1:2001/Amd.2 (Textual Format) N4698 02/03 Jeju AMD ITTF N/A Island 4 1 ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions) N5282 02/10 Published 2004-05 N/A Shanghai 4 1 ISO/IEC 14496-1:2001/Amd.4 (SL Extension) N5471 02/12 Awaji Published 2003/12 N/A 4 1 ISO/IEC 14496-1:2001/Amd.7 (AVC on 4) N5976 03/10 Published 2004-08 N/A Brisbanne 4 1 ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points) N6202 03/12 Hawaii AMD ITTF to be published N/A 4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text N/A Editing 4 1 ISO/IEC 14496-1:200x/Cor4 (Node Coding Table) N7473 05/07 Poznan PDAM ITTF to be published N/A 4 1 ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.) N5277 02/10 IS ITTF to be published Proposed Shanghai 4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text N/A Editing 4 1 ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior) N8117 06/04 COR ITTF Final Text N/A Montreux Editing 4 1 ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions) N8372 06/07 PDAM ITTF to be published N/A Klagenfurt 4 1 ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies) N8646 06/10 COR ITTF to be published N/A Hangzhou 4 1 ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems) N8860 07/01 PDAM ITTF to be published N/A Marrakech
78 4 4 ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance) N8861 07/01 PDAM ITTF to be published N/A Marrakech 4 4 ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance) N9295 07/07 PDAM ITTF to be published N/A Lausanne 4 4 ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance) N9369 07/10 PDAM ITTF to be published N/A Shenzhen 4 4 ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance) N9370 07/10 PDAM ITTF to be published N/A Shenzhen 4 4 ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance) N9372 07/10 PDAM ITTF to be published N/A Shenzhen 4 5 ISO/IEC 14496-1:200x/Amd.12 (File Format) N9020 07/04 San Jose PDAM ITTF to be published N/A 4 5 ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft) N9672 08/01 Antalya PDAM ITTF to be published N/A 4 5 ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft) N9674 08/01 Antalya PDAM ITTF to be published N/A 4 6 ISO/IEC 14496-6:2000 Published 2000/12 N/A 4 8 ISO/IEC 14496-8 (MPEG-4 on IP Framework) N4712 02/03 Jeju Published 2004-05 Proposed 4 11 ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd N6960 05/01 FDIS SC29 Final Text Proposed Edition) HongKong Editing 4 11 ISO/IEC 14496-11/Amd.1 (AFX) N5480 02/12 Awaji FDAM ITTF Integration in 1st N/A Ed. 4 11 ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics) N6205 03/12 Hawaii FDAM ITTF Integration in 1st N/A Ed. 4 11 ISO/IEC 14496-11/Cor.1 N6203 03/12 Hawaii COR SC29 N/A 4 11 ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594 04/07 COR ITTF Integration in 1st N/A Redmond Ed. 4 11 ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions N6591 04/07 FDAM ITTF Integration in 1st Proposed Redmond Ed. 4 11 ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions N6959 05/01 FDAM ITTF Integration in 1st N/A HongKong Ed. 4 11 ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition) N7230 05/04 Busan COR ITTF Final Text N/A Editing 4 11 ISO/IEC 14496-11/Cor.5 (Misc Corrigendum) N8383 06/07 COR SC29 N/A Klagenfurt
79 4 11 ISO/IEC 14496-11/Amd.5 Symbolic Music N8657 06/10 FDAM ITTF TBP Representation Hangzhou 4 11 ISO/IEC 14496-11/Cor.6 (AudioFx Correction) N9021 07/04 San Jose COR SC29 N/A 4 12 ISO/IEC 14496-12 (ISO Base Media File Format) N5295 02/10 Published 2004-02 Proposed Shanghai 4 12 ISO/IEC 14496-12/Amd.1 ISO FF Extension N6596 04/07 FDAM ITTF FDAM 04/11/30 N/A Redmond 4 12 ISO/IEC 14496-12/Cor.1 (Correction on File Type N7232 05/04 Busan COR ITTF Final Text N/A Box) Editing 4 12 ISO/IEC 14496-12/Cor.2 (Miscellanea) N7901 06/01 COR ITTF Final Text N/A Bangkok Editing 4 12 ISO/IEC 14496-12/Amd.1 (Description of timed N8659 06/10 FDAM ITTF N/A metadata) Hangzhou 4 12 ISO/IEC 14496-12/Cor.3 (Miscellanea) N9024 07/04 San Jose COR ITTF Final Text N/A Editing 4 12 ISO/IEC 14496-12/Amd.2 (Flute Hint Track) N9023 07/04 San Jose FDAM ITTF N/A 4 13 ISO/IEC 14496-13 (IPMP-X) N5284 02/10 IS ITTF to be published Proposed Shanghai 4 14 ISO/IEC 14496-14 (MP4 File Format) N5298 02/10 Published 2003-11 Proposed Shanghai 4 14 ISO/IEC 14496-14/Cor.1 (Audio P&L Indication) N7903 06/01 COR ITTF Final Text N/A Bangkok Editing 4 15 ISO/IEC 14496-15 (AVC File Format) N5780 03/07 Published 2004-04 Proposed Trondheim 4 15 ISO/IEC 14496-15/Amd.1 (Support for FREXT) N7585 05/10 Nice FDAM ITTF Final Text N/A Editing 4 15 ISO/IEC 14496-15/Cor.1 N7575 05/10 Nice COR ITTF N/A 4 15 ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction) N8387 06/07 COR ITTF N/A Klagenfurt 4 15 ISO/IEC 14496-15/Amd.2 (SVC File Format N9682 08/01 Antalya FDAM ITTF N/A Extension) 4 17 ISO/IEC 14496-17 (Streaming Text) N7479 05/07 Poznan FDAM ITTF TBP
80 4 18 ISO/IEC 14496-18 (Font Compression and Streaming) N6215 03/12 Hawaii Published 2004-07 Proposed 4 18 ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and N8664 06/10 COR ITTF N/A clarification) Hangzhou 4 19 ISO/IEC 14496-19 (Synthesized Texture Stream) N6217 03/12 Hawaii Published 2004-07 Proposed 4 20 ISO/IEC 14496-20 (LASeR) N7588 05/10 Nice FDAM Editor TBP 4 20 ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and N8666 06/10 COR ITTF N/A clarification) Hangzhou 4 20 ISO/IEC 14496-20/Amd.1 (LASeR Extension) N9029 07/04 San Jose FDAM ITTF N/A 4 20 ISO/IEC 14496-20/Cor.2 (Profile Removal) N9381 07/10 FDAM ITTF N/A Shenzhen 4 20 ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support) N9384 07/10 FDAM ITTF N/A Shenzhen 4 22 ISO/IEC 14496-22 (Open Font Format) N8395 06/07 FDAM Editor Final Text TBP Klagenfurt Editing 7 1 ISO/IEC 15938-1 (MPEG-7 Systems) N4285 01/07 Sydney Published 2002/07 Done 7 1 ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions) N6326 04/03 Munich FDAM ITTF FDAM 04/11/28 N/A 7 1 ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum) N6328 04/03 Munich COR Editor N/A 7 1 ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum) N7490 05/07 Poznan COR ITTF N/A 7 1 ISO/IEC 15938-1/Amd.2 (BiM extension) N7532 05/10 Nice FDAM ITTF N/A 7 2 ISO/IEC 15938-2 (MPEG-7 DDL) N 01/07 Sydney Published 2002/02 Done
7 7 ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance) N8672 06/10 FDAM ITTF N/A Hangzhou 21 9 ISO/IEC 21000-9 (MPEG-21 File Format) N 05/01 FDIS ITTF FDIS 05/01/21 Done HongKong 21 16 ISO/IEC 21000-16 (MPEG-21 Binary Format) N 05/04 Busan FDIS ITTF FDIS 05/04/22 TBP
21 5 ISO/IEC 21000-5 (Open Release Content Profile) N 08/01 Antalya FDAM ITTF TBP
A 1 ISO/IEC 23000-4 (Musical Slide Show MAF) N 07/04 San Jose FDIS ITTF TBP
A 1 ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF) N 07/10 FDIS ITTF TBP
81 Shenzhen A 1 ISO/IEC 23000-7 (Open Access MAF) N 08/01 Antalya FDIS ITTF TBP
B 1 ISO/IEC 23001-1 (XML Binary Format) N 05/10 Nice FDIS ITTF TBP
B 1 ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical N 06/10 COR ITTF N/A clar.) Hangzhou B 1 ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical N 07/04 San Jose COR ITTF N/A clar.) B 1 ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.) N 07/01 FDAM ITTF N/A Marrakech B 1 ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild N 07/07 PDAM ITTF to be published N/A cards) Lausanne B 2 ISO/IEC 23001-1 (Fragment Request Unit) N 07/04 San Jose FDIS ITTF TBP
B 3 ISO/IEC 23001-3 (IPMP XML Messages) N 07/04 San Jose FDIS ITTF TBP
E 1 ISO/IEC 23008-1 Architecture N 07/01 FDAM ITTF N/A Marrakech E 2 ISO/IEC 23008-2 Multimedia API N 07/01 FDAM ITTF N/A Marrakech E 3 ISO/IEC 23008-3 Component Model N 07/01 FDAM ITTF N/A Marrakech E 4 ISO/IEC 23008-4 Ressource & Quality Management N 07/01 FDAM ITTF N/A Marrakech E 5 ISO/IEC 23008-5 Component Download N 07/04 San Jose FDAM ITTF N/A
E 6 ISO/IEC 23008-6 Fault Management N 07/04 San Jose FDAM ITTF N/A
E 7 ISO/IEC 23008-7 System Integrity Management N 07/04 San Jose FDAM ITTF N/A
29116 1 ISO/IEC 29116 Media Streaming MAF Protocols N 07/10 FDAM ITTF N/A
82 Shenzhen
83 34 Resolutions of Systems
Cf. WG11 resolution.
35 Contributions Reviewed by the Systems Subgroup
N° Title Authors m1503 Ad Hoc Group on MPEG File Formats David Singer 8 Visharam Mohammed m1505 Summary of Voting on ISO/IEC 14496-5:2001/FPDAM SC 29 Secretariat 9 16 [SC 29 N 8926] m1506 Summary of Voting on ISO/IEC 14496-5:2001/FPDAM SC 29 Secretariat 0 17 [SC 29 N 8927] m1506 Liaison Statement from SC 29/WG 1 [SC 29 N 8956] SC 29 Secretariat 3 m1506 Liaison Statement from SC 29/WG 1 [SC 29 N 8957] SC 29 Secretariat 4 m1506 Summary of Voting on ISO/IEC 14496-15:2004/FPDAM SC 29 Secretariat 6 2 [SC 29 N 8961] m1507 Liaison Statement from JTC 1/SC 34/WG 2 [SC 29 N SC 29 Secretariat 7 9035] m1507 Editor Study on ISO/IEC 14496-5:2001/FPDAM 16 Pierfrancesco Bellini 9 Symbolic Music Representation reference software Paolo Nesi Giorgio Zoia Maurizio Campanai m1508 USNB Contribution: Proposed amendment to ISO/IEC Andy Tescher for the USNB 1 14496-22 m1508 The proposal for amendment of ISO/IEC 14496-22 (in Simon Daniels 2 support of USNB comment m15081) Michelle Hill Vladimir Levantovsky m1508 Requirements on RoSE Framework Sanghyun Joo 3 Bumsuk Choi Munchurl Kim m1509 Requirements on Framework for RoSE Jean Gelissen 1 Mark Verberkt m1509 Proposed re-structured ISO Base Media File Format Per Fröjdh 3 David Singer m1509 Proposed re-structured ISO Base Media File Format Per Fröjdh 3 David Singer m1510 MPEG-M under the MPEG 21 Reflector Julie Lofton 5 Jeff Steele m1510 MPEG-M under the MPEG 21 Reflector Julie Lofton 84 N° Title Authors 5 Jeff Steele m1510 Proposal of Reference Software for MPQF. Validation of Ruben Tous 9 embedded XQuery expressions. Jaime Delgado m1511 Early UKNB comments on the Study of CD for the Video James Annesley 5 Surveillance Application Format m1512 Use cases for content protection in Musical slide show Houari Sabirin 4 Application Format 2nd Edition Munchurl Kim m1512 Proposed Editorial Update for ISO/IEC 23000-6 WD 1.0 Hendry 6 Munchurl Kim m1512 Editor's study text of ISO/IEC 23000-4/PDAM1 Musical Hyouk-Jean Cha 7 slide show application format Tae Hyeon Kim Jisoo Hong m1512 Proposal for Pre-Processing Tool Location Reference in Hendry 8 Professional Archival Application Format Munchurl Kim m1512 Set of MPEG-7 Tools for Professional Archival Hendry 9 Applications Format Houari Sabirin Munchurl Kim m1513 Proposed corrections to ALC/FLUTE server file format Jani Peltotalo 3 Miska M. Hannuksela m1513 Proposed additions to ALC/FLUTE server file format Jani Peltotalo 4 Miska M. Hannuksela m1513 MPEG-21 schema assets update Christian Timmerer 5 m1513 Study Text of ISO/IEC FCD 23000-7 Open access Florian Schreiner 6 application format m1513 Multiple MPEG-21 DIA AdaptationQoS Descriptions Ingo Kofler 8 within a Digital Item Christian Timmerer Hermann Hellwagner m1514 Editor's study text of ISO/IEC 23000-8/FCD Portable Hyouk-Jean Cha 1 video application format Tae Hyeon Kim Herbert Thoma m1514 Considerations on Integrating LASeR and DID Jihun Cha 2 Technologies for WIM TV Injae Lee Young-Kwon Lim KyungAe Moon Jinwoo Hong m1514 MPEG2-TS and RTP reception hint tracks Stefan Döhla 6 Miska M. Hannuksela m1514 Extended sample grouping mechanism for the ISO Base Stefan Döhla 7 Media File Format Miska M. Hannuksela m1514 Proposed conformance files for ALC/FLUTE server file Jani Peltotalo 8 format Miska M. Hannuksela m1515 Study Text of ISO/IEC 23000-10/CD Video Surveillance Gero Bäse 2 Application Format
85 N° Title Authors m1515 Video Surveillance Application Format: Reference James Annesley 7 Software m1516 Open Access Application Format: Reference Software Florian Schreiner 8 m1517 GENB comments on the Study of the MPEG-21 REL Florian Schreiner 1 Open Access Profile FPDAM m1517 Editors' Input to ISO/IEC 14496-15/FPDAM 2 (SVC File Dave Singer 3 Format) Ye-Kui Wang Thomas Rathgen m1517 Paging function in MPEG Query Format Masanori Sano 6 Hideki Sumiyoshi Nobuyuki Yagi m1517 Interpretation Consistency for SpatialQuery and Masanori Sano 7 TemporalQuery Hideki Sumiyoshi Nobuyuki Yagi m1517 Codec-independent color information in part 12 files David Singer 8 m1517 Backwards-compatibility for alternate groups DW Singer 9 m1518 Updated requirements on Professional Archival Noboru Harada 2 Application Format Takehiro Moriya Yutaka Kamamoto m1518 Proposed workplan for Portable video application format Hyouk-Jean Cha 4 conformance m1518 Proposed text of ISO/IEC 23000-9/PDAM1 DMB AF: Hui Yong Kim 7 Conformance and Reference software Houari Sabirin Munchurl Kim m1518 Proposed text of ISO/IEC 23000-9/DCOR1 DMB AF: Hui Yong Kim 8 timescale of TS MyungSeok Ki Gun Bang m1518 Proposed WD on 14496-12 ISO-FF Amendment: MPEG- Hui Yong Kim 9 2 TS storage Gun Bang MyungSeok Ki Han-Kyu Lee Yong Han Kim m1519 Transport of GB 20090.2 video data over ITU-T Rec. Xiaozhong Xu 5 H.222.0 | ISO/IEC 13818-1 Xilin Chen Tiejun Huang m1520 Updated WD 23000-11 for Stereoscopic Video Kyuheon Kim 3 Application Format m1520 Updated WD 23000-11 for Stereoscopic Video Kyuheon Kim 3 Application Format m1520 Proposed Working Draft of ISO/IEC 23000-5 2nd Filippo Chiariglione 5 Edition m1520 WIM TV Trial at Beijing Olympics L. Chiariglione
86 N° Title Authors 6 Philip Merrill Luntian Mou Olivier Avaro Xin Wang m1520 Requirements for Digital Item Presentation L. Chiariglione 7 Olivier Avaro m1520 Requirements for MPEG eXtensible Middleware (MXM) L. Chiariglione 8 m1521 China NB Comments on Transport of GB 20090.2 video China National Body 0 data over ITU-T Rec. H.222.0 | ISO/IEC 13818-1 (CNNB) m1521 Liaison Statement from DVB [SC 29 N 9045] DVB via SC 29 Secretariat 1 m1521 KNB Comments on ISO/IEC 23000-4 2nd Edition FCD KNB 2 m1521 Working draft for proposed MPEG-M Production Julie Lofton 6 Deliverables standard Jeff Steele m1521 Working draft for proposed MPEG-M Production Julie Lofton 6 Deliverables standard Jeff Steele m1522 Late GNB on ISO/IEC 13818-1:2007/FDAM3 Thomas Shierl 0
87 Annex G – Video report
Source: Jens Ohm and Gary Sullivan, Chairs
1 MPEG-2 Support for 1080/50p/60p
More industry support was brought regarding a new level for MPEG-2 video, which would enable compatibility of decoders for 1080p 50 and 60 fps formats. It was decided to place this new level on top of the previous existing levels in Main profile, but prohibiting the use of interlace-oriented tools in bitstreams of the new level (although decoders remain required to be capable of decoding bitstreams of lower levels, which may use these tools).
PDAM texts related to the video standard and to the conformance standard were produced. Work related to support for larger formats in the 4:2:2 profile is reported to be under further study. This will not affect the timeline of the ongoing amendment work, but if necessary be handled in a later work item.
Documents reviewed: m15100 Proposal of new level to support Teruhiko Suzuki, Ajay Luthra, Yi-Jen Chiu 1080@50p/60p for MPEG-2 video
Documents approved: No. Title TBP Available 9563 Request for 13818-2:2000/Amd.3 N 08/01/18 9564 Text of ISO/IEC 13818-2:2000/PDAM 3 Level for 1080@50/60p N 08/01/28 9583 Request for 13818-4:2004/Amd.3 N 08/01/18 9618 Text of ISO/IEC 13818-4:2004/PDAM 3 Level for 1080@50/60p N 08/01/28 Conformance Testing
2 MPEG-4 Simple Studio Profile Levels 5 & 6
The amendment work to support larger formats (beyond 1920x1080) with MPEG-4 simple studio profile was started by the 82nd meeting. Some necessary modifications to the PDAM text were reported in M15097. Furthermore, work on conformance (both for the newly proposed levels, and for improvements for already existing levels) has progressed (M15099). It was decided to issue Study texts for the PDAMs based on these contributions.
Documents reviewed: m15061 Summary of Voting on ISO/IEC SC 29 Secretariat 14496-5:2001/Amd.1:2002/DCOR 1 [SC 29 N 8938] m15097 Proposal for MPEG-4 visual studio Teruhiko Suzuki, Nick Saunders, John Stone, Paul Gardiner profile level 5 and 6 Found some problems in the extension after the last meeting. First issue is macroblock number and total number of macroblocks in slice header. Another issue is wrong syntax for intra_DC in case of RGB. Put first issue in Study of PDAM, and second issue in Study of DCOR. m15099 Proposal for MPEG-4 visual studio Teruhiko Suzuki profile conformance testing Add new conformance streams for levels 2-4 into Amd.35 (Study of PDAM). Replace the entire table for StuP conformance streams, including marks that the basic functional testing streams also apply to simple StuP.
88 Documents approved: No. Title TBP Available 9565 Study Text of ISO/IEC 14496-2:2004/PDAM5 Simple Studio N 08/01/18 Profile Levels 5 and 6 9566 Study Text of ISO/IEC 14496-2:2004/DCOR3 N 08/01/18 9567 Study Text of ISO/IEC 14496-4:2004/PDAM35 Simple Studio N 08/01/18 Profile Levels 5 and 6 Conformance Testing 9570 Disposition of Comments on ISO/IEC 14496- N 08/01/18 5:2001/Amd.1:2002/DCOR 1 9571 Text of ISO/IEC 14496-5:2001/Amd.1:2002/COR 1 N 08/02/01
3 Development of AVC
The video subgroup jointly approved the ISO standard related output documents that were produced during the 26th JVT meeting which was held in parallel. Important work items in this context were as follows – SVC verification tests – Approval of MVC PDAM – Preparation of software and conformance FPDAM for Scalable Video Coding
The report of the SVC verification tests was finalized, using conditions suitable for a range of possible application scenarios for progressive video, including Video-conferencing with quality scalability for the Common Intermediate Format (CIF, 352x288 pixels) at 30 frames per second (fps) video, and spatial scalability for 640x352 pixels at 60 fps video with an enhancement substream for 1280x704 pixels at 60 fps; Mobile TV with quality scalability for the Quarter Video Graphics Array (QVGA, 320x240 pixels) format at 25 fps video, and spatial scalability for QVGA at 12.5 fps with an enhancement substream for VGA (640x480 pixels) at 25 fps enhancement; HD TV with spatial scalability for 720p (1280x720 pixels) at 50 fps with 1080p (1280x1080 pixels) at 50 fps enhancement; and Movie production with spatial scalability for 1080p at 25 fps being the highest resolution, with two lower resolutions provided for scalability.
For the performance evaluations, SVC was compared against AVC single layer coding by means of subjective testing. Subjective tests were performed following relevant international recommendations using a controlled environment and a high number of test subjects.
The results of these tests indicate that these various types of scalability for these applications can be achieved with a bit rate overhead typically equal to or less than 10% when compared to AVC single layer coding using only the highest resolution in the test case. In the HDTV and movie cases, comparable quality was achieved with no apparent need to increase bit rate at all. The bit rate savings obtained by SVC compared to AVC simulcast transmission depend on the particular test case, and were found to be between 17% and 40% of the simulcast bit rate. These bit rate savings relative to simulcast are particularly important for applications in which video must be provided with different spatial resolutions, for which simulcast would previously have been the only available AVC-based standardized solution.
All results and more detailed description of the test setup are included in the public test report (N9577).
89 MVC has reached the level of FPDAM as amendment 1 of the new edition of AVC. The specification does not include any new coding tools at the macroblock level or below (see JVT report).
A first contribution related to MVC profiling was discussed jointly with JVT and MPEG Requirements SG. The current idea is defining only on “multiview high profile”, no interlaced coding, constraint set flag could be used to perform switching between main and high. Level definitions are preliminary and will need more careful investigation about buffer sizes, restrictions in inter-view prediction etc. In general, it should be avoided to define an entirely new set of levels beyond the existing ones. One solution could be made e.g. by starting from maximum number of macroblocks per second, and derive therefrom useful values e.g. for maximum number of views etc. within a given level.
Further study will be necessary taking into account requirements of certain applications (such as stereo, n-view), and investigate for specific levels, whether e.g. the numbers of possible reference pictures are sufficient for the multiview application. Ways to enable parallel processing should also be considered.
Documents reviewed: m15193 Summary of Voting on ISO/IEC SC 29 Secretariat 14496-4:2004/PDAM 31 m15194 Summary of Voting on ISO/IEC SC 29 Secretariat 14496-5:2001/PDAM 19 m15108 Subjective results for the SVC Tobias Oelbaum ([email protected]) Verification Test m15132 Verification of new SVC Verification Mathias Wien Test Streams m15196 Proposal on Profiles for MVC (Multi- Hideaki Kimata, Hiroya Nakamura, Takashi Itoh view Video Coding)
Documents approved: No. Title TBP Available 9568 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 31 N 08/01/18 9569 Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing N 08/02/29 for Scalable Video Coding 9572 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 19 N 08/01/18 9573 Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for N 08/03/20 Scalable Video Coding 9574 Text of ISO/IEC 14496-10:200X/DCOR 1 N 08/04/11 9575 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 N 08/01/18 9576 Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video N 08/02/15 Coding 9577 Report on SVC Verification Tests Y 08/01/18 9578 Joint Multiview Video Model (JMVM) 7 N 08/02/15 9579 JMVM 7 Software N 08/02/22 9580 Overview of Multiview Video Coding (MVC) Y 08/01/18
4 MPEG-7 Visual
4.1 MPEG-7 Visual related work in Antalya The MPEG-7 breakout group was active during the whole week. Input documents related to the Visual part in 15938-3 are listed in the table below. All these documents were reviewed and discussed.
90 m15103 CE Report of VCE-5 Sangyoun Lee m15104 Text/Logo Mask Image Generation Kota Iwamoto, Ryoma Oami Software for VCE-7 m15106 Contribution of video test material Weon Genu Oh, Daeil Yoon, Jie Jia, Hae Kwang Kim for MPEG-7 video signature CE m15114 Proposal on Frame-Reduction video Ju-Kyong Jin, Weon-Geun Oh, Dong-Jin Seo, Sang-il Na, Jae-Hyun clip format Huh, Dong-Seok Jeong m15122 Errors in MPEG-7 reference James Annesley software m15130 Experiment Results of Image Weon-Geun Oh, Ayoung Cho, Won-Keun Yang, Ik-Hwan Cho, Ju- Signature for Complex Conditions Kyong Jin, Jun-Woo Lee, Dong-Seok Jeong m15131 The Extra Experiment Result to Weon-Geun Oh, Won-Keun Yang, Ayoung Cho, Dong-Seok Jeong Verify Performance Measure Method of MPEG-7 VCE-6 m15137 Cross verification result for ETRI Min-Jeong Lee, Heung-Kyu Lee VCE-6 proposal m15139 Video Signature based on Inter- [email protected] Zheng Huang, [email protected] Tiejun frame Correlation Coefficients Huang, yhtian@ @pku.edu.cn Yonghong Tian m15140 Visual Signature based on Waston [email protected], [email protected], [email protected] Perceptual Model m15169 Correction to Image Signature XM Paul Brasnett, Miroslaw Bober Software m15170 Performance Evaluation of Image Paul Brasnett, Miroslaw Bober Signature on Extended Database m15172 Extending the Trace Transform Paul Brasnett, Miroslaw Bober Image Signature to Complex Conditions m15181 Cross verification result of Image Karol Wnukowicz Signature (VCE-6) m15217 Updated Results on Extended Trace Paul Brasnett, Miroslaw Bober Transform Image Signature
On major work item has been the further review of image signature descriptors as investigated in VCE-6. A Dataset of approximately 130.000 images was used. Independence was tested on 8.45 billion image pairs. Robustness was tested on 250.000 images (24+1 different modifications). The following findings were made, comparing 15130 vs. 15217: – Average performance above 90% at 10ppm for both methods – 15130 (“concentric circles” method) performs significantly better for cropping – 15217 (extension of method in XM/WD, “trace transform” method) performs slightly/sometimes significantly better for most other cases – Cropping result is very specific, because 15130 method would fail in case of non-center crops, but 15217 has also very poor result for case of heavy cropping In general, it can be concluded that both methods require more development for the cases of the more difficult modifications. In particular, further investigations appear necessary for cases of more localized signatures (which were extracted similarly in both methods, using areas around feature points). The global descriptors appear useful only for simple conditions, but could be also used as first step in quick database search, sorting out the clearly dissimilar images.
The following decision was made: - The technology of current WD (global signature) is promoted for PDAM - For complex operation, most probably good localized descriptors would be required, which must not necessarily be derivates from the current global descriptor. Further investigations will be made in the upcoming CE, also taking into account combinations of complex conditions, such as combination of translation and cropping (no-center case), and also with scaling. necessary - The method of 15217 (global WD/XM descriptor with some more localized feature extraction) will not be put into a new version of the XM, because it may turn out in CE that other localized descriptors perform better.
For Video Signatures, it is estimated that 208 hours of video content are required, of which ~100 hours have already been collected or committed to be submitted in near future. Varied content is
91 required such as: sports, news, film, soap, variety of others. The set should consist of approx. 4.000 longer clips and 24.000 shorter clips; 50 million comparisons must be performed for the envisaged range of quality (successful hits vs. false alarm rate).
Furthermore, work on software to create video modifications automatically has been done in VCE-7. Commitment for providing the remaining data material is expected for the time between Antalya and Archamps meetings. It is therefore decided to delay the CfP by one meeting to finalize the testing database work.
Technically, two new proposals (ideas) were received and reviewed, related to video signatures. Their testing awaits completion of the full database.
The following timeline is planned for the ongoing work on video signatures: . Final CfP: 2008/04 with responses 2008/07 . PDAM: 2008/07 . FPDAM: 2009/01 . FDAM: 2009/07
Three Core Experiments will continue: – Face Recognition in IR images – Image Signature for Complex conditions – Video Signature (collection of test material and software tools for CfP preparation)
A bug report concerning visual XM was reviewed. The bug reported and solutions proposed will be verified by the XM maintenance team at Warsaw University.
The development of the reference software for ISO/IEC 23000-3 was continued. The BIM incompatibility issue was solved, new SDL’s were provided to generate bitstreams. The Software is being rewritten to reflect new SDL’s, expected completion is within 3 weeks after the Antalya meeting.
4.2 Output documents related to MPEG-7 Visual
No. Title TBP Available 15938-3 Visual 9581 Text of ISO/IEC 15938-3:2001/PDAM 3 Image Signature Tools N 08/01/28 9582 Description of Core Experiments for MPEG-7 New Visual N 08/01/18 Extensions
5 23002 MPEG-C Video Technologies
5.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)
5.1.1 Allocation of input contributions
MPEG-B related CE (Monday Afternoon January 14, 2008 2:00PM or Tuesday Morning) Doc. No. Authors Title m15113 Sinwook Lee The results of RVC CE 1.2 Jaebum Jun Review comments: compression results of CDDL are provided but there is Byeongjun Kim still no comparison with BSDL description compressed with BiM. There are Chungku Yie hints that suggests that a CDDL representation can be converted back to Euee S. Jang 92 BSDL schema.
Recommendations: continue the investigation for comparing BiM compression and to prove that conversion to BSDL schema is possible. m15125 Hyungyu Kim The results of RVC CE 1.1 Sikyung Kim Review comments: the contribution presents results of compressing XML- Myungjoong Lee based RVC DDL. The compression performances are higher for CDDL, Chungku Yie however there is no proof that applying a XML schema to BiM would not Euee S. Jang achieve better results.
Recommendations: to include into the Study document and solicit contributions showing that CDDL can perform better than BiM when using a DDL schema. m15117 Byeongjun Kim Study of Application Requirements Related to RVC Jaebum Jun Hyungyu Kim Review comments: doubts on addressing such requirements at RVC level Chungku Yie while they should be addressed at system level that support RVC. The group Euee S. Jang acknowledges that such requirements should be taken into account at both RVC and System level.
Recommendations: start some studies for RVC on IP and on MPEG-2 TS, including application scenarios. m15159 Christophe Lucarz BSDL Description of MPEG-4 SP and AVC BP Dandan Ding Jianjun Li Bitstream Syntax for RVC Framework Marco Mattavelli Review comments: the contribution describes how BSDL can be used to describe a low-level bitstream.
Recommendations: add the BSDL description to the “study of CD” and proceed with validation of the schema. Include extensions needed to describe the low-level segments of the bitstreams (VLD) into the study document. m15163 Christophe Lucarz Auto-generation of RVC Parser from BSDL Syntax Jianjun Li Marco Mattavelli Description: Variable Length Decoding Dandan Ding Review comments: the contribution proposes a systematic procedure for generating VLD decoding FUs from VLD tables
Recommendations: to include the technology as informative procedure for the generation of bitstream parsers from BSDL descriptions. m15166 Dandan Ding Function Units for Conversion from Syntax to Christophe Lucarz Marco Mattavelli Sequence of Tokens: BTYPE Lu Yu Review comments: the contribution presents FUs for Btype generation and Motion Vectors generation for the instantiation of parser from a BSDL description.
Recommendations: to complete the implementation work for MPEG-4 SP, AVC and MPEG-2 for possible inclusion in the MPEG toolbox. m15199 Hyungyu Kim Proposed text of CCR CD: A chapter for DD Sinwook Lee Byeongjun Kim transmission Chungky Yie Euee S. Jang Review comments: the contribution presents the text for the CD for inclusion of CDDL in the CD.
Recommendations: keep this co ntribution for possible inclusion of the CDDL technology in the FCD. m15200 Dandan Ding BSDL Description of AVS Bitstream Syntax for Lu Yu Honggang Qi RVC Framework Tiejun Huang 93 Wen Gao Review comments: the contribution presented the extensions needed to fully describe AVS bitstream syntax using BSDL.
Recommendations: the group recommends to check and define necessary extensions and compare them to the one of AVC for next meeting.
MPEG-C related CE (Tuesday Morning January 15, 2008 9:00AM) Doc. No. Authors Title m15080 Gwo Giun Lee Functional units of AVC inter-prediction for He-Yuan Lin Ming-Jiun Wang adaptive interlace coding
Review comments: the contribution identifies new FUs necessary to implement interlaced adaptive coding for AVC.
Recommendations: continue the work, develop textual description and CAL SW and respect the naming convention for inclusion in FCD at next meeting.
m15107 Kenji Otoi Proposed text of the RVC FUs for MPEG-2 Yoshihisa Yamada Kohtaro Asai Review comments: the contribution updates the textual description of FUs for MPEG-2
Recommendations: include update in the MPEG-C CD
m15156 Dandan Ding Update of Classification of Tokens for FUs of Marco Mattavelli Christophe Lucarz MPEG-4 SP and MPEG-4/AVC in RVC Framework Lu Yu Review comments: the contributions updates token classification of FUs included in the RSM.
Recommendations: update classification for newly submitted FU (MPEG-2) and interlaced AVC coding modes to include them in the FCD.
m15164 Christophe Lucarz Functional Units for RVC Toolbox: Variable Length Jianjun Li Marco Mattavelli Decoding Dandan Ding Review comments: the contribution presents new FUs for VLD decoding to be included in the RVC toolbox
Recommendations: to include the new FUs in the toolbox and provide the textual descriptions for the MPEG-C study of CD.
EE related (to be discussed on) Doc. No. Authors Title m15202 Honggang Qi Text Description for Bitstream Parser FU of AVS Tiejun Huang Review comments: Wen Gao Dandan Ding Lu Yu Recommendations:
General (Tuesday Afternoon January 15, 2008 at 2:00PM) Doc. No. Authors Title
m15167 M. Raulet Update of CAL2C code generation G. Roquier Review comments: M. Wipliez JF. Nezan 94 O. Deforges Recommendations:
5.1.2 Action points after contribution review Decide naming convention for VLD decoding FUs (All). Done, include definition in the Study document.
Upload BSDL schemas and corresponding bitstreams on the CVS: o MPEG-4 SP (Dandan, Christophe) 05-02-08 o AVC (Mickael) 12-02-08 o AVS (Dandan) 12-02-08
Generate MPEG-2 BSDL description (Dandan) 28-02-08 Upload on CVS BSDL and bitstreams for MPEG-2 28-02-08
SW for generation of VLD FUs o Upload the SW for generation of CAL FUs on the CVS (Dandan, Christophe) 25-01-2008
New VLD decoding FU in the toolbox: o Upload new VLD decoding FUs on the CVS with correct naming convention (Dandan, Christophe) for MPEG-4 SP 25-01-2008 o and provide textual description for study of CD part C 25-01-2008 o Generate VLD decoding FUs for MPEG-2 (Yamada, Dandan) 25-01-2008 o and provide textual descriptions for the study document of part C 02-02-08 o Generate VLD decoding FUs for AVC (Dandan, He-Yuan/Chris) 02-03-08 o and provide textual descriptions for the study document of part C 02-02-08
Non normative FUs: o Upload AVS parser on the CVS (Dandan) 20-01-08 o and provide textual description. o Generate AVS VLD FUs and upload on the CVS (Dandan) 02-02-08 o Provide textual description 02-02-08
Update token classification with all new uploaded FUs (VLD FUs, AVS parser, MPEG-2, AVC interlaced) for the study document of CD part C 15-03-8
Provide text explaining the procedure for BSDL schema validation to be included in the study of CD for part B as informative annex. (Mickael) 07-02-08
Include Copyright disclaimer to all CAL FUs in the CVS (Christophe) 15-02-08
5.1.3 RVC - Systems Joint meeting on Systems RVC support
After revision of the requirements for RVC systems support it was concluded that: 1. Most of the mechanisms needed by RVC are available in Systems technology (MPEG-2 TS, MP4 file format), but not in all transport formats (i.e. switch of a configuration at a given time in a stream is available only in MPEG-4 transport format) 2. Some other mechanisms are not directly available (i.e. change of systems parameters such as buffer size, bitrate, etc etc, …)
95 3. Activity on systems level support for RVC can start from next meeting with input contributions that address one or more of the Systems requirements approved at this meeting. 4. Commitments for input contributions to the Systems Group for next April meeting have been agreed and are reported in the workplan document.
5.1.4 Output document processing
CD Part B (Editor: Gwo Giun) Revision completed in Wednesday session
Study of CD Part B
Section 5.2 BSDL extensions for RVC + examples of BSDL schema for MPEG-4 SP, AVC, MPEG-2 (Dandan, Christophe, Mickael, Marco) 28-02-08 Annex D Non normative procedure for: – BSDL schema validation (Mickael) – the instantiation of parsers fro the ADM (Dandan, Christophe, Mickael, Marco) New section for compressed decoder description (Euee, Hyungyu) Annex D non normative procedure for instantiation of a ADM from a compressed description.
CD part C (Editor: (Gwo Giun) Yishin) – finalize
Study of CD part C (Editor: (Gwo Giun) Yishin) Include all new FUs Update Token classification
Conformance WD 4.0 (Editor: Gwo Giun) New version approved at this meeting (two weeks editing period)
Output Documents:
No. Title TBP Available 23001-4 Codec Configuration Representation 9584 Study Text of ISO/IEC CD 23001-4 Codec Configuration N 08/03/17 Representation 23002-4 Video Tool Library 9585 Reconfigurable Video Coding Requirements V 4.0 N 08/01/18 9586 Overview of Reconfigurable Video Coding (RVC) Y 08/02/02 9587 Study Text of CD ISO/IEC 23002-4 Video Tool Library N 08/03/17 9588 Extensions of Video Tool Library under consideration N 08/02/04 9589 Description of Core Experiments in RVC N 08/02/04 9590 RVC Simulation Model (RSM) V7.0 N 08/02/04 9591 RVC Work Plan and FU Development Status N 08/01/18 9592 RVC Conformance Testing Working Draft V4.0 N 08/02/04 9593 Description of Exploration Experiments in RVC N 08/01/18 9594 Methodologies for Video Toolbox Extension V2.0 N 08/03/24
96 6 Explorations – Free Viewpoint Video/Television
The exploratory work on free-viewpoint video has its roots in the “3DAV” exploration, which was originally started in December 2001, and later led in a first CfP on multiview video compression technology (current MVC development in JVT). As discussed in the previous meeting, FTV can be defined as a compressed representation and associated technologies which enable generating a large number of different views from a sparse view set. This most probably (from technologies currently known) requires implementation of depth/disparity map estimation (non-normative), definition of depth/disparity map representation/compression and interpolation/rendering method (not clear yet whether the latter should be non-normative or normative). All of these elements rely on each other, such that proper technology selection will most probably not be simple. Furthermore, higher distortion may be expected than for MVC (or at least quality may not be measurable in terms of pixel fidelity, geometric distortions may appear that might only be noticeable under certain observation conditions). The amount of distortion most probably would also depend on compactness (density of views) and complexity of the methods. Depending on concrete application, the view number to be generated may range from two for simple stereoscopic up to "many" for almost-free walk-through a scene.
In Antalya, more clarification was achieved about the focus of the next phase of the FTV work. Realistic market needs appear to be existing in supporting standardized formats for upcoming 3D (M-view) displays where the number of views M as locally generated influences the quality of visual perception. While currently numbers of approximately M=9 are used in prototypes, while for the future M of up to 40-50 could be expected. Even then, the view angle support will be relatively narrow (max. 20 degrees left-to-right), which is a clear (and implementation-wise realistic) limitation as compared to the “really free” FTV scenario. One additional advantage could be that with such narrow view alteration ranges, co-planar camera setups could still be useful. After extensive discussion, the group agreed that the name of "3D Video" is very well reflecting this subset scenario of FTV (namely, enabling technology for 3D video displays).
Related to the last meeting’s call for test sequences and depth/disparity estimation algorithms the following input contributions were brought: - 2 contributions announcing generation of new test materials (15089, 15102), both will use dense camera arrays. First proposal for 80 cameras with 5 cm baseline (convergent); the other 15 cameras with 7 cm baseline (linear/co-planar). - 4 contributions on depth estimation (15090, 15119, 15175, 15191) - 2 contributions on view generation (15090, 15120) In addition, M15101 reports corrected camera parameters for an existing sequence, M15047 and M15088 relate to more generic applications and requirements of depth map technology and FTV (no need detected to revise the apps & reqs document based on these contributions). M15088 indicated possibilities to perform skipped view encoding when higher-quality depth information is available.
Following the focus as described above, it was concluded that co-planar camera setup would be the optimum case for this kind of 3D Video applications, and test sequences should be captured according to this. Nevertheless, slight rectification would most likely be necessary even in the parallel setup, due to slight variation in camera properties and the impossibility for perfect mechanical adjustment. Nevertheless, the original shots should be as close to co-planar as possible to keep rectification artifacts to a minimum. Sequences should be provided in rectified, illumination- and color-compensated version. Following this, a new call for test sequences was produced, which also includes a high-level description of 3DTV and FTV to make the purposes clear for which the new materials should be useful. Again, it is called to provide depth maps, depth estimation and interpolation software packages. To get more evidence about the possible elements of the 3D video chain, an 97 Exploration Experiment was started to find out how depth estimation and interpolation inter- relate, based on the proposals that were brought (and for which software must be made available in this context). The results of this should bring evidence by the next meeting about how we can find out about – whether sufficiently good depth estimation algorithms are available – which level of quality can be achieved in view synthesis, and whether e.g. PSNR comparison against original views is useful – suitability of test sequences we have (and will have after next meeting) for purposes of upcoming CfP
Documents reviewed m15047 Consideration of Depth Format Taka Senoh, [email protected], [email protected], [email protected], [email protected] Reports about various versions of depth, e.g. absolute z-depth, disparity. Various versions of normalization of depth and defining depth ranges. Proponents should check relationship with definitions in 23002-3. m15088 Available Technologies for FTV Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi Suzuki Presented. m15089 Contribution of Nagoya University on Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi Suzuki, Norishige FTV Test Material Fukushima m15090 Improvement of Depth Map Masayuki Tanimoto, Toshiaki Fujii, Kazuyoshi Suzuki Estimation and View Synthesis m15098 Inter-View Skip Mode for FTV using Gang Zhu, Xiaozhong Xu, Ping Yang, Yun He Depth Information m15101 Corrected Camera Parameters for Aljoscha Smolic, Heribert Brust, Karsten Mueller, Marcus Mueller, N9468; Call for Contributions on FTV Thomas Wiegand Test Material? m15102 Progress Report on 3DTV Video Ingo Feldmann, Marcus Mueller, Frederik Zilly, Ralf Tanger, Acquisition Karsten Mueller, Aljoscha Smolic, Peter Kauff, Thomas Wiegand m15119 Segment-based Multi-view Depth Map Sang-Beom Lee, Kwan-Jung Oh Estimation for FTV m15120 Virtual View Synthesis for FTV Sang-Tae Na, Kwan-Jung Oh m15175 Depth Map Estimation Software Olgierd Stankiewicz, Krzysztof Wegner. m15191 Segment-based Disparity Estimation Gi-Miun Um, Taeone Kim, Namho Hur, Jinwoong Kim using Foreground Sepration
Output documents: No. Title TBP Available Exploration – Free Viewpoint TV Coding 9595 Call for Contributions on 3D Video Test Material (Update) Y 08/01/18 9596 Description of Exploration Experiments in 3D Video N 08/01/18
98 Annex H – JVT report
Source JVT Management Team (Gary J. Sullivan, Jens-Rainer Ohm, Thomas Wiegand, and Ajay Luthra)
Abstract The Joint Video Team (JVT) of ITU-T Q.6/16 and ISO/IEC JTC 1/SC 29/WG 11 held its 26th meeting during 13-18 January, 2008 at the Divan Hotel in Antalya, Turkey. The JVT meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and Dr. Jens-Rainer Ohm (RWTH Aachen/Germany), and under the associate chairmanship of Dr. Thomas Wiegand (Fraunhofer HHI/Germany) and Dr. Ajay Luthra (Motorola/USA). The JVT meetings opened at approximately 2:30 p.m. on Sunday 13 January 2008 and closed at approximately 11:45 a.m. on Friday 18 January 2008. Approximately 124 people attended the JVT meetings and approximately 40 input documents were discussed. The meetings took place in a co-located fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent bodies of the JVT. The subject matter of the JVT meeting activities consisted of work on video coding.
1 Documents of the JVT meeting
1.1 Input documents
1.1.1 Administrative input contributions JVT-Z000 (Admin) List of documents of Antalya meeting JVT-Z001-M (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata JVT-Z002 (Admin) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, conf JVT-Z003 (Admin) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, conf JVT-Z004 (Admin) [A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma format JVT-Z005 (Admin) [J. Ridge, M. Karczewicz] AHG Report: FGS applications and design simplification JVT-Z006 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffer management JVT-Z007 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & software JVT-Z008 (Admin) [P. Pandit, H. Kimata, S. Cho, K. Muller] AHG Report: MVC RRU and mixed-resolution view coding JVT-Z009 (Admin) [P. Pandit, H. S. Koo] AHG Report: MVC JMVM coding tools
1.1.2 Input liaison statements and parent-body inputs The following WG 11 parent-body input contributions were noted:
M14863 JNB comment on 1080p50/60 MPEG-2/H.262 M14869 Technical proposal on 1080p50/60 MPEG-2/H.262 M15108: Subjective results for the SVC verification test M15132: Verification of new SVC verification test streams 99 M15209: Liaison response from SMPTE to sc29n8883 Liaison from JVT on potential extension of SVC M15215: Liaison response from DVD Forum regarding progress of video coding work
1.1.3 Non-administrative input contributions JVT-Z020 ( Prop 2.2/3.1) [P. L. Lai (USC), P. Pandit, P. Yin, C. Gomila (Thomson)] CE2: Adaptive reference filtering for MVC JVT-Z021 ( Prop 2.2) [H. Yang, Y. Chang, J. Huo (Xidian Univ.), S. Lin, S. Gao, L. Xiong (Huawei)] CE1: Fine motion matching for motion skip mode in MVC JVT-Z022 / M15185 ( Prop 2.2/3.1) [S. Sekiguchi, K.Otoi, Y. Yamada, K. Asai, T. Murakami (MEI)] 4:4:4 video coding perf with adaptive MV coding JVT-Z023 ( Prop 2.2) [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic video coding using residual downsampling JVT-Z024 ( Info) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design JVT-Z025 ( Errata 2.0/3.1) [Y.-K. Wang, M. M. Hannuksela (Nokia)] SVC corrigendum items JVT-Z026 ( Prop 2.2.1/3.1) [Y. Chen (TUT), Y.-K. Wang (USTC), S. Liu, M. M. Hannuksela (Nokia), H. Li (Nokia)] On asymmetric MVC JVT-Z027 ( Prop 2.2/3.1) [H. Nakamura, M. Ueda (JVC)] Comments on SPS MVC extension JVT-Z028 ( Prop Profiles) [B.-M. Jeon (LG), W. S. Shim (Samsung), S. Cho (ETRI), G. H. Park (Kyung Hee U.), P. Pandit (Thomson), Y.-L. Lee (Sejong U.)] About MVC coding tools JVT-Z029 ( Prop 2.2/3.1) [G. Zhu, X. Xu, P. Yang, Y. He (Tsinghua U.), J. Zheng, X. Zheng (Hisilicon)] MVC inter-view skip mode using depth information JVT-Z030 ( Prop 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip mode JVT-Z031 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual pred JVT-Z032 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flag JVT-Z033 ( Info) [Y. Chen (TUT), Y.-K. Wang (Nokia), S. Liu (USTC), M. M. Hannuksela, H. Li (Nokia)] CE1: Information on motion skip and CE 1 JVT-Z034 ( Prop 2.2) [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test results for mixed resolution stereo video coding JVT-Z035 / M15102 ( Info) [I. Feldmann, M. Mueller, F. Zilly, R. Tanger, K. Mueller, A. Smolic, P. Kauff, T. Wiegand (HHI)] Progress report on 3DTV video acquisition JVT-Z036 ( Prop Reqs) [A. Segall (Sharp)] On the requirements for bit-depth and chroma format scalability JVT-Z037-V ( Info) [Y. Su, A. Segall (Sharp)] Verif of JVT-Z021: Fine motion matching for motion skip mode in MVC (CE1) JVT-Z038 ( SEI Prop 2.0/3.1) [S. Yea, A. Vetro (MERL), A. Smolic, H. Brust (HHI)] Revised syntax for SEI message on multiview acquisition information JVT-Z039 ( Info) [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalable coding JVT-Z040 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to convey suitable splice points in the bitstream JVT-Z041 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to control DPB output in non-seamless spliced bitstreams with end_of_stream JVT-Z042 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to forewarn location of end_of_stream JVT-Z043 ( Errata) [H. Schwarz (HHI)] SVC errata
100 1.1.4 Late-registered input contributions, BoG reports, etc.
JVT-Z044-L (Late Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l" JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter JVT-Z046-QV (Late Verif) [J.-Z. Xu (Microsoft)] Verif JVT-Z029 JVT-Z047-Q / M15196 (Late Prop 2.0/3.1) Proposal on Profiles for MVC (Multi-view Video Coding) JVT-Z048-QV (Late Verif) [H. Yang (Xidian Univ.)] Verif JVT-Z029 JVT-Z049-B (BoG Report) [A. Vetro (MERL)] BoG report on MVC profiles
1.2 Late document availability Non-administrative documents with document numbers suffixed in this report with "-L", "-Q", or "-M" were classified as late. Such documents will only be considered as information documents only (unless agreed otherwise by the group) if time permits, and consideration of them may be shifted to the end of the meeting as determined appropriate by the group.
For some time now, the JVT has agreed that no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution would be presented without having a minimum of 4 JVT participants (from different other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT- authorized side activities).
JVT decision: Agreed.
A check mark () indicates a contribution considered to be available on time.
The suffixes for contributions not marked as “” are explained below: – "-L" indicates a non-administrative contribution that was somewhat late but was available by the second meeting day (JVT-Z044 was in this category at this meeting). – "-Q" were more late than that (JVT-Z045 through JVT-Z048 were in this category at this meeting – two of which were verification documents). – "-M" were still missing at the time of preparation of this report. – "-B" were break-out group discussion reports and other input requested during the meeting
Further suffixing by “V” indicates a contribution that contains a cross-verification of a proposal.
Three contributions were subject to lateness penalties as follows: JVT-Z044-L (Late Prop/Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l" JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter JVT-Z047-Q / M15196 (Late Prop 2.0/3.1) [H. Kimata (NTT), H. Nakamura (JVC), T. Itoh (Fujitsu), T. Nomura (Sharp)] Proposal on Profiles for MVC (Multi-view Video Coding)
101 Notes on the apologies and named participant support for these contributions are included in the sections of this report that discuss each of these documents.
There were no objections to presentations of late documents at this meeting.
It was noted that the situation surrounding the need for on-time availability of contributions has substantially improved since our lateness penalty rules were adopted.
1.3 Withdrawn document registrations None.
1.4 Major output documents Major output documents submitted to parent-body review included the following. (Dates listed are planned dates of availability.)
1.4.1.1.1 JVT-Z200 Meeting report of the 26th JVT meeting (this document)
1.4.1.1.2 JVT-Z205-M (WG 11 N9569) Draft conformance testing for SVC (V. Bottreau) [2008-02-29] (Conveyed to WG 11 as "Text of ISO/IEC 14496-4:2004/FPDAM 31 Conformance Testing for Scalable Video Coding".)
1.4.1.1.3 JVT-Z207 (WG 11 N9578 Joint multi-view video model (JMVM) 7 text [2008- 02-15]
1.4.1.1.4 JVT-Z208 (WG 11 N9579) JMVM 7 software [2008-02-22]
1.4.1.1.5 JVT-Z209 (WG 11 N9576) Joint draft multi-view video coding (MVC) [2008- 02-15] (Conveyed to WG 11 as "Text of ISO/IEC 14496-10:200X/FPDAM 1 Multiview Video Coding".)
1.4.1.1.6 JVT-Z210-M (WG 11 N9574) ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding defect report (G. Sullivan) [2008-04-11] (Conveyed to WG 11 as "Text of ISO/IEC 14496-10:200X/DCOR 1".)
1.4.1.1.7 JVT-Y211-M (WG 11 N9573) Draft reference software for SVC [2008-03-20] (Conveyed to WG 11 as "Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding".)
1.5 JVT internal output documents JVT internal output documents included the following. (Dates listed are planned dates of availability.)
1.5.1.1.1 JVT-Z202-M Joint scalable video model (JSVM) text
1.5.1.1.2 JVT-Z203-M JSVM software
102 2 JVT administrative and liaison topics
2.1 IPR policy reminder and update Participants were reminded of the IPR policy established by the parent organizations of the JVT and were referred to the parent body web sites for further information. The IPR policy was summarized for the participants.
Participants were particularly reminded of the need to supply a completed JVT IPR status reporting form in all technical proposals for normative standardization. Participants were also reminded of the need to formally report patent rights to the top-level parent bodies (using the common reporting form found on the database listed below) and to make verbal and/or document IPR reports within the JVT as necessary in the event that they are aware of unreported patents that are essential to implementation of a standard or of a draft standard under development.
The JVT chair noted that the top-level parent bodies have agreed upon a common patent policy for ITU-T, ITU-R, ISO, and IEC.
Some relevant links for organizational and IPR policy information are provided below: – http://www.itu.int/ITU-T/ipr/index.html (new common patent policy for ITU-T, ITU-R, ISO, IEC and guidelines and forms for formal reporting to the parent bodies) – http://ftp3.itu.int/av-arch/jvt-site (JVT contribution template for each meeting) – http://www.itu.int/ITU-T/studygroups/com16/jvt/index.html (JVT founding charter) – http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database) – http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures)
The JVT chair noted that the ITU TSB director's AHG on IPR had recently issued a clarification of the IPR reporting process for ITU-T standards, as follows (and as previously sent to the JVT email reflector), per SG 16 TD 327 (GEN/16):
“TSB has reported to the TSB Director’s IPR Ad Hoc Group that they are receiving Patent Statement and Licensing Declaration forms regarding technology submitted in Contributions that may not yet be incorporated in a draft new or revised Recommendation. The IPR Ad Hoc Group observes that, while disclosure of patent information is strongly encouraged as early as possible, the premature submission of Patent Statement and Licensing Declaration forms is not an appropriate tool for such purpose.
In cases where a contributor wishes to disclose patents related to technology in Contributions, this can be done in the Contributions themselves, or informed verbally or otherwise in written form to the technical group (e.g. a Rapporteur’s group), disclosure which should then be duly noted in the meeting report for future reference and record keeping.
It should be noted that the TSB may not be able to meaningfully classify Patent Statement and Licensing Declaration forms for technology in Contributions, since sometimes there are no means to identify the exact work item to which the disclosure applies, or there is no way to ascertain whether the proposal in a Contribution would be adopted into a draft Recommendation.
Therefore, patent holders should submit the Patent Statement and Licensing Declaration form at the time the patent holder believes that the patent is essential to the implementation of a draft or approved Recommendation.”
103 The JVT chair noted (as also previously remarked on the JVT email reflector) that since we are at the completion of the MVC amendment project, it was suggested that if anyone needs to report IPR on that topic and has not yet done so, now would be a good time to file formal notices to the parent bodies for any patent rights that are believed to be essential to the implementation of the MVC extensions (not to mention any notices not previously filed relating to the new SVC profiles, AVC professional profiles, or other previous projects).
It is suggested that, to enable proper interpretation of such formal notices, the MVC amendment should be clearly identified in such formal notices. For example, as “ITU-T Rec. H.264 and ISO/IEC 14496-10 Advanced video coding (2007 Ed.) Amendment 1 (2008): Multiview video coding”. Notices pertaining to other efforts should be made with a similar degree of clarity of identification of the specific standardization work item to which the declaration pertains.
The chair invited participants to make any necessary verbal reports of previously-unreported IPR in draft standards under preparation and opened the floor for such reports: No such verbal reports were made.
2.2 Meeting opening and remarks by the chairmen The meeting was opened at approximately 2:30 p.m. on Sunday 13 January 2008.
At the opening session of the meeting, the JVT chairs reminded participants of the relevant IPR policy as described above, and reviewed the status and plans for the major projects under way in the JVT. The largest area of activity consisted of multi-view video coding (MVC) extensions of the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding (AVC) standard. SVC work was categorized as "phase 1" or "phase 2", depending on whether the work related to the recently-designed initial SVC amendment or to a potential future further SVC extension.
Documents were made available for download at http://ftp3.itu.int/av-arch/jvt- site/2008_01_Antalya.
The deadline was Tuesday January 8th 2008 for registrations and uploads.
Initially-missing non-administrative documents included the late-registered document JVT-Z045, which was registered verbally during the opening of the meeting. . Document JVT-Z044 had also been registered and uploaded late, but was available approximately 2 days prior to the opening of the meeting. Documents with numbers higher than that of JVT-Z045 were registered after the opening session of the meeting.
Meeting information could be found at http://www.sunflowerconferences.com/mpeg83/.
A document template had been made available at http://ftp3.itu.int/av-arch/jvt-site/JVT-Zxxx.dot. It contained important instructions and policy information. Participants had been encouraged to read it and use it as the basis of their contributions.
Opening remarks: – IPR policy reminder – Professional profiles – follow up work on reference software and conformance – Scalable video coding (SVC) phase I – follow up work on reference software and conformance and collaboration with MPEG was needed on verification testing – SVC phase II – work areas included investigation of bit depth, color gamut, and chroma format scalability and fine-granularity scalability
104 – Multiview video coding (MVC) was a major project underway, and constituted the topic of most contributions to the meeting – Corrigendum work is needed, and was a major priority for this meeting
Further work and additional needs on the development, standardization, and maintenance of the base specification and the recently-completed SVC and professional profiles, and of associated reference software and conformance specifications was noted. Needs for verification testing to be conducted by the WG 11 parent body were noted and discussed.
The incoming status of work on errata aspects of the AVC specification was as found in JVT- Y210, which was delivered (during the Antalya meeting) as an output from the previous meeting. Other inputs on errata consisted of JVT-Z025, JVT-Z043, and JVT-Z044.
The chair remarked that there were few late document uploads this time, and that the submitted documents seem to be generally adhering to the JVT guidelines in terms of formatting, filenames, etc., which is a good development, although further improvement (particularly in the formatting conventions) is still needed. The JVT operating rules on that subject have helped.
2.3 JVT communication practices JVT documents were available at http://ftp3.itu.int/av-arch/jvt-site.
These can also be accessed via ftp with the site name ftp3.itu.int, user ID avguest and password Avguest. Upon login, documents will then be found in the directory "jvt-site". Uploading of contributions is done by upload via ftp protocol to the "jvt-site/dropbox" directory using this account ID and password.
JVT email lists are managed through the site http://mailman.rwth-aachen.de/mailman/options/jvt- xyz, and to send email to one of these reflectors, the email address is "[email protected] aachen.de", where "xyz" corresponds to – "experts" for general experts group discussions – "bitstream" for bitstream exchange activities – "svc" for SVC work – "mvc" for MVC work
2.4 Scheduling and logistics notes Some parallel sessions were held during the meeting, particularly including some parallel review of MVC and SVC contributions. Some “break-out group” (BoG) side activities and informal study efforts were also conducted. Documents produced by break-out group activities (if any) are listed in this report with the abbreviation “BoG” and are suffixed with "-B".
A contribution template JVT-Zxxx.dot for the JVT meeting was made available on the JVT ftp site: http://ftp3.itu.int/av-arch/jvt-site/2008_01_Antalya. It contained essential information for JVT participants. Participants had been instructed to read it carefully, particularly if they planned to be submitting contributions to the meeting.
The document registration and upload deadline was Tuesday 8 January 2008 (the Tuesday preceding the meeting).
Note that the JVT has agreed that no late-uploaded (non-AHG-report, non-liaison, non- verification) contribution will be presented without having a minimum of 4 non-affiliated JVT
105 participants from different organizations recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Additionally, the provider of a presented late contribution must send an email apology to the JVT email reflector.
2.5 Administrative documents
2.5.1.1.1 JVT-Z000 (Admin) List of documents of Antalya meeting As listed herein.
2.5.1.1.2 JVT-Z001-M (Admin) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata General project status was reported verbally as described above (see opening remarks).
The latest version of the meeting report of the Shenzhen meeting (marked as draft 5) had been made available on 4 December 2007 (approximately 6 weeks prior to the meeting).
On errata: The JVT-Y210 output of the previous meeting was produced and made available during the current meeting. Other inputs on errata consisted of JVT-Z025, JVT-Z043, and JVT- Z044.
2.5.1.1.3 JVT-Z002 (Admin) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, conformance This document described the activities of the JM text, reference software and bitstream conformance ad hoc group since the last JVT meeting.
On JM Reference Text: There was no activity to report on the JM reference text.
On JM reference software: The professional profile integration has been finished. Several bugs have been found and been fixed. The bug fixing activity is ongoing.
Software releases JM 13.1 and JM 13.2 have been issued.
The following issues were reported to be the most important (volunteers needed):
As the official H.264/AVC reference software, the JM should be a correct source for checking implementations. This means the decoder should be able to decode all valid H.264/AVC bitstreams and the encoder should never create invalid bitstreams. This is currently not the case.
Depending on the configuration the JM encoder can create invalid bitstreams: – Level constraints are not properly checked – The 16-bit transform requirement is not checked – In Baseline/Main/Extended profile the restriction of CAVLC syntax elements needs proper handling
The software coordinators would like to encourage all H.264/AVC experts to volunteer for fixing these issues. 106 Known Issues / Reporting bugs: A web based bug tracking system has been set up for keeping track of known issues and missing features. The system is publicly accessible but requires registration for entering bug reports.
The system is located at http://ipbt.hhi.de.
This internet site contains also some usage instructions.
Please note that the bug tracking system is using encrypted/secure http (https) for protecting the user’s login. The used certificate is self signed and has to be imported into the user’s web browser. The SHA-1 fingerprint of the certificate is 69:21:86:d9:3e:72:da:3f:e8:30:df:a8:dd:fa:a5:4c:ed:85:b5:09.
A list of known issues and their state can be found at: https://ipbt.hhi.de/mantis/view_all_bug_page.php.
As an annex to the AHG report, a list of the 34 issues identified as active in the bug tracker system (as of 2008-01-14) was provided.
It was requested that certain rules should be followed before reporting any new bugs: – The database should be searched on whether the same issue was previously reported. If the problem was reported before, but there is additional information, then this information should be added to the original report. – It should be specified if the problem is related to the encoder, decoder or both. – The version of the software used should be specified. – Description of the problem should be as precise as possible. – The necessary steps to reproduce the problem should be described in detail. – If available, the configuration files or/and command line syntax used to run the software should be provided. – The language of the standard should be used when referencing the text description. – After filing the report, the user should check if he/she is requested to provide additional or other information relating to this issue.
Bitstream Exchange Activities: Communications related to bitstream exchange activity have taken place on the bitstream exchange reflector ([email protected]). However this topic was not so active since the last JVT meeting.
The FTP area for downloading bitstream files is on the main JVT Experts FTP site: ftp://ftp3.itu.int/jvt-site/bitstream_exchange/
The bitstreams can alternatively be accessed from the following http site. http://ftp3.itu.int/av-arch/jvt-site/bitstream_exchange/
To volunteer a bitstream for testing, contributors are requested to include it in a zip archive along with related files (trace files, configuration, reconstructed frames) in a zip archive and upload it to the dropbox: ftp://ftp3.itu.int/jvt-site/dropbox using the user ID "avguest" and password "Avguest".
107 In general, the following naming convention is being followed for the bitstreams in the exchange: FeatureCode_Source_VersionLetter Please refer to the spreadsheet and files on the FTP site for examples.
Once a bitstream is uploaded to the dropbox, e-mail should be sent to [email protected], and/or the bitstream exchange reflector and it will be made available in the bitstream_exchange directory.
To sign up for the bitstream exchange reflector, use the web address given below. http://mailman.rwth-aachen.de/mailman/listinfo/jvt-bitstream.
No new bitstreams for non-"professional" profiles had been exchanged since the last meeting.
Conformance bitstreams for professional profiles: New conformance bitstreams for High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra, CAVLC 4:4:4 Intra, and High 4:4:4 profile were generated and available at JVT ftp site. Additional bitstreams are available for lossless coding.
However, it was found that the JM software (JM13.0 or later) crashed when decoding some conformance bitstreams. The volunteers investigated the problem and found it seems to be a problem in the JM software. The following problems were found: 1) 4:2:0 10-bit intra-only: no encoder/decoder match both for luma and chroma 2) 4:2:2 10-bit intra-only: decoder crashes 3) DC quantization for 4:2:2
It was confirmed that problem 1 above was fixed in JM13.1, and that problem 3 should be fixed by the next JM release.
Regarding the conformance bitstreams for 4:2:0 8 bit profiles: There was a report that the conformance stream cama1_vtc_c seems to be missing zero_bytes. However this report was not checked yet. This should be investigated further.
The AHG recommended to continue to collect more conformance bitstreams
2.5.1.1.4 JVT-Z003 (Admin) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, conf This document presented the report of the AhG on JSVM text, JSVM software, and SVC conformance.
The text of Joint Scalable Video Model (JSVM-12) was submitted as JVT-Y202. The text of the Joint Scalable Video Model wasn't modified relative to JVT-X202 (JSVM-11).
The JSVM software was submitted as JVT-Y203 and JVT-Y211. It corresponds to CVS tag "JSVM_9_10_DEVEL2". The reported status is summarized in the following.
The following changes had been implemented relative to JVT-X203: – correction of position calculation for inter-layer intra and residual prediction (including recent changes according to JVT-X201) – correction of subset SPS (syntax and usage) – correction of prefix NAL unit syntax – correction of slice header syntax – correction of IDR support
108 – correction of SVC profile identifications – correction of SEI message identifiers – several bug fixes
Furthermore, an effort had been started to improve the decoder implementation (major rewrite) and remove unnecessary code (e.g. tools that are not supported in the standard) from the decoder implementation. – removal of RCDO – removal of 4-tap upsampling filters – removal of FGS (remaining code parts) – removal of fragmented NAL units – removal of additional "base layer decoder" – clean up of processing order in decoder – clean processing of access units in decoder – general improvement, simplifications of main decoder classes
The clean-up of the decoder implementation had not been finished yet. Some of the tools that were implemented in prior versions of the JSVM software were reported to not be supported: – Temporal direct mode: Currently not supported, so that most of the AVC conformance bitstreams cannot be decoded (in SVC, the temporal direct mode is not supported). – Error detection and error concealment: The error detection and error concealment code had been temporarily removed, so that all validation scripts that simulate packet losses fail. – The previously implemented code assumed a fixed GOP structure (by analyzing the parameters of the first two access units) and only worked for 2 layers. It was reported that it should be tried to implement the error detection and concealment in a more general way. – Support of interlaced coding tools: Bugs related to interlaced coding tools need to be fixed. – Support of multiple slice groups: Bugs related to multiple slice groups need to be fixed (the bugs already existed in last version of the JSVM software).
The following tools were reported to need to be implemented or fixed in order to align the software to the text (JVT-X201): – fixing implementation of multiple slice groups (and IROI) – fixing implementation of interlaced tools – implementation of new loop filter modes (two filter passes, JVT-W063r1) – correction of position calculation for inter-layer prediction of coding modes and motion vectors – double check SEI syntax – order of redundant pictures in bitstream – re-implementing temporal direct mode (low priority – not required for SVC bit streams) – re-implementing error concealment and detection (lower priority – long term issue) – base layer rate control (JVT-W043, non-normative)
Additional fixes of which the software coordinators are currently not aware reportedly might also be required to align the JSVM software to the text.
In order to keep track of the changes in software development and to always provide an up-to- date version of the JSVM software, a CVS server for the JSVM software has been set up at the Rheinisch-Westfälische Technische Hochschule (RWTH) Aachen. The CVS server can be accessed using WinCVS or any other CVS client. The server is configured to allow read access only using the parameters specified below. Write access to the JSVM software server is restricted to the JSVM software coordinators group.
109 – authentication: pserver – host address: garcon.ient.rwth-aachen.de – path: /cvs/jvt – user name: jvtuser – password: jvt.Amd.2 – module name: jsvm or jsvm_red
The following example shows how the JSVM software can be accessed by using a command line CVS client. cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt login cvs –d :pserver:[email protected]:/cvs/jvt checkout jsvm
In the following example, it is shown how a specific JSVM software version – specified by a tag (JSVM_9_8 in the last example above) – can be obtained using a command line CVS client. Note that "co" represents an abbreviation for the command checkout, which was used in the example above. cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt login cvs –d :pserver:[email protected]:/cvs/jvt co –r JSVM_9_8 jsvm
It is also possible to check out only a reduced JSVM software package by using the module name jsvm_red instead of jsvm. In this case, the directories JSVM0-config-sample and MVC-Configs are ommitted in the checkout, as shown below. cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt login cvs –d :pserver:[email protected]:/cvs/jvt co jsvm_red
The CVS repository includes a JSVM software manual, which provides further information on the JSVM software.
The text of the conformance test specification document "Draft conformance testing for SVC" had been submitted as JVT-Y205.
A first SVC related errata list had been submitted as JVT-Z043.
The editors and software coordinators were thanked for their excellent and diligent work.
2.5.1.1.5 JVT-Z004 (Admin) [A. Segall, T. Wiegand] AHG Report: SVC bit depth and chroma format The AhG was established at the Shenzhen meeting to study bit-depth and chroma format scalability. The mandates of the AhG were: – Identify applications – Work out suggestions for detailed needs – Find/create test material – Study bit-depth reduction techniques, e.g., tone-mapping tools – Study color space and/or gamma conversion requirements – Study combined spatial and bit depth scalability – Define experiments and test conditions – Investigate software and text modification needs – Identify complexity issues
The AhG sent a kick-off message to the JVT main reflector ([email protected]) on December 7, 2007. The message contained [BDS] in the subject line. 110 Related contributions JVT-Z036 [A. Segall (Sharp)] On the requirements for bit-depth and chroma format scalability This document discusses the requirements for bit-depth scalability within the context of consumer applications. Current trends in display technology are the focus, and it is asserted that these trends motivate the need for higher bit-depth formats within consumer devices. Thus, it is proposed that development of any bit-depth scalable system should consider these applications.
JVT-Z039 [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalability This document considers a new application scenario for bit-depth scalable coding in which receiver-side editing of a high dynamic range video is desired. Requirements for bit-depth scalable coding are described and preliminary results that aim to demonstrate the benefits of higher-bit depth video at the receiver are shown.
JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter (registered after the AHG report was written) This document describes research work on bit depth SVC. By applying a filter to the reconstructed image from the lower layer, an average of 4.4% BDBR, or an average of 0.15 dB BDPSNR, can reportedly be achieved at the 10 bit top layer for "Viper" sequences. Higher gain is reportedly seen on input sequences with normal lighting conditions. (One sequence had a reported benefit exceeding 10%.)
Two liaison statements related to bit-depth and chroma format scalability had been sent to our MPEG parent body: M15209 from SMPTE and M15215 from DVD Forum.
The AhG recommended – To review related contributions during the meeting – To continue the study of bit-depth and chroma format scalability – To continue evaluating test material
2.5.1.1.6 JVT-Z005 (Admin) [J. Ridge, M. Karczewicz] AHG Report: FGS applications and design simplification At the Shenzhen meeting, the JVT established the FGS applications and design simplification AHG activity with the following mandates: – Identify applications that may require FGS functionality and their characteristics. – Determine to what extent new coding tools are needed to achieve the functionality. – Define experiments and test conditions relating to FGS technology. – Coordinate with JSVM software effort to align JSVM software with current design. – Explore simplification of FGS tool design.
No contributions were submitted to this meeting on the subject matter of this AHG.
The main questions asked during the last couple of meetings did not relate to AR-FGS design but rather to its applicability, specifically regarding: – Importance of conversational applications for mobile devices. – Influence of high variations of bits per frame on delay in OFDMA based networks.
The AHG report suggested that the best forums to answer these questions are 3GPP and 3GPP2. Hence it was suggested to consider sending liaison letters to these forums requesting their comments. Alternatively since some of JVT participants are also active members of 3GPP/PP2 and it was believed that SVC issues would be discussed in the upcoming 3GPP and 3GPP2
111 meetings, such that we may use the results of those discussions as input to decide on needs for future work in this area.
2.5.1.1.7 JVT-Z006 (Admin) [A. Vetro, P. Pandit] AHG Report: MVC high-level syntax & buffer management At the Shenzhen meeting, the JVT established the AhG on MVC high-level syntax & buffering, with the following mandates: – Discuss high-level syntax for MVC including NAL unit type, NAL unit header extension, SPS extensions, slice layer, and integration with SVC syntax. – Discuss reference picture management to enable simultaneous picture output of different views and to facilitate parallel processing. – Discuss issues related to HRD. – Propose refined syntax and decoding processes for JMVM.
The editors were reported to have made several improvements to the JD text related to high level syntax.
A contribution related to the decoding process and HRD for MVC can be found in JVT-Z024 [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design
The AhG on MVC high-level syntax & buffering recommended discussing the issues related to HRD and making any necessary revisions to the MVC text.
2.5.1.1.8 JVT-Z007 (Admin) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & software At the Shenzhen meeting, the JVT established the AhG on JMVM and JD text editing and software, with the following mandates: – Collect comments on the draft, perform necessary editing, and upload the final document by the deadline. – Maintain JMVM and JD document and collect comments on the text until the next meeting. – Coordinate JMVM software integration – Coordinate the bug-fixing process for the JMVM software – Maintain the JMVM software manual
The JMVM6 and JD5 were submitted to the JVT as JVT-Y207 and JVT-Y209, respectively. The JD text included a minor syntax change regarding view dependency as described in JVT-Y061. The JMVM included the following updates: – JVT-Y033: IC bug fix – JVT-Y058: motion skip bug fixes – JVT-Y042/Y053: single loop decoding
Several other editorial improvements and clarifications were reported to have also been made to the JD and JMVM text, including: – Better alignment with the latest SVC specification – Clean up the decoding process with regard to IC (intensity compensation – including the adopted fix in JVT-Y033)
112 Some minor editorial revisions to the JMVM were reported to have been received regarding the IC tool since the final document was uploaded and were requested to be considered as editor’s input to the meeting (provided as an attachment to the ad hoc report).
The JMVM 6 software was delivered to the group on November 30th, 2007. This release contained the addition of a motion skip flag in slice header, simplification B- and P-Skip modes on illumination compensation, sending multiple GDVs (global disparity vectors) in the case of multiple inter-view references, SPS (sequence parameter set) simplification, single loop decoding, and some software improvements.
Some software issues that still need to be addressed were reported as: – Fix software for compile errors for gcc 3.4+ version. – Remove all compilation warnings. – Output order of views is not sequential or parallel (it is on an as ready basis). – Prepare validation scripts (work in progress). – Provide support GOPsize=1 (with motion skip & IC). – Trace file support for arbitrary view_id assignments.
The AhG on JMVM and JD text editing recommended: – To consider the editor’s input (provided as an attachment to the ad hoc report) in preparing future versions of the JMVM and JD. – To discuss the issues in the current version of the software as mentioned above. – To improve the manual created for the JMVM software. – To follow the same software integration guidelines present in JSVM (repeated below).
In order to improve the whole software integration process, the software integration guidelines and rules are as follows: – The integrated software shall compile without warnings when using the provided VC6 and, VS .NET workspaces, as well as Linux makefiles. – Do not use variable declarations inside the header of for-loops (as the scope for for-loops is not correctly supported with all compilers). – Follow the coding style of the JMVM software. Use 2 (two) spaces for indentation, no tabs. – Re-use code and integrate functionality as possible. Try to avoid redundant code. – Do not change the meaning of existing input parameters, but rather define new ones if necessary (and applicable). – Make sure that new parameters have meaningful default values. Tools should not be switched on by default (if not decided different by the JVT). – Do not re-structure the output of the compiled binaries (if not decided different by the JVT). – Please change the JMVM version number macro (i.e. "_JMVM_VERSION_":) located in the file "CommonDefs.h" to be inline with your integration tag.
Reference to CVS repository [CVS] host address: garcon.ient.rwth-aachen.de user name: jvtuser password: jvt.Amd.2 authentication: pserver path: /cvs/jvt module name: jmvm or jmvm_red
jmvm_red does not check out certain old folders related to SVC.
2.5.1.1.9 JVT-Z008 (Admin) [P. Pandit, H. Kimata, S. Cho, K. Müller] AHG Report: MVC RRU and mixed-resolution view coding Mandates 113 – Investigate approaches for enhancing MVC coding efficiency using spatial downsampling – Evaluate the complexity of such methods – Investigate the relationship between downsampling approaches and view interpolation – Investigate low-complexity methods for mobile stereoscopic 3DTV applications
No emails had been exchanged on this topic on the reflector.
There were three contributions to this meeting that were noted to relate to this AhG, as follows: JVT-Z023 [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic video coding using residual downsampling JVT-Z026 [Y. Chen, Y.-K. Wang, S. Liu, M. M. Hannuksela, H. Li (Nokia)] On asymmetric MVC JVT-Z034 [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test results for mixed resolution stereo video coding
The AHG recommended to review the related contributions during the meeting.
2.5.1.1.10 JVT-Z009 (Admin) [P. Pandit, H. S. Koo] AHG Report: MVC JMVM coding tools The JMVM coding tools AhG had been established with the following mandates: – Investigate simplification and improvement of current JMVM coding tools (IC and motion skip) – Investigate techniques for single loop decoding to reduce complexity starting with motion skip
No relevant email had been exchanged on the reflector during the interim period since the last meeting.
The following contributions were noted to relate to the AhG: JVT-Z021 [S. Lin, S. Gao, L. Xiong (Huawei), H. Yang, Y. Chang, J. Huo (Xidian University)] CE1: Fine Motion Matching for Motion Skip Mode in MVC JVT-Z029 [G. Zhu, X. Xu, P. Yang and Y. He (Tsinghua U.)] MVC inter-view skip mode using depth information JVT-Z030 [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip mode JVT-Z031 [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual pred JVT-Z032 [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flag
The AhG recommended to discuss the related contributions at the meeting.
2.6 Closing session notes In the closing session there were no requests to reopen discussions of preceding agenda topics and side activities recorded elsewhere in this report.
The JVT thanked its WG 11 parent body for hosting the 26th JVT meeting, and Sunflower Conference Services for arrangement of meeting logistics.
The meeting was closed at 11:45 a.m. on Friday 18 January 2008.
114 2.7 JVT liaison communications and parent-body communications
The JVT did not receive liaison communications at this meeting. However two WG 11 parent body input liaison statements were noted as discussed below.
No liaison statements were sent by the JVT from the meeting. However, outgoing liaison statements were sent by WG 11 regarding SVC verification test results as discussed below.
2.7.1.1.1 M15209: Response from SMPTE to sc29n8883 Liaison from JVT on potential extension of SVC SMPTE appears interested in bit depth and will be meeting in March – no detailed input was reported to be available prior to that.
2.7.1.1.2 M15215: Liaison response from DVD Forum regarding progress of video coding work The DVD Forum WG-1 thanked the Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC 1/SC 29/WG 11 and ITU-T SG 16 Q.6) for informing them about the recent progress on enhancements of the ITU-T Rec. H.264 & ISO/IEC 14996-10 Advanced Video Coding standard.
The DVD Forum indicated that it is studying the benefit for consumers, manufacturers and content providers to adopt these enhancements into their specifications. Although their study has not been concluded, they consider it essential to retain backward compatibility with the existing products to avoid market confusion. In addition, they would appreciate it if JVT could provide evidence of improvement by the enhancements so that their study becomes more practical.
The further information requested by the DVD Forum was provided by the outgoing WG 11 liaison statement N9617 discussed below.
2.7.1.1.3 N9617: Liaison Statement template for various organizations re SVC verification testing report The WG 11 parent body sent liaison letters conveying the verification test report for SVC to a variety of organizations – specifically to: ARIB, ATSC, BDA, DLNA, DVB, DVD Forum, EBU, FLO Forum, IEC TC 100, IETF AVT, ISMA, ITU-R SG 6, ITU-T SG 9, ITU-T SG 12, OMA, SCTE, SMPTE, TTA-DMB, WorldDMB, 3GPP, and 3GPP2. See also item 1 of section 10 of this report.
3 AVC base specification, errata, and related topics The latest prior errata reporting status was provided in the JVT-Y210 output document of the previous meeting. Three additional documents related to errata issues were submitted for consideration at this meeting: JVT-Z025, JVT-Z043, and JVT-Z044.
3.1.1.1.1 JVT-Z025 ( Errata 2.0/3.1) [Y.-K. Wang, M. M. Hannuksela (Nokia)] SVC corrigendum items This document reported three possible corrigendum items for the SVC specification.
115 The first item is related to a semantics constraint on sub-picture scalable layer SEI message. A fix is proposed.
JVT Decision: Adopted in spirit – exact phrasing to be determined (esp. relating to possibility of presence in different NAL units).
The second item is related to a constraint on the co-existence of “old” H.264/AVC SEI message and “new” SVC SEI message” in a same SEI NAL unit. A fix is proposed.
JVT Decision: Adopted in spirit – exact phrasing to be determined (esp. relating to changing an "and" to "that is").
The third one is on the definition of “decoded picture”. It was proposed to add a definition of “decoded picture” in Annex G to explicitly exclude a “reference base picture” being a “decoded picture”. Remark: Perhaps adding a NOTE (e.g. near semantics of use_ref_base_pic_flag) would be better than adding a new definition.
JVT Decision: Adopted in spirit – exact phrasing to be determined.
3.1.1.1.2 JVT-Z043 ( Errata) [H. Schwarz (HHI)] SVC errata This document contained an SVC related errata list for eventual incorporation into a future amendment or corrigendum to the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced Video Coding standard.
This document reportedly started with the JVT output document JVT-X201 as its basis. Changes were relative to that document. The document reportedly included all necessary issues of which the editors were aware prior to the 26th meeting.
Most issues were reported to have been identified by Danny Hong of Vidyo.
"r1" version also provided and presented.
JVT Disposition: Approved ("r1").
3.1.1.1.3 JVT-Z044-L (Late Errata 2.0/3.1) [V. Bottreau (Thomson)] On level limits common to scalable profiles – constraint "l" JVT members supporting presentation: – S. Pateux – H. Schwarz – S. Gao – Segall
Apology? Has been sent.
According to JVT-X201, constraint "l" sets limits to the number of reference layer macroblocks that can be encoded with mbType equal to I_PCM, I_16x16, I_8x8, I_4x4, or I_BL according to the number of enhancement layer macroblocks encoded with mbType equal to I_BL. It is understood that the primary intention of such a constraint was to limit the required decoder complexity. However, it was asserted that the impact of such a constraint may not have been sufficiently investigated. The contribution proposes to discuss the impact of this constraint from
116 an encoder perspective and highlights some use cases that such a constraint reportedly may preclude.
Reportedly, Equation G-370 may preclude some potential use cases. In addition, it was asserted that such a constraint imposes too strict and/or complex encoding rules from an encoder perspective. It was asserted that Equation G-370 sets encoding rules either on the reference layer or the enhancement layer by enforcing either a specific reference layer macroblock pattern to be encoded with mbType equal to I_PCM, I_16x16, I_8x8, I_4x4, or I_BL or an enhancement layer macroblock pattern to be encoded with mbType equal to I_BL. The contribution proposes that: – Equation G-370 be reformulated in order to better take into account the number of enhancement layer macroblocks eligible to be encoded with mbType equal to I_BL, i.e. for instance only the macroblocks lying within the cropping window; – And/or Equation G-370 be reformulated in order to minimize its impact on encoding mode selection for reference layer macroblocks, for instance by changing the 1.5 arbitrary factor; – Or that constraint "l" be removed.
JVT decision: This appears to be a valid report of an actual problem in the standard. The intent was to establish a constraint that constrains (only) the macroblocks in the base layer that are actually used in the decoding process of the enhancement layer. Under some conditions (e.g. involving cropping) the text does not seem to express that intended constraint. Further study may be needed to draft the final necessary correction.
4 Scalable video coding (SVC)
4.1 SVC bit depth and chroma format scalability
4.1.1.1.1 JVT-Z036 ( Prop Reqs) [A. Segall (Sharp)] On the requirements for bit- depth and chroma format scalability This document discusses the requirements for bit-depth scalability within the context of consumer applications. Current trends in display technology are the focus, and it is asserted that these trends motivate the need for higher bit-depth formats within consumer devices. Thus, it is proposed that development of any bit-depth scalable system should consider these applications.
The contribution contiains an emphasis on larger visual dynamic range, as opposed to increased precision representation of the same visual dynamic range (adding least significant bits).
The contribution suggests that 10 bit sample depth support is necessary for near term; 12 bits for longer term.
An approximate 1014:1 human visual dynamic range was reported; 104:1 in a short time interval. CRTs can do that, but with ambient light this is reduced to 50:1 or 100:1.
Multiple displays are on the market – getting brighter and extending their dynamic range, with very wide contrast ratios emerging. Example companies producing such technology: Sharp, LG, Brightside/Dolby.
An HDR image was demonstrated.
117 Remark: Justifying high bit depth and justifying scalable support for it are somewhat different subjects. Our previous work has already included increased bit depth support (except in the scalability context).
JVT conclusion: It seems generally agreed that support of a large visual dynamic range is an important capability to deliver in our work.
4.1.1.1.2 JVT-Z039 ( Info) [S. Liu, A. Vetro (MERL)] Requirements for bit-depth scalable coding This contribution considers a new application scenario for bit-depth scalable coding in which receiver-side editing of a high dynamic range video is desired. Requirements for bit-depth scalable coding are described, and preliminary results that aim to demonstrate the benefits of higher-bit depth video at the receiver are shown.
The contribution contains an emphasis on editing, tone mapping, etc., with "default" representation in the base layer and having an enhancement layer to provide greater quality.
Professional, "pro-sumer", and high-end consumer applications were emphasized.
The contribution suggests to consider high bit depth scalability support and to study the benefits of post-capture editing of HDR video.
Remark: Again, there's a distinction between the need for high quality / high bit depth, and the need for scalability, which is something else.
Response: There is the argument for an "easy to access" default representation supplemented by extra enhancement information – consumer quality plus extra data for high-quality subsequent processing.
4.1.1.1.3 JVT-Z045-Q (Late Prop 2.2/3.1) [Y. Yu, S. Gordon, M. Yang (Broadcom)] Bit depth SVC with a prediction filter Supporting presentation of the late contribution: – P. Topiwala – A. Vetro – Y. Chiu – A. Segall
Apology? Has been sent.
This contribution describes research work on bit depth SVC. By applying a filter to the reconstructed image from the lower layer, an average of 4.4% BDBR improvement, or an average of 0.15 dB BDPSNR improvement, can reportedly be achieved at the 10 bit top layer for "Viper" sequences. Higher gain is reportedly seen on input sequences with normal lighting conditions. (One sequence had a reported benefit exceeding 10%.)
The contribution describes the use of bit-depth scalability with tone mapping as in the current JSVM.
The contribution proposes modification of tone mapping operation.
Had some problem with BD measurements. 118 A presentation deck and "r1" version of the document were uploaded later.
Question: Has application of the filter after adding the enhancement layer been tried? Response: No. Remark: That might provide equivalent gain (without adding mandatory complexity to the decoding process).
Remark: Better gains in daytime scenes – good lighting conditions. Response: That's right. Remark: The sequences were intentionally chosen to provide a mixture of daytime and night-time content.
Further study is encouraged, including consideration of the post-processing comparison question.
4.2 SVC Conformance and verification
4.2.1 SVC conformance specification See JVT-Z003 and JVT-Z205. Proceeding to FPDAM status in the ISO/IEC approval process.
4.2.2 SVC verification testing The following two parent-body input contributions were noted in relation to SVC verification testing as discussed below. The SVC verification test report was produced as WG 11 N9577 (discussed below) and was conveyed to various organizations in liaison communications as found in WG 11 N9617 (discussed above).
4.2.2.1.1 M15108: Subjective results for the SVC verification test Subjective result plots were shown toward development of the SVC verification test report. Tested scenarios included Scalable Baseline (conversational and mobile TV applications); Scalable High (3-layer dyadic and 1080p on 720p applications); and Scalable High Intra (production application).
4.2.2.1.2 M15132: Verification of new SVC verification test streams Reports that the verification test bitstreams were appropriate for the test.
4.2.2.1.3 N9577 Report on SVC Verification Tests The verification test was conducted using conditions suitable for a range of possible application scenarios for progressive video, including: – Video-conferencing with quality scalability for CIF@30fps video, and spatial scalability for 640x352@60fps video with 1280x704@60fps enhancement – Mobile TV with quality scalability for QVGA@25fps video, and spatial scalability for [email protected] with VGA@25fps enhancement – HD TV with spatial scalability for 720p@50fps with 1080p@50fps enhancement – Movie production with spatial scalability for 1080p@25fps being the highest resolution, with two lower resolutions
119 For the performance evaluations, SVC was compared against AVC single layer coding by means of subjective testing. Subjective tests were performed following relevant international recommendations using a controlled environment and a high number of test subjects.
The results of these tests indicate that these various types of scalability for these applications can be achieved with a bit rate overhead typically equal to or less than 10% compared to AVC single layer coding using only the highest resolution in the test case. The bit rate savings obtained by SVC compared to AVC simulcast transmission depend on the particular test case, and were found to be between 17% and 40% of the simulcast bit rate. These bit rate savings relative to simulcast are particularly important for applications in which video must be provided with different spatial resolutions, for which simulcast would previously have been the only available AVC-based standardized solution.
Full detail is available in the WG 11 N9577 parent-body document, which was made a public document by WG 11. The drafting of the verification test report was coordinated by Tobias Oelbaum.
The JVT was pleased to see the good results from the verification test, which appear to validate the merit of the SVC design.
5 Multi-view coding (MVC)
5.1 Core experiment #1 & related docs: Fine motion matching for motion skip mode in MVC
5.1.1.1.1 JVT-Z021 ( Prop 2.2) [H. Yang, Y. Chang, J. Huo (Xidian University), S. Lin, S. Gao, L. Xiong (Huawei)] CE1: Fine motion matching for motion skip mode in MVC At the Shenzhen meeting, a fine motion matching technique was first proposed in JVT-Y037. Some issues about the technique were raised that needed clarification, and a core experiment was set up. This contribution reportedly continues with JVT-Y037, fulfills the mandates of the core experiment plan, and provides results for the proposed technique. The experimental results reportedly show that, compared with the performance of the current MVC JD and JMVM, BD gains of 0.169 dB / 4.11% and 0.083 dB / 2.11% can be obtained as an average over all views; with BD gains of 0.379 dB / 9.19% and 0.181 dB / 4.64% respectively, on average of the views that employs the proposed technique.
Clarification of reported percentage improvement relationships: – 2.11% average gain over all views (for 8 sequences) relative to current motion skip design of JMVM. – 4.64% average gain over the views that can use motion skip (for 8 sequences) relative to current motion skip design of JMVM. – 4.11% average gain over all views (for 8 sequences) relative to current JD, which does not include motion skip feature. – 9.19% average gain over the views that can use motion skip (for 8 sequences) relative to current JD, which does not include motion skip feature.
Current design of motion skip in MVC uses a 16-sample GDV (global disparity vector) resolution. Entire GOP uses same GDV value. Contribution proposes to change this to an 8- 120 sample resolution, with a block offset sent to determine the final 16x16 region in the reference picture to use for motion inference. Also send a flag to indicate whether to use a list 0 or list 1 reference picture in this process.
Supports single loop decoding.
Decoder complexity increase: Seems negligible – some extra syntax and minor calculations on MB level. Encoder complexity increase: Performs, e.g., 9*9*2 = 162 SSE calculations which were not otherwise needed (fast search is also obviously possible).
No change relative to what was proposed in JVT-Y037 (although some simplification of encoding process).
Proposal does not include CAVLC support.
Remark: That seems like a problem (perhaps for motion skip and intensity compensation in general, if CAVLC operation hasn't been tested for them).
Proposal does not include draft text. Considering that the book-keeping that is needed to infer the motion from the new non-macroblock-aligned positioning, the amount of necessary draft text might be substantial.
Remark: Gain seems small.
Remark: Might be good for support of single-loop decoding.
Remark: These results are for multi-loop decoding. How much do you get for single-loop decoding? After some discussion, rough estimates were 19% for multi-loop without new coding tools and 11% for single-loop.
Question: What would be the performance with 8-sample GDV but not adding the refinement? Response: Approximately no improvement from doing that alone – it is the refinement that provides the gain.
Remark: It seems clear that coding tools will not be in MVC phase 1.
Discussed again after further consideration of that issue.
Remark: Phase 1 seems sufficient serve the need of enabling the relevant applications that are ready to be deployed today. With or without low-level decoding tools, these tools still end up with a bit rate that is roughly proportional to the number of views. The market may not be ready yet for further profile definitions without a more major difference than this, e.g., 20% better compression than phase 1 MVC.
Later in the meeting, CAVLC operation was developed by the proponent, with roughly similar relative gain reported. Draft text was also drafted by the proponent.
JVT decision: Adopted into JMVM.
It was further agreed to set up an AHG to investigate MVC enhancements. But it was also agreed that we are in no rush to create more new MVC profiles.
121 The JMVM software is (already) structured in a way that makes it easy to remove experimental stuff from it. We need to make sure that this stays true.
5.1.1.1.2 JVT-Z033 ( Info) [Y. Chen (TUT), Y.-K. Wang (Nokia), S. Liu (USTC), M. M. Hannuksela, H. Li (Nokia)] CE1: Information on motion skip and CE 1 In this contribution, information is given on the performance of the current motion skip in JMVM and the performance and the functionalities of potential tools related to the CE 1. In multiple-loop decoding, the original motion skip reportedly contributes an average bit-saving of around 2%. The method proposed by Huawei in JVT-Z021 reportedly doubles the bit-savings to around 4%.
5.1.1.1.3 JVT-Z037-V ( Info) [Y. Su, A. Segall (Sharp)] Verif of JVT-Z021: Fine motion matching for motion skip mode in MVC (CE1) This document provides a verification report of Huawei's response to CE1 on motion skip mode in MVC. Huawei provided Sharp the software for JVT-Z037. Sharp reportedly carefully inspected the software, compiled the software, and generated results as specified in the CE. All results reportedly matched. JVT-Z021 was thus reported to have been verified.
5.2 MVC motion skip mode (without depth information) and related documents
5.2.1.1.1 JVT-Z030 ( Prop 2.2/3.1) [Y. S. Ho, K. J. Oh, C. Lee (GIST)] Regional disparity derivation for MVC motion skip mode This document described a method of regional disparity derivation for motion skip mode. The current motion skip mode in JMVM utilizes a global disparity vector of 16-sample precision to find the position of the corresponding macroblock for the current macroblock. However, since the multi-view scene consists of several objects and each object has its own disparity value, it was asserted that the global disparity is not enough to cover the disparity of the whole image. It was proposed to use regional disparities, instead of the global disparity, for the motion skip mode. The proposed scheme generates the disparity map for each anchor frame considering its motion vectors and then derives disparity maps for non-anchor frames using both forward and backward disparity maps. The temporal movement is also considered. Compared to JMVM 6.0, the proposed scheme reportedly achieved a similar coding gain with the previous scheme.
Results were reported for 5 sequences, 2 GOPs. Relative to the current motion skip mode, approximately the same coding performance was reported.
The proponent suggested combining with residual prediction and suggested that better results might be obtained that way.
Contribution noted.
5.2.1.1.2 JVT-Z031 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] MVC motion skip mode with residual pred This document proposes a prediction structure for MVC which is reportedly a combination of motion skip mode (per JVT-Y058) and residual prediction. In terms of residual prediction, it was claimed that the proposed method is very similar to the residual prediction technique of SVC. The proposed method uses an integer precision global disparity vector and derives a disparity vector of motion skip mode from the global disparity vector by a shift operation. For such use, it 122 was asserted that a smoothing filter would be needed to reduce boundary artifacts, but investigation of such a filter reportedly could not be finalized due to lack of time. The proposed method reportedly showed some gain without a smoothing filter for "dense" sequences. The number of test sequences was limited – 1.4% average gain on set of 5 sequences. The contribution recommended the creation of a CE on this topic.
Remark: Adds substantial complexity (e.g. searching in encoder for integer-precision GDV value, residual handling and storage in decoder).
Contribution noted.
5.2.1.1.3 JVT-Z032 ( Prop 2.2) [J. H. Park, B.H. Choi (KETI)] Clarification of motion_skip_enable_flag This document requested a change of the conditions relating to the slice header syntax element motion_skip_enable_flag, which was newly introduced from JVT-Y207. This contribution suggested to change the conditions on the presence of the motion_skip_enable_flag so that it is not sent when it is not used, and to otherwise structure the syntax in a more logical fashion in relation to that syntax element.
JVT Decision: Adopted (conditioned on whether we actually will use the feature that this is refining the design of).
5.3 MVC motion skip mode with depth information
5.3.1.1.1 JVT-Z029 ( Prop 2.2/3.1) [G. Zhu, X. Xu, P. Yang, Y. He (Tsinghua U.), J. Zheng, X. Zheng (Hisilicon)] MVC inter-view skip mode using depth information This contribution proposed an inter-view skip mode using depth information. A motion vector (disparity vector) of this mode is derived from the corresponding camera parameters and depth map. Experiments on the one sequence with depth ("Breakdancers") reportedly showed an average benefit of 0.288 dB / 11.37% for the P frame portion of the bitsteam.
No temporal prediction used at all in this test.
Question: What would be the percentage savings of the total bitstream?
Remark: Similar prior investigations have been done.
Remark: Perhaps depth bit rate should be accounted for.
Only tested on P slices. Only applies if (high quality) depth map is available.
Investigation of this feature is reported to only be preliminary.
Contribution noted. This may be a useful area for investigation toward some future project that may include depth map support, but does not seem to be within the scope of the current JVT MVC project.
123 5.3.1.1.2 JVT-Z046-QV (Late Verif) [J.-Z. Xu (Microsoft)] Verif JVT-Z029 This contribution was provided to verify the results reported in JVT-Z029. Encoding and decoding results were reportedly verified.
The authors of JVT-Z029 provided their modified source code to the contributor. The encoder and the decoder were compiled to get the results of the proposed scheme of JVT-Z029. And the encoder of anchor was run to get the results of anchor.
The results were reported to be exactly the same as those listed in Table I of JVT-Z029.
5.3.1.1.3 JVT-Z048-QV (Late Verif) [H. Yang (Xidian Univ.)] Verif JVT-Z029 This document was provided to verify the results reported in JVT-Z029 from Tsinghua Univ. The software was provided to the contributor for verification, and was reportedly confirmed to match the method described in the proposal. An executable file was generated by compiling the provided software, and was used to decode the provided bitstream. The provided bitstreams were reported to have been successfully decoded, and all results reportedly matched. JVT-Z029 was thus reported to have been verified.
5.4 Core experiment #2: Adaptive reference filtering for MVC
5.4.1.1.1 JVT-Z020 ( Prop 2.2/3.1) [P. L. Lai (USC), P. Pandit, P. Yin, C. Gomila (Thomson)] CE2: Adaptive reference filtering for MVC This contribution reported work and results for CE2 (JVT-X302) to evaluate coding gain of adaptive reference filtering (ARF) for MVC. To avoid needing a two-pass encoding procedure (initial estimation, filter design, then encoding with filtered references), a fixed set of filters was reportedly designed by clustering the ARF filters with a method described in previous JVT documents (JVT-W065, JVT-X060, JVT-Y041). They were 3x3 filters with symmetric constraints as proposed in JVT-X060. The results provided in this document were for anchor- only sequence coding with P and B frames. The average bit rate savings for anchor-only sequence coding was asserted to be 5.69%, or equivalently 0.26 dB. The gains for anchor-only sequence coding were asserted to range from 0.07 dB to 0.45 dB. The asserted gain was larger for sequences with stronger focus mismatches such as Race1 and Rena.
Question: Are there applications that would use anchor-only coding? Response: Professional applications such as editing – note that there is a profile proposal for such a profile.
Question: What would the gain be when not using anchor-only coding? Seems very small.
Question: Is the gain additive to IC? Response: Not tested.
Question: Complexity impact? Extra filtering to be done during inter prediction.
Remark: Was combining the two filters considered.
Question: How is the reference filtering adaptive? A particular filter is assigned to each position in the reference picture list (no ability to change which filter is applied to which position). The only adaptivity is the dependence of the filtering on the reference picture list index.
JVT Disposition: Further study was encouraged.
124 5.5 Residual downsampling for MVC
5.5.1.1.1 JVT-Z023 ( Prop 2.2) [S. Cho, N. Hur, J. Kim, S.-I. Lee (ETRI)] Coding eff of stereoscopic video coding using residual downsampling This contribution reported an analysis of experimental results of stereoscopic video coding using the residual-downsampling algorithm which was contributed in JVT-Y052 at the last meeting. It also compared the coding efficiency of the residual-downsampling algorithm in JVT-Y052 with that of the JMVM including 8x8 transforms for only the non-base view.
The basic concept is analogous to RRU of H.263 Annex Q / MPEG-4 pt. 2, plus also supporting horizontal-only and vertical-only downsampling.
Tested with CAVLC, IPPP coding structure, no 8x8 transform in non-base view.
Tested on three non-common-conditions test sequences, one of which was animation content.
Significant coding gain was reported at lower bit rates (with or without 8x8 transform in non- base view).
With the 8x8 transform in both the reference and modified design, the contribution reports significant gain for the use of RRU: About 8% on two natural video sequences, and 16% on one animation sequence, as a percentage of the bit rate for the second view.
Remark: But this is the percentage savings of just the bits for one of the two views – not of the total bit rate. Response: Yes, as a percentage of the total, it would be about 3% for the natural sequences and perhaps 5% for the animation.
Remark: Does this make sense? How would it perform on regular single-view video?
Remark: Common conditions sequences are not stereoscopic sequence.
Remark: Is there really anything about this proposal that makes it specific to the stereoscopic context? Perhaps not.
Remark: JMVM with and without 8x8 transform – no significant difference in performance was reported between these two. Response: Probably because this is using QVGA resolution sequences.
Not verified, not using common conditions, just a few (low-res) test sequences.
Remark: Should test using MVC common conditions.
JVT Disposition: Further study was encouraged.
5.6 Mixed-resolution MVC
125 5.6.1.1.1 JVT-Z026 ( Prop 2.2.1/3.1) [Y. Chen (TUT), Y.-K. Wang (USTC), S. Liu, M. M. Hannuksela (Nokia), H. Li (Nokia)] On asymmetric MVC This is a follow-up proposal to JVT-Y054. "Asymmetric coding" involves the coding of two views of a stereoscopic video with different resolutions. In JVT-Y054, it was asserted that the proposed scheme has low complexity and requires a smaller decoded picture buffer size compared to downsampled inter-view prediction. Simulation results under common test condition were provided. In this proposal, results on stereoscopic video (two views) were provided. It was claimed that the proposed method has almost the same efficiency as downsampled inter-view prediction.
Rather than downsampling, uses 1/2 sample interpolation as 1/4 sample positions and uses odd integer samples as 1/2 sample positions, and even integer samples as integer-sample positions.
Results were compared to simulcast and to the downsampled reference technique. About the same performance, relative to downsampled reference technique, was reported (a small gain), and about 14% gain relative to simulcast-resolution use of MVC design.
Software? Can be made available.
Draft text? Not yet.
JVT Disposition: Further study was encouraged.
5.6.1.1.2 JVT-Z034 ( Prop 2.2) [S. Cho, B. Lee, N. Hur, J. Kim, S.-I. Lee (ETRI)] Prelim subjective test results for mixed resolution stereo video coding This document reported preliminary results of subjective tests for mixed resolution stereo video coding. Such coding methods were envisaged to be used for 3D DMB services and systems in the future. The presented results were partly achieved in collaboration with Fraunhofer HHI. They were reported to be considered to be preliminary and to report work in progress. Test conditions were reported to be "not yet perfect" and the results were reported to be not consistent and complete. So far the results reportedly do not allow drawing reliable conclusions. However, there was reported to be some indication that mixed resolution stereo coding may outperform MVC at least in some cases, under some conditions.
Shown on 3.5 inch display, each view having 320x480 resolution, 30 fps.
3 test sequences.
CAVLC, no 8x8 transform.
Results were characterized as preliminary.
Conclusions difficult – not consistent The author indicated an interest in providing improved measurements in the future.
Disposition: Further study was encouraged.
126 5.7 MVC high-level syntax and HRD
5.7.1.1.1 JVT-Z024 ( Info) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y.-K. Wang (Nokia), C. Ying (Tech. U. Tampere)] MVC decoding process and HRD design This informational contribution discussed some issues relating to the description of the decoding process – whether the decoding process specifies the output of only one view (i.e. repeating the decoding process when output of multiple views is needed) or the output of any number of views to be output. It was asserted that the decision on the issues may deeply affect the MVC specification text editorially. Furthermore, it was asserted that the design of the HRD for MVC may also be affected technically. Two methods with their pros and cons were discussed (single- view output or output of all view).
The goal of this contribution was reportedly to outline the issues for discussion and collect opinions from other JVT experts. It was recommended that the group carefully discuss these issues since it was asserted that they affect several important aspects of the MVC specification.
After discussion – an approach similar to that used for SVC and for separate_colour_plane_flag equal to 1 was suggested. The suggestion was to focus on the decoding of one view throughout time. Other views on which that view depends will be generated during the decoding process as needed to decode the target view. Mark the pictures of the other views as "not used for reference" after completing the decoding process for the target view (or consider them as marked as inter-view-only reference pictures and all such pictures are removed from this classification upon completion of the access unit decoding process for the target view). Pictures of other views that are not the target view are always marked as "not needed for output". Make a distinction between marking for temporal referencing and for inter-view referencing.
A BoG discussion group was formed (coordinated by A. Vetro) to further discuss the editing aspects – the exact phrasing was left to the editors.
5.7.1.1.2 JVT-Z027 ( Prop 2.2/3.1) [H. Nakamura, M. Ueda (JVC)] Comments on SPS MVC extension This contribution proposed two changes relating to view dependency information in the sequence parameter set MVC extension. Both proposals are for redundancy reduction. These were asserted to be independent proposals.
This contribution suggested modifying the sequence parameter set MVC extension syntax.
Aspect #1: Signaling for applying inter-view prediction
Analogous to simulcast AVC coding.
The contribution notes that various aspects of simulcast coding may need some (e.g., externally specified) application-level support external to the video elementary bitstream data.
The contribution proposed a flag to indicate a lack of any inter-view dependency (rather than using existing dependency info syntax).
127 This case seems easy to detect with existing syntax, and the savings in this case is very minor – the JVT concluded that there was no real need to treat this case as special. No action was therefore taken by the JVT in this regard.
Aspect #2: anchor_ref_l0[i][j] inference from num_anchor_refs[i] for i equal to 1
The contribution proposed eliminating syntax element presence when a value can be easily inferred.
The proposal was concluded to seem to be excessive fine-tuning with negligible real effect. No action was therefore taken by the JVT in this regard.
5.7.1.1.3 JVT-Z038 ( SEI Prop 2.0/3.1) [S. Yea, A. Vetro (MERL), A. Smolic, H. Brust (HHI)] Revised syntax for SEI message on multiview acquisition information This contribution proposed alternative syntax options for the SEI message on multiview acquisition information (camera parameters). In particular, two forms of syntax were considered: one being a floating-point representation with a variable-length mantissa, while the other also has a variable-length mantissa and follows the IEEE 754 format. The proposed options were asserted to enable a wider range of numerical values and precisions, which would reportedly overcome some shortcomings and problems with the existing syntax. Matlab scripts were provided for verifying the proposed formats.
Remark: Is there an ISO, IEC, or ITU-T spec for IEEE 754? Apparent answer: IEC 60559.
Remark: Is it OK for the SEI message to not be parsable without first parsing the SPS?
JVT decision: Adopted.
5.8 MVC profiles
5.8.1.1.1 JVT-Z028 ( Prop Profiles) [B.-M. Jeon (LG), W. S. Shim (Samsung), S. Cho (ETRI), G. H. Park (Kyung Hee U.), P. Pandit (Thomson), Y.-L. Lee (Sejong U.)] About MVC coding tools The JVT has put two new coding tools into the JMVM: illumination compensation (including loop filtering) and motion skip to improve the coding efficiency in MVC. However, the current JD does not contain either of these new coding tools that are found in the JMVM.
Compared with simulcast H.264/AVC, JMVM 5.0 without the two new coding tools reportedly achieves about a 19% bit rate savings on average, while JMVM 5.0 with two new coding tools reportedly achieves about a 27% bit rate savings on average, when JMVM common test conditions are used for all test sequences except the "Uli" sequence. This contribution recommended that the JVT adopt the two JMVM tools into the JD (and thus presumably into intended profile plans) at this meeting.
A description of some planned MVC service deployments in Korea was presented, including mobile services in particular.
128 Question: Are the described deployments planning to use CABAC or CAVLC? Response: CAVLC? Follow-up question: But these coding tools have never been tested with CAVLC, so how could we consider specifying to use this untested configuration?
Question: How many views? 3 (autostereoscopic).
Remark: The gain for these tools, when using only 3 views, will be less than what has been measured in our common conditions tests.
Remark: The best thing for us to do to enable the application at this stage is to establish a standard which is as easy to implement as possible, based on existing implementation designs for AVC. Once the application becomes established, it might then begin to make sense to consider a more customized design.
We seem to only be able to specify profiles at this stage (for completion of standardization by July) without MB-level coding tools, considering the lack of testing of CAVLC performance for such coding tools. Agreed.
Remark: It seems worthwhile to still consider longer-term work that could include new coding tools.
The JVT decided to plan AHG activity to include testing of CAVLC performance making that a mandate of the phase 2 MVC AHG.
5.8.1.1.2 JVT-Z047-Q / M15196 (Late Prop 2.0/3.1) [H. Kimata (NTT), H. Nakamura (JVC), T. Itoh (Fujitsu), T. Nomura (Sharp)] Proposal on Profiles for MVC (Multi-view Video Coding) Supporting allowing presentation of this late contribution: – A. Vetro – J. Ridge – F. Istiaq – P. Purvin
Apology? Was reportedly sent.
Proposes 4 profiles for MVC. – Profile A: With no inter-view prediction (basically a simulcast profile), up to 16 views – Profile B: No temporal prediction Remark: We ordinarily only specify aspects that have a decoder complexity impact. – Profile C: With inter-view and temporal prediction, possibly with temporal and SNR scalability – inter-view prediction only for anchor picturs Remark: Restricting inter-view prediction to anchor pictures seems questionable. – Profile D: Withdrawn.
The contribution advocated having some constraints on dependency structures. We should study this.
Suggestion: Define one profile with SPS-level switches of features.
Suggestion: Profile should be based on High profile.
129 Suggestion: Add a third dimension to capability specification: Not just profile & level, but profile & level & number of views.
Remark: Regarding the inclusion of SVC features, it seems like we haven't thought about how MVC and SVC can work together – does the high-level syntax work for such a scenario?
A. Vetro and H. Kimata considered these as break-out discussion subjects and reported back to the JVT. This was further discussed jointly with MPEG as reported elsewhere in this report. See JVT-Z049 and section 7.2 of this report.
5.8.1.1.3 JVT-Z049-B (BoG Report) BoG report on MVC profiles This document provided a report that summarized the results of break-out group activity discussions on MVC profiles during the Antalya meeting.
JVT decision: It was decided at the meeting to target the definition of a single Multiview High profile. Also, some dimensions of the level definition were outlined and discussed. The current draft text that would specify this profile and the level definition were provided.
5.9 MVC test sequences
5.9.1.1.1 JVT-Z035 / M15102 ( Info) [I. Feldmann, M. Mueller, F. Zilly, R. Tanger, K. Mueller, A. Smolic, P. Kauff, T. Wiegand (HHI)] Progress report on 3DTV video acquisition This contribution contained a progress report about work on content creation for 3DTV. The document described a multiview camera arrangement, details of the camera hardware, planned content, disparity range, and potential use of the data.
This contribution had been reviewed by MPEG as M15102 and was not presented in detail to the JVT.
6 New AVC Proposals
6.1 Adaptive MV precision
6.1.1.1.1 JVT-Z022 / M15185 ( Prop 2.2/3.1) [S. Sekiguchi, K.Otoi, Y. Yamada, K. Asai, T. Murakami (MEI)] 4:4:4 video coding perf with adaptive MV coding This contribution reports a potential compression performance improvement of 4:4:4 video coding with adaptive coded representation of motion vector information depending on the magnitude of the motion vector. The objective of this study was to investigate the possibility of further compression performance improvement for a 4:4:4 profile in order to apply it to consumer-level applications requiring much higher compression ratios. In the case of high- compression conditions, it was suggested to be necessary to consider further reduction of the coded bits for motion information. The studied approach here is to perform an adaptive motion vector search that limits fractional sample accuracy depending on the magnitude of motion vector, and to derive a coded representation of motion vectors assuming the adaptive accuracy motion search. Experimental results reportedly showed an advantage to the studied approach in the case of high-compression conditions.
130 This was a contribution toward a hypothetical future 4:4:4 profile oriented toward consumer applications.
The contribution suggested to use a relatively small search range with 1/4-sample search, a larger range with 1/2-sample search, and a yet larger integer search range.
On one test sequence ("Shimoda"), compared to the current JM search and syntax technology, an experiment yielded a reported 5.6% delta bit rate improvement over the 38-42 dB fidelity range.
What does this have to do with 4:4:4? Couldn't it apply equally well to other chroma formats? Response: Yes, it can.
A 50-63% bit rate savings was reported for the motion vector data portion of the bitstream with IPPP coding.
The contribution asserts a savings in memory bandwidth also.
Remark: The entropy coding aspect seems difficult to comprehend, since motion vectors are sent as deltas from predictors, whereas the precision depends on the actual MV value result rather than the delta – how would we determine the interpretation of the MVD? Response: A scheme is used which seems to work; it involves maintaining quarter-sample precision for the predictor – other details were considered difficult to explain, but were reported to work.
Question: What happens if you change the Lambda for the anchor? Response: Don't know.
Question: Was this using CAVLC or CABAC? Answer: CAVLC.
Presentation? Uploaded later.
JVT disposition: Further investigation was encouraged, including testing on more data and for other chroma formats.
6.2 SEI messages
6.2.1.1.1 JVT-Z040 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to convey suitable splice points in the bitstream This contribution proposed a new SEI message that would convey information about a potential splice point in the bitstream located N access units subsequent to (in bitstream order) the location of the SEI or the current access unit.
A "Suitable splicing point" of order M was defined as a point in the bitstream at which some specific number of pictures M is present in the DPB that are ready for output at consecutive clock ticks, prior to any gap in output times. For example, if the DPB contains F = 5 pictures, and if the bitstream is cut at that point, and it is a suitable splicing point with M = 2, there would be 2 pictures with consecutive output times and each of the other three pictures would have one of the following two properties: – There is a "time gap" after the output times of the M pictures and before the output time of the other picture arrives, or – The other picture has an output time that precedes the output times of the M pictures.
131 The proposed SEI message identifies a "suitable splice point" and conveys the value of M, N, and F.
Question: Fixed frame rate assumption? Response: Yes.
Question: What is the need for providing advance notice in the bitstream of these properties that are to be fulfilled at a later point in the bitstream? Response: Motivation is to reduce complexity and delay in the splicer, to enable "pre-conditioning" of the other stream that is to be spliced in, etc. Remark: Don't really understand that explanation.
Question: Why is an SEI message needed? Why can't the decoder/splicer just scan the bitstream and watch the picture properties to identify for itself the points in the bitstream that have these properties?
Remark: Need to consider the additive nature of the buffering in relation to the buffer capacity.
The contribution proposes to add flags/indicators whether the indicated "suitable splice point" immediately precedes an I or IDR.
Question: How is that aspect useful?
Question: Is there an assumption that the new data that would be concatenated after the "suitable splice point" would begin with an IDR picture? Response: Perhaps an all-Intra picture rather than an IDR picture.
Question: What to do with the other pictures that are waiting in the DPB if not followed by an IDR?
Response: 1) Send an MMCO to mark them "not used for reference", and 2) if they have not yet been output, perhaps this should not be indicated as a "suitable splice point".
Proposed syntax and semantics were shown. There were some other syntax elements in the proposal.
The proposal also includes signaling of some CPB properties.
Remark: As shown, the contribution may reflect a misconception about the meaning of PicOrderCnt (although the concept may be expressible in another way).
Remark: These proposals create "promises" that must be fulfilled later in the bitstream – but some of the "not yet fulfilled promises" may be broken by a splicing operation.
Remark: This is especially true for (e.g. older-generation) splicing equipment that has been designed without awareness of this proposed SEI message.
Response: We can specify that the promises go away upon encountering an IDR picture or end_of_stream NAL unit. Or we can specify not to make overlapping promises.
Remark: The proposal does not seem like it provides a complete solution to this "broken promises" problem.
132 Response: But we risk having other specifications developed that establish harmful constraints on bitstream or application behavior – resulting in reduced interoperability and loss of potential capability.
Remark: Managing CPB aspects is another aspect that needs study in this context.
Remark: Whenever the "M" is mismatched between the old stream and the new stream, the only thing the splicer can probably to is set no_output_of_prior_pics_flag to 1. A splice operation must manage both DPB output times and CPB output times (and decoding times, etc.). Is it really possible to produce seamless behaviour and/or maintain conforming behavior under some of these circumstances?
See further notes below.
6.2.1.1.2 JVT-Z041 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to control DPB output in non-seamless spliced bitstreams with end_of_stream This contribution proposed a new SEI message that would convey information related to the output of DPB pictures at the splice point of non-seamless concatenated bitstreams. It was asserted that the proposed SEI message could serve as a tool to aid splicing devices, along with the end_of_stream NAL unit and no_output_of _prior_pics_flag. The proposed SEI message would be provided in the bitstream prior to the end_of_stream NAL unit to identify its location and specify the output behavior of non-previously output pictures in the DPB subsequent to the end_of_stream NAL unit.
Information that specifies the output behavior of each non-previously output DPB picture reportedly allows for outputting a picture, not outputting, or outputting the picture for a number of consecutive times prior to outputting the subsequent picture from the first bitstream.
It was asserted that the proposed SEI message could be signaled ahead with information that points to the location of the end_of_stream
Discusses gaps – non-consecutive (fixed frame rate) output times for pictures in the DPB.
There is currently no equivalent of MMCO for output marking – just no_output_of_prior_pics_flag. This is basically what is suggested in this contribution.
Remark: We need to consider the conformance implications – we have a standard that establishes requirements for conformance that we can't change (at least not easily/substantially).
Remark: Consider the situation where a "left-over" picture from prior to the splice point ends up with the same output time as a picture from the new spliced-in coded video sequence.
See further notes below.
6.2.1.1.3 JVT-Z042 ( Prop 2.2) [A. A. Rodriguez, J. Au (SciAtl/Cisco)] Prop SEI message to forewarn location of end_of_stream This contribution proposed a new SEI message that would identify the location of an end_of_stream NAL unit in the bitstream (some number of access units prior to its placement).
133 The end_of_stream NAL unit is the last NAL unit in the access unit that ends a bitstream. In some system environments, a new bitstream may immediately follow the access unit that ended the bitstream. The proposed SEI message would indicate that an end of stream NAL unit will appear in the bitstream at a position N access units after the location of the SEI message.
See further notes below.
6.2.1.1.4 Discussion of JVT-Z040, JVT-Z041, and JVT-Z042 together JVT Disposition: These contributions seem to open an topic likely to require action. But further study is needed to determine exactly what to do.
Related activities may be under way in ITU-T SG 9 (J.181, incoming LS, on cue messages and codec changes) and SCTE (SCTE 35, which is a corresponding specification, and other activities on "conditioning" – how to manage the operation) with some interest also found in DVB.
The JVT planned to conduct further study of this subject, and to establish an AHG in which to perform such investigations.
6.3 Deployment issues In meeting discussions, it was noted that many products (esp. portable video players) have implemented the "toolbox" subset corresponding to profile_idc = 66 with constraint_set1_flag and that some application specifications have specified to use this subset, which is currently not a defined conformance point of the AVC standard. It was suggested to provide a new profile definition corresponding to those settings.
The suggested name for this new profile that arose in these discussions was the "Common Profile".
JVT Decision: The JVT suggests that its parent bodies and participants study this suggestion and provide their opinions about the desirability of this potential future action. A resolution on this topic was conveyed to the WG 11 parent body for inclusion in the parent body meeting resolutions. See item 4 of section 10 of this report.
7 Joint discussion with MPEG requirements
7.1 SVC bit depth, gamut, and chroma format scalability Remarks made in this discussion include the following: – How ready is the market for such a thing? – Display technology development is not mature. – Need ability to demonstrate benefits clearly. – Need good testing conditions for experiments, etc. – Need display capability for standard development. – SVC would be just an efficiency improvement of something that can be done already in another way.
7.2 MVC profiles and levels
134 JVT-Z049-B was reviewed and the following topics were further discussed with decisions as recorded below:
Use a constraint set flag with Main/High profile values to indicate compatibility with MVC. JVT decision: Agreed.
How do we specify the memory capacity? Use a fixed multiplier of buffering capacity and maximum ref_idx value, associated with a nominal number of views expected in the profile. The fixed multiplier should be 2. JVT decision: Agreed.
View random access friendliness constraint / parallel-processing friendliness constraint: Editors to put their best effort into the draft – others can review and comment as things move forward. JVT decision: Agreed.
Should we have a slice size constraint like we have in the "professional" profiles? Yes. JVT decision: Agreed.
7.3 3DV / FVV There was a presentation and discussion of MPEG's exploration work on 3D video / free- viewpoint video.
Two applications were discussed: – The application known as "3D video" is video for 3D display viewing. – The application known as "free viewpoint video" is video with support of extensive navigation capability within a 3D environment.
The current focus is on the first of these two application domains.
7.4 1080p50/60 MPEG-2 The JVT was informed of some MPEG-2 contributions and discussions within MPEG.
7.4.1.1.1 M14863 JNB comment on 1080p50/60 MPEG-2/H.262 This contribution to MPEG was provided in support of the M14869 technical proposal discussed below.
7.4.1.1.2 M14869 Technical proposal on 1080p50/60 MPEG-2/H.262 This contribution to MPEG contained a technical proposal for support of 1080p50/60 in MPEG- 2/H.262.
Max bit rate 80 Mbps = same as High level, buffer size = same as High level.
4:2:2 profile? And an Intra-only 4:2:2 profile? Leaving open for further study.
No interlace support? Maintain nesting decoding capability, but prohibit interlace in top level.
MPEG indicated that it would list 5 NBs supporting this proposal.
135 MPEG's plan was indicated to be to go to the PDAM stage of the ISO/IEC approval process at this meeting. Activities toward development of an associated conformance specification and reference software were planned.
8 JVT internal operating rules JVT decision: The following clarifications/adjustments of JVT operating rules have been adopted.
The JVT decided that participants shall to refrain from long (=more than 4 Minutes) presentations of their proposal, if the results of their coding efficiency experiments have provided less than 2% bit-rate on average (or equivalently 0.1 dB gain on average).
Presentations should also not use "cherry picking" of results for summary reporting in abstracts and presentations. Summary reports must be true summaries – not highlights of best results while ignoring worst results.
Regarding late contributions: Due to our difficulties with a large quantity of late-submitted contributions at previous meetings, the JVT has agreed that for its next meeting, no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented without having a minimum of 4 JVT participants (working for separate organizations other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of such a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT-authorized "break out group" side activities).
For all contributions that have presentation material that is used to present them to the group (e.g., PowerPoint presentations), the presentation material should be provided along with the written contribution (within the same zip container file). PDF is preferred over PPT for presentations when the PPT filesize is large and there is no need for the slide deck to be editable by others.
All submissions must be made in JVT-Zxxx.zip format with the word docs, excel sheets and other information being in the zip container. The document must contain an abstract and be accompanied with an e-mail notification containing title, authors and abstract (identical to the one in the doc) which is no longer than 200 words and no shorter than 25 words and is written in 3rd person language in a manner that does not express endorsement of the content of the document.
On filenames inside of .zip containers – use a filename so that if someone takes the files out of the zip container, they would still know what contribution they came from. Every file (or directory) in the .zip container for document JVT-Zxxx should start with JVT-Zxxx. Example: JVT-Zxxx.doc (main document), JVT-Zxxx_presentation.pdf, JVT-Zxxx_results1.xls, etc.
When providing additional or revised files, do not include copies of files that were already included in the prior .zip archive for the same contribution and do not re-use the same filenames without adding revision numbers (r1, r2, etc.) – this saves us needing to worry about whether the files someone obtains with the same filenames are the same or different.
Independent verification (necessary for adoption of a proposal) is provided either through
136 a) independent implementation by 1 or more organization different than that of the proponent based on the textual description (after adoption, both decoder source code versions must be made publicly available along with one encoder version), or b) providing source code to all CE participants prior to the meeting (CEs can only be joined at the meeting, when the CE is created. CEs are created at each meeting and last until the next meeting.)
Simply running binary executables provided by a proponent is not ordinarily considered independent verification. Source code should be provided and used, and the verifying party should invest a proper degree of effort to ensure that the “verification” they perform is a meaningful and professional study with significant depth rather than just a perfunctory procedural formality.
For every SEI message and every syntax element that are currently in the SVC/MVC draft, a showcase has to be provided in order to retain it in the JSVM/JMVM/JD. If such a showcase is not provided at the next meeting for an SEI message or parts of it, the SEI message or the respective parts will be removed from the JSVM/JMVM/JD. The source code and executables for the showcase must be made available.
When Core Experiments (CEs) are to be established, a first CE description should be available at the last day of the meeting (or at least within a few days). Changes of the CE description are only allowed until 3 weeks prior to the next meeting. These changes must be of evolutionary characteristic relative to the input documents on which the CE is based and must be agreed by those who contributed the respective input document(s) or be added as an option.
Contributions that are proposals of new technology that was not what was described as being tested in a CE (even if related to the tested technology) should not indicate that they are CE documents in their title and abstract.
9 List of AHGs established The following JVT “ad hoc groups” (AHGs) were established to progress work on identified topics until the next meeting of the JVT.
9.1 JVT project management and errata reporting Discussion: [email protected] Chair: Gary Sullivan, Jens Rainer Ohm, Ajay Luthra, and Thomas Wiegand Mandates: – Collect errata reports on standards under management of JVT – Coordinate overall interim JVT progress – Prepare status information for JVT status reporting
9.2 JM Text, reference software, bitstream exchange and conformance Discussion: [email protected] Chair: Thomas Wiegand, Karsten Sühring, Alexis Tourapis, Teruhiko Suzuki, Gary Sullivan Mandates: – Maintain and update JM algorithm description text – Maintain and update JM reference software and its usage manual – Facilitate exchange of test bitstreams to aid interoperability testing – Collect bitstreams for inclusion in (non-SVC) Conformance specifications – Identify and correct problems in Conformance specifications and associated bitstreams 137 9.3 SVC JSVM text, software and conformance Discussion: [email protected] Chair: Heiko Schwarz, Jérome Vieron, Thomas Wiegand, Mathias Wien, Alex Eleftheriadis, Vincent Bottreau Mandates: – Edit and deliver improved JSVM text – Coordinate JSVM software integration – Coordinate bug-fixing process for the JSVM software – Maintain JSVM software manual – Plan, edit, and collect bitstreams for SVC conformance specification
9.4 SVC bit depth, color gamut, and chroma format scalability Discussion: [email protected] Chair: Andrew Segall, Thomas Wiegand Mandates: – Identify applications – Work out suggestions for detailed needs – Find/create test material – Study bit-depth reduction techniques, e.g., tone-mapping tools – Study color space and/or gamma conversion requirements – Define experiments and test conditions – Investigate software and text modification needs – Identify complexity issues
9.5 SVC FGS applications and design simplification Discussion: [email protected] Chair: Justin Ridge, Marta Karczewicz Mandates: – Identify applications that may require FGS functionality and their characteristics – Determine to what extent new coding tools are needed to achieve the functionality – Define experiments and test conditions relating to FGS technology – Coordinate with JSVM software effort to align JSVM software with current design – Explore simplification of FGS tool design
9.6 MVC JD and JMVM text and software Discussion: [email protected] Chair: Hideaki Kimata, Aljoscha Smolic, Purvin Pandit, Anthony Vetro, Ying Chen Mandates: – Collect comments on draft, perform necessary editing and delivery. – Maintain JMVM and JD document and collect comments on the text. – Coordinate JMVM software integration – Coordinate bug-fixing process for the JMVM software – Maintain JMVM software manual
138 9.7 MVC JMVM coding tools Discussion: [email protected] Chair: Ying Chen, Shan Gao, Han-Suh Koo – Investigate simplification and improvement of current JMVM coding tools (IC and motion skip) – Investigate techniques for single loop decoding to reduce complexity starting with motion skip – Investigate approaches for enhancing MVC coding efficiency using spatial downsampling – Investigate low-complexity methods for mobile stereoscopic 3DTV applications – Investigate other potential approaches to achieving enhanced MVC capability – Coordinate software, test material, and experiment conditions for these techniques – Evaluate performance of enhanced MVC proposals (including CAVLC operation in particular)
9.8 Splicing operation Discussion: [email protected] Chair: Gary Sullivan, Arturo Rodriguez, Sam Narasimhan Mandates: – Study the use of bitstream splicing in applications – Investigate potential needs for SEI data to aid in splicing operations, including consideration of JVT-Z040, JVT-Z041, and JVT-Z042 and the issues raised in their discussion – Study the implications of ITU-T Rec. J.181 and the draft new ITU-T Rec. J.h-dpi – Gather information about activities of other relevant organization regarding the development of specifications relating to bitstream splicing
10 Resolutions reported to WG 11 parent body In addition to requesting approval of the texts described above in section 1.4 (and associated dispositions of WG 11 NB comments and expressions of thanks to WG 11 NBs for their input) and informing WG 11 of the AHGs established as described above in section 9, the following JVT resolutions were reported to the WG 11 parent body:
1. The the JVT and the video subgroup of WG 11 recommended approval of the WG 11 N9577 Report of the WG 11 SVC verification tests and the N9617 liaison statement template for conveying these results with WG 11 liaison letters. 2. The JVT and the video subgroup of WG 11 thanked Technische Universität München for use of its facilities and thanked the following participants for their substantial contributions of effort in the work on the SVC verification test: Vincent Bottreau (Thomson), Christian Keimel (Technische Universität München), Tobias Oelbaum (Technische Universität München), Heiko Schwarz (Fraunhofer HHI), and Mathias Wien (RWTH Aachen University). 3. The JVT and the video subgroup of WG 11 thanked the following companies for their financial support of the SVC verification test: Fraunhofer HHI, Microsoft, Orange, ST Microelectronics, and Vidyo. 4. The JVT and the video subgroup of WG 11, considering the apparent deployment of a significant number of products that support only the coding tool features that are in common between the Baseline, Main and High profiles of ISO/IEC 14496-10 (ITU-T Rec. H.264) Advanced Video Coding, requested NBs to provide comments regarding the potential need for specification of a new AVC "Common profile" consisting of the tool 139 constraints expressed by the syntax element combination of profile_idc equal to 66 with constraint_set1_flag equal to 1. It was suggested that comments should arrive prior to the April 2008 JVT and WG11 meetings. 5. The JVT chairmen proposed to hold the 27th JVT meeting during 23-29 April 2008 under the auspices of the meeting of ITU-T SG 16 in Geneva, CH. Further meetings are expected to be held during 20-25 July 2008 under WG 11 auspices in Hannover, DE; 12-17 October 2008 under WG 11 auspices in Busan, KR; and 27 January – 3 February 2009 under ITU-T SG 16 auspices in Geneva, CH.
Post-meeting note: The plans for the Geneva meeting were subsequently modified to start the JVT meeting on Thursday 24 April 2008 after lunch at 2:30 p.m. and end by lunchtime on Tuesday 29 April 2008, as announced by email to the JVT reflector on 18 February 2008.
11 Attendance Persons registered to attend the meeting, as recorded by a sign-in sheet circulated during the meeting, were the following (124 listed participants): 1. Amon, Peter (Siemens AG) 2. Andersson, Kenneth (Ericsson) 3. Asai, Kohtaro (Mitsubishi) 4. Bandoh, Yukihiro (NTT) 5. Bjøntegaard, Gisle (Tandberg) 6. Bottreau, Vincent (Thomson R&D France) 7. Bruls, Fons (Philips) 8. Budagavi, Madhukar (Texas Inst.) 9. Chen, Weizhong (Huawei Tech.) 10. Chen, Ying (Tampere Univ. Tech.) 11. Chiu, Yi-Jen (Intel) 12. Cho, Sukhee (ETRI) 13. Choe, Yoonsik (Yonsei Univ.) 14. Choi, Hae-Chul (ETRI) 15. Choi, Woongil (Samsung AIT) 16. Chujoh, Takeshi (Toshiba) 17. Cieplinski, Leszek (Mitsubishi Electric) 18. Cock, Jan De (Ghent Univ.) 19. de Casanove (Actimagine) 20. de Haan, Wiebe (Philips) 21. Divorra, Òscar (Thomson) 22. Domański, Marek (Poznań Univ. Tech.) 23. Eleftheriadis, Alex (Layered Media) 24. Fröjdh, Per (Ericsson) 25. Fujii, Toshiaki (Nagoya Univ.) 26. Fuldseth, Arild (Tandberg) 27. Gao, Shan 28. Gomila, Cristina (Thomson) 29. Guleryuz, Onur (Docomo USA Labs) 30. Han, Dong-hoon (Sejong Univ.) 31. Han, Ki Hun (Sejong Univ.) 32. Hannuksela, Miska (Nokia) 33. Horowitz, Michael (CoVi Tech. --> Vidyo) 34. Hsiang, Shih-Ta (Motorola)
140 35. Hu, Yi (Conexant Systems) 36. Husak, Walt (Dolby Labs) 37. Ishtiaq, Faisal (Motorola) 38. Itoh, Takashi (Fujitsu Labs) 39. Jeon, Byeong-Moon (LG Electronics) 40. Jeon, Byeungwoo (SKKU) 41. Jeon, Yongjoon (LG Electronics) 42. Jeong, Jechang (Hanyang Univ.) 43. Jeong, Seyoon (ETRI) 44. Jia, Jie (Sejong Univ.) 45. Jung, Joël (France Telecom R&D) 46. Kang, Jung Won (ETRI) 47. Karczewicz, Marta (Qualcomm) 48. Kim, Dae-Yeon (Sejong Univ.) 49. Kimata, Hideaki (NTT) 50. Klimansewski, Krynxtox (Poznań Univ. Tech.) 51. Kook, Seung Ryong (Kyunghee Univ.) 52. Lainema, Jani (Nokia) 53. Lee, Yung Ki (Sejong Univ.) 54. Lee, Yung-Lyul (Sejong Univ.) 55. Lim, Chong Soon (Panasonic) 56. Lim, Jung Eun (LG Electronics) 57. Lim, Sung Chang (Sejong Univ.) 58. Lin, Sixin (Huawei) 59. Liu, Yingjia (Huawei) 60. Lizcano, Leonardo (Telefonica R&D) 61. Luthra, Ajay (Motorola) 62. Ma, Siwei (Peking Univ.) 63. Masashi, Takahashi (Hitachi) 64. McAdoo, Kyle (Conexant Systems) 65. Moon, Joo Hee (Sejong Univ.) 66. Muczko, Marian (Telekomunikacja Polska) 67. Naito, Sei (KDDI) 68. Narasimhan, Sam (Motorola) 69. Narroschke, Matthias (Panasonic) 70. Nilsson, Mike (BT) 71. Nishi, Takahiro (Panasonic) 72. Oelbaum, Tobias (Tech. Univ. Munich) 73. Oh, Kwan-Jung (GIST) 74. Ohm, Jens-Rainer (RWTH Aachen Univ.) 75. Pandit, Purvin (Thomson) 76. Park, Gwang-Hoon (Kyung Hee Univ.) 77. Park, Hyoung-Mee (Sejong Univ.) 78. Park, Ji Ho (KETI) 79. Park, Jong Tae (Kyunghee Univ.) 80. Park, Joon-young (LG Electronics) 81. Park, Min-Cheol (Sejong Univ.) 82. Park, Min-woo (Kyung Hee Univ.) 83. Park, Seung-Wook (LG Electronics) 84. Pateux, Stephane (Orange - France Telecom) 85. Ridge, Justin (Nokia) 86. Rodriguez, Arturo (Scientific Atlanta / Cisco) 87. Sampedro, Jesus (Polycom) 141 88. Schwarz, Heiko (Fraunhofer HHI) 89. Segall, Andrew (Sharp Labs USA) 90. Sekiguchi, Shun-ichi (Mitsubishi) 91. Senoh, Takanori (Univ. Tokyo) 92. Shim, Seung-yong (Sejong Univ.) 93. Shim, Woo-Sung (Samsung Electronics) 94. Shimizu, Shinya (NTT) 95. Siong, Lee Wei (I2R) 96. Smolić, Aljoscha (Fraunhofer HHI) 97. Suh, Hyungsik (LG Electronics) 98. Sullivan, Gary (Microsoft Corp.) 99. Sun, Huifang (Mitsubishi) 100. Suzuki, Teruhiko (Sony) 101. Tan, Thiow Keng (NTT DoCoMo) 102. Tanizawa, Akiyuki (Toshiba) 103. Tomonobu, Yoshino (KDDI) 104. Ugur, Kemal (Nokia) 105. Um, Gimun (ETRI) 106. Van de Walle, Rik (Ghent Univ.) 107. Vartiainen, Juha (SPS) 108. Vermeirsch, Kenneth (affiliation ?) 109. Vetro, Anthony (Mitsubishi Electric) 110. Wang, Xianglin (Nokia) 111. Wiegand, Thomas (Fraunhofer HHI) 112. Wittmann, Steffen (Panasonic) 113. Won, Kwanghyun (SKKU) 114. Xiong, Lianhuan (Huawei) 115. Yamakage, Tomoo (Toshiba) 116. Yamamoto, Tomoyuki (Sharp) 117. Yamasaki, Takahiro (Oki Electric Industry) 118. Yang, Haitao (Xidian Univ.) 119. Yang, Jungyoup (SKKU) 120. Yann, Bodo (Joost) 121. Yao, Wei (I2R) 122. Yoo, Jeong-Ju (ETRI) 123. Yoo, Young Joe (Sejong Univ.) 124. Zhu, Gang (Tsinghua Univ.)
142 Annex I – Audio report
Source: Schuyler Quackenbush, Chair
1 Opening of the meeting...... 3 2 Administrative matters...... 3 2.1 Communications from the Chair 3 2.2 Approval of agenda and allocation of contributions 3 2.3 Creation of Task Groups 3 2.4 Approval of previous meeting report 3 2.5 Review of AHG reports 3 2.6 Joint meetings 3 2.7 Received National Body Comments and Liaison matters 3 3 Record of AhG meetings...... 3 3.1 AhG Meeting SAOC, Unified Speech and Audio Sunday 1000-1700 3 3.1.1 SAOC 1000-13000...... 3 3.1.2 Unified Speech and Audio 1400-1700...... 5 4 Task group activities...... 6 4.1 Joint meetings and documents from other groups 6 4.2 Task Group discussions 6 4.2.1 MPEG-4 audio, conformance, reference software...... 6 4.2.2 MPEG-D MPS...... 8 4.2.3 MPEG-D SAOC...... 8 4.2.4 MPEG-D Unified Speech and Audio...... 8 5 Meeting deliverables...... 9 5.1 Responses to Liaison and NB comments 9 5.2 Recommendations for final plenary 9 5.3 Establishment of Ad-hoc Groups 9 5.4 Approval of output documents 9 5.5 Press statement 9 6 Future activities...... 9 6.1 Schedule of future meetings 9 6.2 Agenda for next meeting 9 6.3 All other business 9 6.4 Closing of the meeting 9 Annex A Participants...... 10 Annex B Audio Contributions and Schedule...... 11 Annex C Task Groups...... 15 Annex D Output Documents...... 16 Annex E Agenda for the 84th MPEG Audio Meeting...... 18
143 1 Opening of the meeting The MPEG Audio Subgroup meeting was held during the 82nd meeting of WG11, October 22- 26, 2007 in Shenzhen, CN. The list of participants is given in A.
2 Administrative matters 2.1 Communications from the Chair The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary. 2.2 Approval of agenda and allocation of contributions The agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems and MDS to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in B. 2.3 Creation of Task Groups Task groups were convened for the duration of the MPEG meeting, as shown in C. Results of task group activities are reported below. 2.4 Approval of previous meeting report The 82nd Audio Subgroup meeting report was registered as a contribution, and was approved. 2.5 Review of AHG reports There were no requests to review any of the AHG reports. 2.6 Joint meetings There were no joint meetings with Audio over the course of the week. 2.7 Received National Body Comments and Liaison matters The NB Comments and Liaison documents for the meeting that require a response are as shown below. No. Title Response by M15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004] Audio Subgroup
3 Record of AhG meetings 3.1 AhG Meeting SAOC, Unified Speech and Audio Sunday 1000-1700 3.1.1 SAOC 1000-13000 Oliver Hellmuth, FhG, presented m15123 Information and Verification Results for CE Oliver Hellmuth on Karaoke/solo System Improving Johannes Hilpert Performance of MPEG SAOC RM0 Andreas Hölzer Leonid Terentiev Cornelia Falch This notes that RM0 does not provide a very satisfying level of performance for the difficult problem of muting a foreground object as in the Karaoke application. It reviewed the technology proposed as a CE at the previous MPEG meeting. If the Fore Ground Object (FGO) is stereo, it proposes to cascade TTT-1 boxes and shows that such a cascade can be formulated as a TTN-1 box, where N=3 if FGO is mono and N=4 if FGO is stereo. 144 Listening test results were presented, comparing SAOC RM0 and SAOC with the new TTN technology. In global mean performance TTN was better than SAOC RM0 in all tests at the 95% level of significance. Furthermore, for the operating points demonstrated, the SAOC TTN technology achieving scores that were solidly in the “good” region. Heiko Purnhagen, Dolby Labs, presented m15162 Cross Verification of SAOC CE on Karaoke Jonas Engdegard enhancement This contribution presents a listening test that provided a cross-check on the FhG Karaoke CE. In all cases, the mean performance of the TNN technology was better than the mean performance of RM0 at the 95% level of significance. Henney Oh, LGE, noted that FhG presented no evidence of performance for energy mode, and that there is no basis for incorporating this operating mode into the SAOC WD. The Chair suggested that this could be provided at the next meeting, perhaps even as a collaboration between FhG and LG. The AhG recommends that the Audio Subgroup accept the TTN prediction mode with residual coding into the SAOC WD. Jeongil Seo, ETRI, presented m15144 Consideration on enhanced Karaoke Jeongil Seo processing for stereo FGO Seungkwon Beack Kwang-ki Kim Kyeoungok Kang This contribution notes that the current performance of SAOC RM0 in the karaoke application (i.e. suppression of FGO) has limited quality. ETRI suggest an alternative structure for karaoke/solo modes based on a cascade of OTT boxes in the case of stereo FGO. It further notes that the OTT box required 2 parameters while the TTT box requires 3 parameters ETRI feels that the proposed technology can provide lower complexity and lower bitrate. The Chair welcomed ETRI to proceed with the CE, but noted that the proposed technology provided functionality similar to that of the FhG CE, which is recommended to be accepted into the SAOC WD. Hence there must be a significant increase in performance in order to displace the FhG CE technology. The Chair asked ETRI to give specific estimates of what, if any, resources ETRI might seed from the SAOC sometime during the MPEG week. Henney Oh, LG, presented m15112 Comments on SAOC applications and Henney Oh architectures Yang-Won Jung The contributions makes three suggestions: Downmix preprocessor – it suggests that mono to mono downmix be supported. Binaural transcoder - it suggests incorporating a separate binaural synthesis engine into the SAOC decoder. MBO architecture – it suggests that in the case of Multichannel Background Object (MBO), the downmix should be able to be either mono or stereo. The Chair noted that the suggested modification for binaural transcoding provides no additional functionality as compared to the SAOC and MPEG Surround combination. Oliver Hellmuth, FhG, noted that in real implementation, one is free to optimize the internals relating to how to combine the SAOC and MPEG Surround functionality. The Chair suggested that it may be good to add an informative section to the SAOC specification on how to “collapse” SAOC and MPEG Surround functionalities in the case of a unified implementation. It was agree that interested parties should continue to discuss this contribution and report to the Audio Subgroup mid-week. Osamu Shimada, NEC, presented m15110 A core experiment proposal for an additional Osamu Shimada
145 SAOC functionality of separating real- Toshiyuki Nomura environment signals into multiple objects Akihiko Sugiyama Osamu Hoshuyama The contribution notes that SAOC does not provide information on the nature or relationship of the multiple objects in the SAOC bitstream such that the decoder can meaningfully decode and place objects in a multi-channel presentation. Oliver Hellmuth, FhG, asked whether the current SAOC architecture with the addition of metadata that indicates that two objects are related (e.g. from the same microphone) could provide the same functionality. The Chair asked if NEC might clarify why the proposed technology (System 4) does not show significant improvement over what can be provided by the existing SAOC architecture (System 3). In conclusion, the Chair suggested that NEC have discussions with interested parties during the first part of the MPEG week and make a mid-week presentation that addresses the issues raised. 3.1.2 Unified Speech and Audio 1400-1700 Kristofer Kjörling, Dolby, presented m15158 Homework according to the joint speech and Kristofer Kjörling audio workplan Heiko Purnhagen This contribution reports the information requested in “Workplan for Candidate Test Items.” It did not find permission information on the item from NRSC, but did give information on where to get the DC associated with other items. Schuyler Quackenbush will contact David Layer, NRSC, to ask if MPEG can get access to this item. In addition, it presented a table that recommends the downmix, as L or (L+R)/2 and level adjustment, based on subjective evaluation. Schuyler Quackenbush, Audio Research Labs, presented m15095 Collected Set of Possible Evaluation S. Quackenbush Guidelines This contribution is merely the collection of text from various audio experts that was available on the Friday of the 82nd MPEG meeting. The presenter highlighted area in which a choice of methods must be made, but asked that discussion be deferred as the remaining contributions will a provide better vehicle for discussion. Werner Oomen, Philips, presented m15155 Evaluation criteria and test items for unified Werner Oomen speech and audio coding Erik Schuijers This contribution covers four topics Derivation of VC – for each item and each operating point a VC is selected. Candidate test items – remove items that might duplicate the effect of oncatenated test items. Figure of Merit – system of assigning points. Item Selection – to select a representative subset of the 38 items, as two sets: most critical items and items that are coded with very good performance The contribution presented the results of applying the item selection procedure using testing at 32 kb/s. Kristofer Kjörling, Dolby, presented m15160 Thoughts on evaluation criteria for joint Kristofer Kjörling speech and audio workitem Heiko Purnhagen This contribution covers five topics Derivation of VC – for each item, each operating point and each test site, a VC is selected. Figure of Merit – which operating points are evaluated, and how do we pick a winner. 146 Candidate all test items to make a single item to code – this prevents the opportunity of: o Per-item tuning o Bit buffer abuse Speech to Music transition – such items should be removed from the test, in that grading is difficult in that case that e.g. speech is handled well and music is not. Dolby endorses the notion of using items such as the “classic” 12 MPEG items for the speech and audio process, as these are difficult and diverse items that span a large space of possible encoder “tunings.” The Speech and Audio test set should be known at the close of the April MPEG meeting.
Johannes Boehm, Thomson, presented m15145 Thoughts on Speech and Audio Evaluation Oliver Wuebbolt Guidelines Johannes Boehm The contribution shows a method to combine the variances of a given system under test over all test sites. It recommends that the Evaluation Guidelines document Take care when building a measure of variance or use in determining 95% CI on a global mean performance Specify in advance what your information might be when you must “consider additional information” in order to choose a best system when the Figure of Merit fails to decide a winner. Miyoung Kim, Samsung, presented m15118 Comments on Unified Speech and Audio CfP Miyoung Kim Evaluation Guidelines Eunmi Oh JungHoe Kim The contribution proposes to Determine VC by pooling over all test sites Requirements – at 64 kb/s pool over all signal categories to get a single mean performance The Chair noted that pooling over all signal categories will result in a smaller confidence interval for that one score and thus may make the proposed 64 kb/s requirement more difficult to fulfil. Markus Multrus, FhG, presented m15165 Comments on Speech and Audio Evaluation Ralf Geiger Guidelines Markus Multrus Bernhard Grill The contribution raises a number of issues Confidence intervals on the grand mean performance should be used when comparing the performance of systems under test. The “winner” amongst systems with overlapping confidence intervals should be selected by considering additional information such as: Operation at higher bitrates, e.g. 128 kb/s That re-use of existing MPEG technology is desirable Miyoung Kim, Samsung, noted that it is undesirable to delay the selection process by running another listening test to get additional information. Anisse Taleb, Ericsson, stated that we cannot ask for subjective performance information at 128 kb/s because that operating point is not listed in the Call, and the Chair agreed with that statement. Ralf Geiger, FhG, noted that in a deadlocked situation an additional listening test may be the quickest way to resolve the deadlock. Schuyler Quackenbush, Audio Research Labs, presented m15096 Draft Workplan for Testing of SA Proposals S. Quackenbush This is a skeleton for the final workplan document. The presenter asked that interested audio experts please read and provide comments on components that are missing or could be improved. 147 The Chair presented the AhG report, which was approved the AhG members present.
4 Task group activities 4.1 Joint meetings and documents from other groups There were not joint meetings. 4.2 Task Group discussions 4.2.1 MPEG-4 audio, conformance, reference software Markus Schnell, FhG, presented m15151 Update on AAC-ELD Verification Test Markus Schnell Ralf Geiger This contribution is a draft of the AAC-ELD Verification Test Workplan. It proposes two tests, the first “application-driven” which asses performance in application-driven operating points for typical material, and the second “technology-driven” which asses performance over a range of operating points for critical material. All operating modes of AAC-ELD are being tested (e.g. block length, sampling rate). For the first test, contribution proposes to use speech items from a wide range of languages from both male and female talkers. This may be “corrupted” using a set of representative office noise signals which were recorded by FhG. There was much valuable discussion on test items and how to construct test items from the signal toolbox. The Chair urged the Audio Subgroup to help in the task of specifying a process to construct the final test items that represent best practice. Tilman Liebchen, LG, presented m15121 Update of ALS Conformance Tilman Liebchen This contribution reports some bugfixes in the current set of conformance data some new conformance data relating to MP4FF OAFI box Update of ASL Conformance data The contribution proposes to Issue a DCOR on MPEG-4 Conformance to o remove an equation from the spec and instead reference an equation in the MPEG- 4 ALS specification. o Replace incorrect ALS conformance data (due to a bug in the ASL reference software) Conformance data for OAFI to be generated by a tool rather than as a pre-stored waveform. It was the consensus of the Audio Subgroup to issue the DCOR from this meeting and to incorporate the new OAFI conformance data into the AAC-ELD Conformance amendment. Andreas Schneider, Dolby, presented m15161 Proposed correction to PS conformance and Andreas Schneider reference software Heiko Purnhagen This contribution corrects a restriction on how random access points interact with parapeter prediction in the combination of SBR and PS tools. It also corrects a disagreement between the PS specification and the PS reference software, in which the reference software must be corrected. In addition, new conformance bitstreams will be generated that removed the “bug” situation such that the old and new decoder both produces the current reference waveforms It was the consensus of the Audio Subgroup to issue the.DCOR on conformance at this meeting and a WD on an AAC-ELD Reference Software. Pierfrancesco Bellini, University of Florence, presented 148 m15078 Editors Study on ISO/IEC 14496- Pierfrancesco Bellini 4:2004/FPDAM 29, SMR Conformance Paolo Nesi Giorgio Zoia Maurizio Capanai This study presents corrections to the FPDAM text as requested by National Bodies. Ralph Sperschneider, FhG, presented m15180 WD on Audio part of MPEG-4 Conformance Manuela Schinn Ralph Sperschneider Since this is a large document, the presenter urged audio experts to review it as homework. Furthermore, he expects that this rollup of audio-related conformance may be complete at the April MPEG meeting. Takehiro Moriya, NTT, presented m15183 Proposed update of MPEG-4 ALS reference Noboru Harada software for OAFI Takehiro Moriya Yutaka Kamamoto This contribution proposes to add OAFI functionality to the ALS MPEG-4 Reference Software, and notes several bugs in the ALS reference software. It was the consensus of the Audio Subgroup to issue a DCOR on MPEG-4 Reference Software that will include the bugfixes and which will bring the Reference Software in line with the MPEG-4 Specification by incorporating the OAFI code. 4.2.2 MPEG-D MPS Andreas Schneider, Dolby, presented m15154 Update on MPEG Surround Conformance Andreas Schneider This update is summarizes as follows: Conformance text defines 32 sequences These 32 sequences are combined with AAC and HE-AAAC as core coders, giving a total of 64 sequences, of which 42 are available and 38 are cross-checked. It was the consensus of the Audio Subgroup to issue this work as FDAM with an editing period, but with only defining the 21 sequences that are available. 4.2.3 MPEG-D SAOC Yang-Won Jung, LG, presented m15111 A proposed CE on object parameter Yang-Won Jung estimation in SAOC Henney Oh The contribution proposes to modify OLD estimation such that transmition of DMG is not required, thus achieving bitrate savings. Oliver Hellmuth, FhG, noted that if DMG is not transmitted then it is not possible to recover the level (i.e. gain) of an object as input to the SAOC encoder. The Chair urged interested experts to discuss whether DMG in important for certain application scenarios and therefore should be available at the decoder. Jeongil Seo, ETRI, presented m15143 CE on efficient decoding of a controllable Jeongil Seo object and an MBO Seungkwon Beack Kwang-ki Kim Kyoungok Kang This contribution proposes an efficient decoding process for FGO or MBO solo application Leonid Terentiev, FhG, asked for more details on the complexity reduction, which were provided on an additional slide. Oliver Hellmuth, FhG, noted that in a real-world implementation the full machinery of MPEG Surround would not be invoked, and hence the complexity figures presented by ETRI are not realistic.
149 The Chair noted that the CE as proposed delivers only “fair” subjective results on the MUSHRA scale but at the same time appears to also deliver lower complexity. Howerver, compared the the Karaoke CE that was reviewed in the AhG meeting, the ETRI proposal appears to deliver lower quality and lower complexity, which is typically not the basis for accepting new technology. The Chair suggested that ETRI discuss these issues with the SAOC proponents and report back to the group. The task group produced a Workplan for progressing the CE work which had the consensus of all CE participants. 4.2.4 MPEG-D Unified Speech and Audio The task group continued the discussions of the AhG. The Chair proposed two additional pieces of text, one for selection of VC and the other for the Requirements. Subgroup experts gave valuable feedback to correct and clarify the mathematical expressions. The Chair identified remaining open issues with these two excerpts from the Evaluations Guidelines document, and the open issues will be discussed in break out groups. Later in the week, the Chair incorporated the new text into a revised version of the Evaluation Guidelines and added additional text for review. Identified open issues are: Evaluated operating points (i.e. test results) used in the Requirements calculations Evaluated operating points (i.e. test results) used in the Figure of Merit calculations The open issues in the Evaluation Guidelines document were discussed in the task group on Thursday, and several new versions of the document were produced. Friday the task group continued morning at 8AM. The group had the previous evening to review the last version of the Evaluation Guidelines document. The Chair presented that version of the document with the following additional changes and explained his motivation for the new text. Editorial changes e.g to correct nomenclature and clarify the text. Changes to the Requirements and Figure of Merit sections that o Remove the d statistic for a subset of the tests o Re-phrased the text so that the Requirements relate to the performance of the work item. The Chair noted that Samsung objects to the removal of the computation of the d statistic for a subset of the tests. Ericsson objects to the re-phrasing of the Requirements and Figure of Merit text. Considering how much time and effort was spend in the Audio Subgroup discussing this document, and how divergent were many of the expert positions, the Chair declared this document to represent the consensus of the Audio Subgroup.
5 Meeting deliverables 5.1 Responses to Liaison and NB comments The responses to Liaison and NB comments were prepared and approved. 5.2 Recommendations for final plenary The Audio recommendations were presented and approved.
5.3 Establishment of Ad-hoc Groups The following ad-hoc groups were established by the Audio subgroup: No. Title Mtg 9653 AHG on Audio Standards Maintenance No 9654 AHG on Unified Speech and Audio Coding and SAOC Yes
150 5.4 Approval of output documents All output documents, shown in D, were presented in Audio plenary and were approved.
5.5 Press statement There was no Audio contribution to the press statement.
6 Future activities 6.1 Schedule of future meetings Ad Hoc group meetings are indicated in Section 5.3. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting. 6.2 Agenda for next meeting The agenda for the next MPEG meeting is shown in E.
6.3 All other business There was none. 6.4 Closing of the meeting The 83rd Audio Subgroup meeting was adjourned Friday at 12:15, which the Chair noted has to be a record!
151 Annex A Participants
Countr First Name Last Name y Affiliation Pierfrancesco Bellini Italy DSI-UNIFI Johannes Boehm DE Thomson Ti Eu Chan SG I2R Ralf Geiger DE Fraunhofer IIS Bernhard Grill DE Fraunhofer IIS Oliver Hellmuth DE Fraunhofer IIS Yang-Won Jung KR LG Electronics Dong Soo Kim KR LG Electronics Junghoe Kim KR Samsung AIT Mi Young Kim KR Samsung Kristofer Kjörling SE Dolby Terentiev Leonid DE Fraunhofer IIS Tilman Liebchen DE LG Electronics Takehiro Moriya JP NTT Markus Multrus DE Fraunhofer IIS Toshiyuki Nomura JP NEC Takeshi Norimatsu JP Panasonic Henney Oh KR LG Electronics Philips Applied Werner Oomen NL Technologies France Pierrick Philippe FR Telecom R&D Heiko Purnhagen SE Dolby Schuyler Quackenbush USA ARL Coding Andreas Schneider DE Technologies Markus Schnell DE Fraunhofer IIS Jeongil Seo KR ETRI Osamu Shimada JP NEC Ralph Sperschneider DE Fraunhofer IIS Akihiko Sugiyama JP NEC Anisse Taleb SE Ericsson AB Yasuhiro Toguri JP Sony France David Virette FR Telecom R&D Sungyong Yoon KR LG Electronics
152 Annex B Audio Contributions and Schedule
Day / Time Task Group X Sunday 1000-1300 AhG: SAOC m15123 Information and Verification Results for CE on Karaoke/solo Oliver Hellmuth X System Improving Performance of MPEG SAOC RM0 Johannes Hilpert Andreas Hölzer Leonid Terentiev Cornelia Falch m15162 Cross Verification of SAOC CE on Karaoke enhancement Jonas Engdegard X m15144 Consideration on enhanced Karaoke processing for stereo Jeongil Seo X FGO Seungkwon Beack Kwang-ki Kim Kyeoungok Kang m15112 Comments on SAOC applications and architectures Henney Oh X Yang-Won Jung m15110 A core experiment proposal for an additional SAOC Osamu Shimada X functionality of separating real-environment signals into Toshiyuki Nomura multiple objects Akihiko Sugiyama Osamu Hoshuyama 1300-1400 Lunch 1400-1800 AhG: Unified Speech and Audio Coding m15158 Homework according to the joint speech and audio workplan Kristofer Kjörling X Heiko Purnhagen m15095 Collected Set of Possible Evaluation Guidelines S. Quackenbush X m15155 Evaluation criteria and test items for unified speech and audio Werner Oomen X coding Erik Schuijers m15160 Thoughts on evaluation criteria for joint speech and audio Kristofer Kjörling X workitem Heiko Purnhagen m15145 Thoughts on Speech and Audio Evaluation Guidelines Oliver Wuebbolt X Johannes Boehm m15118 Comments on Unified Speech and Audio CfP Evaluation Miyoung Kim X Guidelines Eunmi Oh JungHoe Kim m15165 Comments on Speech and Audio Evaluation Guidelines Ralf Geiger X Markus Multrus Bernhard Grill m15096 Draft Workplan for Testing of SA Proposals S. Quackenbush X Discussion Recommendations and review of AhG Report 1800- Chairs Meeting
153 Monday 0900-1130 MPEG Plenary 1200-1300 Audio Plenary Welcome and comments m15094 82nd MPEG Audio Report S. Quackenbush X m15043 Ad Hoc Group on Audio Standards Maintenance R. Sperschneider X m15044 Ad Hoc Group on Unified Speech and Audio Coding and S. Quackenbush X SAOC Eunmi Oh NB Comments Laison On seamless X M15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004] bitstream splicing.
Generate a Liaison statement to ETSI TC DECT to say that X NXXXX - AAC-ELD specification is attached - Verification performance data available in April 1300-1400 Lunch 1400-1730 MPEG-4 m15151 Update on AAC-ELD Verification Test Markus Schnell X Ralf Geiger m15121 Update of ALS Conformance Tilman Liebchen X m15161 Proposed correction to PS conformance and reference Andreas Schneider X software Heiko Purnhagen m15078 Editors Study on ISO/IEC 14496-4:2004/FPDAM 29, SMR Pierfrancesco Bellini X Conformance Paolo Nesi Giorgio Zoia Maurizio Capanai m15180 WD on Audio part of MPEG-4 Conformance Manuela Schinn X Ralph Sperschneider m15183 Proposed update of MPEG-4 ALS reference software for Noboru Harada X OAFI Takehiro Moriya Yutaka Kamamoto 1730-1800 MPEG Surround m15154 Update on MPEG Surround Conformance Andreas Schneider X
Tuesday 0900-1000 SAOC m15111 A proposed CE on object parameter estimation in SAOC Yang-Won Jung X Henney Oh m15143 CE on efficient decoding of a controllable object and an MBO Jeongil Seo X Seungkwon Beack Kwang-ki Kim Kyoungok Kang
154 1000-1300 Unified Speech and Audio Coding Task group activities Improve nomenclature of VC definition Discuss Requirements 1300-1400 Lunch 1400-1800 Task Group Activities 1900- Chairs
Wednesday 0900-1100 MPEG Plenary 1130-1300 Task Group Activities Report on Tuesday’s Chairs meeting S+A show of hands on FoM operating point statistics discussion on Requirements tests 1300-1400 Lunch 1400-1800 Task Group Activities AAC-ELD Verification Test Construction of test items from “toolbox” S+A Evaluation Workplan for test item selection Workplan for S+A Evaluation Test 1800-2030 Social
Thursday 0900-1300 Task Group Activities S+A Evaluation Workplan for test item selection Workplan for S+A Evaluation Test 1300-1400 Lunch 1400-1800 Task Group Activities M15072 Liaison Statement from ITU-T SG 9 [SC 29 N 9004]
Review Liaison statements 1800- Chairs Meeting
Friday 0800-0900 Unified Speech and Audio Coding Evaluation Guidelines 0900-1300 Audio plenary Recommendations for final plenary X Establishment of new Ad-hoc groups X
155 AhG Mandates X Get document numbers X 1000 Approve Responses to NB comments X 1030 Approval of output documents X Review of Audio presentation to MPEG plenary X Agenda for next meeting X A.O.B. X Closing of the Audio meeting X 1300-1400 Lunch 1400- MPEG Plenary
156 Annex C Task Groups
1. MPEG-D Unified Speech and Audio Coding 2. MPEG-D SAOC 3. MPEG-D MPS 4. MPEG-4 audio, conformance, reference software
157 Annex D Output Documents No. Title TBP Available 14496-3 Audio 9619 Workplan for AAC-ELD Verification Test No 08/01/18 No. Title TBP Available 14496-4 Conformance testing 9620 DoC on ISO/IEC 14496-4:2004/FPDAM 20, SLS Conformance No 08/01/18 9621 ISO/IEC 14496-4:2004/FDAM 20, SLS Conformance No 08/03/14 9622 ISO/IEC 14496-4:2004/AMD 11/DCOR 3, Parametric Stereo No 08/01/18 9623 ISO/IEC 14496-4:2004/AMD 19/DCOR 1, ALS No 08/01/25 ISO/IEC 14496-4:2004/AMD XX, WD on AAC-ELD, OAFI and No 08/01/18 9624 additional AAC Conformance 9625 DoC on ISO/IEC 14496-4:2004/FPDAM 29, SMR Conformance No 08/01/18 9626 ISO/IEC 14496-4:2004/FDAM 29, SMR Conformance No 08/01/18 9627 MPEG-4 Audio Conformance Rollup No 08/01/18 No. Title TBP Available 14496-5 Reference Software 9628 ISO/IEC 14496-5:2001/AMD 10/DCOR 2, ALS No 08/01/25 9629 ISO/IEC 14496-5:2001/AMD XX, WD on AAC-ELD Reference Sw. No 08/01/18 Study on ISO/IEC 14496-5:2001/FPDAM 20, Reference Software No 08/01/18 9630 for MPEG-1/2 Audio in MPEG-4 and BSAC Extensions No. Title TBP Available 23003-1 MPEG Surround DoC on ISO/IEC 23003-1:2006/FPDAM 1, MPEG Surround No 08/01/18 9631 Conformance 9632 ISO/IEC 23003-1:2006/FDAM 1, MPEG Surround Conformance No 08/03/14 9633 Workplan on MPEG Surround Conformance No 08/01/18 DoC on ISO/IEC 23003-1:2006/FPDAM 2, MPEG Surround No 08/01/18 9634 Reference Sw. 9635 ISO/IEC 23003-1:2006/FDAM 2, MPEG Surround Reference Sw. No 08/03/14 No. Title TBP Available 23003-2 SAOC 9636 Status and Workplan on SAOC Core Experiments No 08/01/18 9637 WD on SAOC Text and Reference Software No 08/02/15 No. Title TBP Available Exploration – Unified Speech and Audio coding 9638 Evaluation Guidelines for Unified Speech and Audio Proposals YES 08/01/18 9639 Workplan on Speech and Audio Material Selection No 08/01/18 Draft Workplan on Subjective Testing of Unified Speech and Audio No 08/01/18 9640 Coding Proposals No. Title TBP Available Liaison Statements 9641 Liaison Statement to ETSI TC DECT No 08/01/18 9660 Liaison Statement to ITU-T SG 16 No 08/01/18
158 Annex E Agenda for the 84th MPEG Audio Meeting
Agenda Item 1. Opening of the meeting 2. Administrative matters 2.1. Communications from the Chair 2.2. Approval of agenda and allocation of contributions 2.3. Review of task groups and mandates 2.4. Approval of previous meeting report 2.5. Review of AhG reports 2.6. Joint meetings 2.7. Received national body comments and liaison matters 3. Plenary issues 4. Task group activities 4.1. Spatial Audio Object Coding 4.2. Unified Speech and Audio Coding 4.3. MPEG Maintenance, including MPEG-1, MPEG-2, MPEG-4, SMR and MPEG Surround issues 5. Discussion of unallocated contributions 6. Meeting deliverables 6.1. Responses to Liaison and NB comments 6.2. Recommendations for final plenary 6.3. Establishment of new Ad-hoc groups 6.4. Approval of output documents 6.5. Press statement 7. Future activities 8. Agenda for next meeting 9. A.O.B 10. Closing of the meeting
159 Annex J – 3DG report
Source: Marius Preda, Chair
1 Opening of the meeting
1.1 Approval of the agenda The agenda is approved.
1.2 Goals for the week The goals of this week are: Review FAMC results and edit the FPDAM Review 3DGCM related contributions and edit the Study of CD Review on-going AFX experiments Promote the 3DGC profiles Review contributions on reference software and edit the related output documents Review contributions on conformance and edit the related output documents Review Liaisons to MPEG 3DG Review and promote 3DG related demonstrations Investigate future developments of MPEG 3D Graphics
1.3 Standards from 3DGC Std Pt Edit. Project Description CfP WD CD FCD FDIS PDAM FPDAM FDAM DCOR COR 4 4 2004 Amd.32 FAMC conformance 07/04 07/10 08/04 08/10 4 4 2004 Amd.33 Multiresolution profile 07/04 07/10 08/04 08/10 conformance 4 4 2004 Amd.34 3DGC Model Conf. 08/01 08/04 08/10 4 5 2001 Amd.21 FAMC reference 07/04 07/10 08/04 08/10 software 4 5 2001 Amd.22 3DGC Model RefSoft 06/07 08/01 08/04 08/10 4 16 2006 Amd.1/Cor.1 3DMC ext. corr. 07/10 08/04 4 16 2006 Amd.2 FAMC 07/07 08/01 08/07 4 16 2006 Amd.3 3D Multiresolution 07/07 08/04 07/10 profile 4 11 2006 Amd.xxx Scene partitioning 07/07 08/04 08/10 09/01 4 16 200x 3rd Ed. AFX 08/01 08/10 4 25 200x 3D Graphics 07/04 07/10 08/04 08/10 Compression model 4 16 Low complexity mesh compression
160 1.4 Room allocation 3DGC: Tombak
161 1.5 Allocation of contributions N° Title Schedule Activity D1 Monday D1 D1 MPEG MPEG Plenary 09:00~11:30 General D1 3DG Plenary 3DG General 12:30~13:00 Roll call, Agenda, Goals, FAQ, etc., Marius Preda Status of www.mpeg-3dgc.org/www.mpeg-3dgc.com Patrick Gioia Report of AHG on 3DGC documents, experiments and 15042 Jeong-Hwan Ahn, Nikolce Stefanoski software maintenance Dissemination Karsten Muller, Marius Preda RefSoft Policy Marius Preda D1 Lunch Break 13:00~14:00 D1 FAMC (AMD2) 14:00~15:30 FAMC decoder conformance Khaled Mamou, Titus Zaharia, Françoise m15149 Prêteux
m15150 FAMC integration into the MPEG-4 RefSoft Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux
GNB comments on ISO/IEC 14496-16:2006/PDAM 2 Nikolce Stefanoski m15201 (FAMC) Jörn Ostermann Coffee Break 15:30~16:00 D1 Core Experiments (Low Complexity Mesh Encoding) CE 16:00~17:00 m15153 Low-complexity approach for static mesh compression Khaled Mamou, Titus Zaharia, Marius
162 N° Title Schedule Activity Preda Françoise Prêteux Joint D1 Scene partitioning Joint with Systems (in Systems) meeting 17:00~18:00 D2 Tuesday D2 D2 Core Experiments (Low Complexity Mesh Encoding) 09:00~09:30 xxx Open discussions all Joint Metaverse Joint with Req and Systems (in Systems) 09:30 – 10:00 meeting D2 Core Experiments (Low Complexity Mesh Encoding) 10:00~10:30 xxx Open discussions D2 3DGCM 10:30~11:00 m15085 Blagica Jovanova, Marius Preda, Francoise Software Implementation for P25 Preteux m15086 Blagica Jovanova, Marius Preda, Francoise 3DGC Conformance dataset for P25 Preteux Coffee Break 10:30~11:00 D1 Demo 11:00~11:30 Ivica Arsov, Marius Preda, Francoise m15087 3D graphics player for N93 and N95 Preteux D2 Profiles 11:30~12:00 Multi-Resolution Profile AMD 3 and Conformance Patrick Gioia D2 Lunch Break 12:00~14:00
163 N° Title Schedule Activity D2 Repository and benchmarking 14:00~15:30 Benoit Le Bonhomme m15084 Online platform for 3D graphics compression benchmarking Marius Preda Françoise Preteux Sikyung Kim m15219 Table of 3D models in the MPEG 3DGC repository Euee S. Jang Jeong-Hwan Ahn. KNB Comment on 14496-16:2006/AMD1.Corr1 (3D Mesh Coding m15198 Daiyong Kim. Extension Correction) Euee S. Jang Coffee Break 15:30~16:00 D2 16:00~16:30 Part 16 AMD2 FAMC Editing all FAMC RefSoft and Conformance Editing all D3 Wednesday D3 D3 MPEG MPEG Plenary 09:00~12:00 General D3 Lunch Break 12:00~14:00 D3 3DGC Plenary (Editing of documents) 14:00~18:00 DoC for FAMC all ISO/IEC 14496-16 2nd Ed. AMD1 Cor1 Editing all Coffee Break 15:30~16:00 RefSoftware report for AFX tools Francisco Moran, all Part 16 AMD3 Multiresolution Profile Editing all Multiresolution Profile Conformance all 3DGCM Editing all 3DGCM RefSoft Editing all 3DGCM Conformance Editing all D4 Thursday D4
164 N° Title Schedule Activity D3 3DGC Editing and other issues 09:30~12:00 Joint Scene Partitioning Systems (in Systems Room) 09:30 – 10:30 meeting GNB comment on FAMC Jorn Osterman Review of the CfP for Low complexity mesh encoding Françoise Preteux D3 Lunch Break 12:00~14:00 D3 3DGC Editing and other issues 14:00~18:00 Requirements for Low complexity 3D mesh all compression AFX 3rd Edition all Joint Scene Partitioning Systems (in Systems) 16:00 – 16:30 meeting AFX 3rd Edition all D5 Friday D5 D4 3DG output documents preparation 3DG General 09:00~12:00 AhGs and resolutions all D5 Lunch Break 12:00~14:00 MPEG MPEG Plenary D5 14:00~ General
165 1.6 Attendance list Name Country Company Marius Preda France IT Françoise Prêteux France IT Patrick Gioia France OrangeLabs Francisco Morán Burgos Spain UPM Karsten Muller Germany FHG-HHI Daiyong Kim Korea Hanyang Corey Manders Singapore IIR Farzam Farbiz Singapore IIR Inkwon Kim Korea Varovison Chan-Yang Kim Korea Varovison Dan Cernea Belgium VUB
2 General issues
2.1 General discussion
2.1.1 Reference Software It is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard.
2.1.2 Web site OrangeLabs proposed a new version of the web site, now available at www.mpeg-3dgc.com. The goal of the web site is to disseminate the group activities (documents, software and demonstration), to maintain the FAQ and to be active in providing answers through the use of the Forum. 3DGC contributors are kindly asked to check the web-site and provide comments.
3 AFX (14496-16) related activities
3.1 Experiments
3.1.1 CE1. Mesh Animation Compression Title Low-complexity approach for static mesh compression Khaled Mamou, Titus Zaharia, Marius Preda Authors Françoise Prêteux Summary - TFAN: encoding the connectivity based on triangle fan decomposition of a mesh - advantages: low complexity
166 - comparison of the performances with 3DMC, 6% better in compression performances and 50% in decoding time This technology together with the one presented during the 82nd Meeting shows evidences that mesh compression may be performed with lower complexity than Resolution current tools. A call for proposal will be issued this meeting. The requirements document will be updated to address low complexity.
3.1.2 Frame-based animation compression Title GNB comments on ISO/IEC 14496-16:2006/PDAM 2 (FAMC) Authors Nikolce Stefanoski, Jörn Ostermann The contribution addresses the problem of FAMC compression for temporal scalability. When using the delta prediction mode, it is possible than the variable Summary used for prediction belongs to a frame that is not decoded (due to sub sampling in time). The contribution proposes an identity prediction (as an alternative to existent delta prediction) Resolution: It was identified that prediction always takes place with respect to the static Resolution mesh. To clarify this aspect, the specification was updated with en explicative note.
Title FAMC decoder conformance Authors Khaled Mamou, Titus Zaharia, Françoise Prêteux The contribution proposes a set of MP4 formatted files showing FAMC Summary functionalities and describe the testing condition for them. Resolution Adopt the set of files for conformance
Title FAMC integration into the MPEG-4 RefSoft Authors Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux The contribution presents the implementation of FAMC in IM1 indicating the Summary supported functionalities. A demonstration of the reference software was shown. Adopt the software provided as the RefSoft for FAMC. It is recommended to upload the software on the SVN when the latter will be ready. Resolution Since providing the encoder in source code is one of the conditions of accepting MPEG technologies, it is also requested to FAMC contributors to provide the FAMC encoder.
3.1.3 Scene partitioning SP will be followed as a joint activity between Systems and 3DGC. The technology will be integrated in Part 11.
3.2 Profiles
3.2.1 Proposal for 3D MultiResolution Profile Title Multi-Resolution Profile AMD 3 and Conformance Authors Patrick Gioia Summary The contribution shows the status of Conformance for AMD3. Some editorial
167 changes were performed. Some of the old bitstreams (3DMC and BBA) are not working in the last version of the IM1 player Once the ReferenceSoftware is available on the SVN, INT and Samsung will Resolution verify the broken bitstreams
3.3 Maintenance
3.3.1 3DMC Extension correction for support of multiple attribute per vertex KNB Comment on 14496-16:2006/AMD1.Corr1 (3D Mesh Coding Title Extension Correction) Authors Jeong-Hwan Ahn, Daiyong Kim, Euee S. Jang In order to preserve the backward compatibility, it is proposed to use an existing Summary variable (function_type) and extend its semantics. Resolution Accepted
3.3.2 AFX 3rd Edition The document was updated during the week. However, editing is not finished (an editing period of 2 weeks was accepted).
3.4 Dataset and benchmarking Title Online platform for 3D graphics compression benchmarking Authors Benoit Le Bonhomme, Marius Preda, Françoise Preteux The contribution introduces an online platform able to integrate encoder and decoder libraries for 3D graphics compression. The advantages of using it will Summary be the use of the same hardware for executing the programs, the access to a large database and the rapidness in obtaining the curves and other quantitative measures. Resolution Use the platform for benchmarking the tools submitted for standardization.
Title Table of 3D models in the MPEG 3DGC repository Authors Sikyung Kim, Euee S. Jang The contribution consists in a a table presenting the specificities of each file Summary (attributes per vertex, …) available in the MPEG database. Resolution Upload the table on the repository web site.
3.5 Software Title RefSoftware report for AFX tools Authors Francisco Morán Burgos Summary A document with the status of encoders for all the 3DGC bitstreams Resolution Include the document in the SVN repository, in the Reference Software section
3.6 Promotions Title 3D graphics player for N93 and N95 Authors Ivica Arsov, Marius Preda, Francoise Preteux Summary The contribution presents an implementation of the MPEG-4 3D Graphics
168 player for Symbian 9, able to decode and render static and animated objects Resolution -
3.7 Future
3.7.1 CfP Call for Proposal for Low Complexity 3D Mesh Compression. More information are provided in the output document w9651.
3.7.2 Metaverse A presentation was done in Joint meeting with Systems and Requirements. No resolution yet.
4 3D Graphics Compression Model (14496-25) activities
4.1 Textual specification The text was reviewed and a study was issued as the output document.
4.2 Software and conformance Title Software Implementation for P25 Authors Blagica Jovanova, Marius Preda, Francoise Preteux Contribution on an implementation of the P25 for COLLADA, including the Summary encoder (parser, 3DMCE, BBA, JP2K and GZIP encoders, multiplexer) and the decoder. Resolution Accept the software as the reference software for P25. To upload it on the SVN.
Title 3DGC Conformance dataset for P25 Authors Blagica Jovanova, Marius Preda, Francoise Preteux Contribution on a set of files implementing several functionalities (geometry, Summary texture and animation) Accept the files as Conformance Test for P25. To upload on the Conformance Resolution directory of the SVN.
5 Output documents and Resolutions of 3DGC
5.1 Part 4 Conformance testing
5.1.1 The 3DGC subgroup recommends approval of the following documents No. Title TBP Available 14496-4 Conformance testing Study on PDAM of ISO/IEC 14496-4:2004 AMD32 (FAMC No 08/01/18 9642 Conformance)
169 Study on PDAM of ISO/IEC 14496-4:2004 AMD33 No 08/01/18 9643 (MultiResolution Profile Conformance) 9644 ISO/IEC 14496-4:2004 PDAM 34 (3DGCM Conformance) No 08/01/18
5.1.2 The 3DGC subgroup recommends nominating Mark Callow (HI Corporation) as project editor for ISO/IEC 14496-4:2004/Amd.16.
5.2 Part 5 Reference Software
5.2.1 The 3DGC subgroup recommends approval of the following documents No. Title TBP Available 14496-5 Reference Software 9645 ISO/IEC 14496-5 PDAM 22 (3DGCM RefSoft) No 08/01/18
5.3 Part 16 Animation Framework eXtension (AFX)
5.3.1 The 3DGC subgroup recommends approval of the following documents No. Title TBP Available 14496-16 Animation Framework eXtension (AFX) 9646 Study of ISO/IEC 14496-16:2006/AMD1/DCOR1 No 08/01/18 DoC on ISO/IEC 14496-16:2006/PDAM2 (Frame-based Animated No 08/01/18 9647 Mesh Compression) Text of ISO/IEC 14496-16:2006/FPDAM2 (Frame-based Animated Yes 08/01/18 9648 Mesh Compression) 9649 WD2.0 of AFX 3rd Edition No 08/02/01 9650 Requirements for low-complexity 3D mesh compression Yes 08/01/18 9651 CfP for low-complexity 3D mesh compression Yes 08/01/18
5.3.2 The 3DGC subgroup thanks FNB and GNB for their comments on ISO/IEC 14496- 16:2006/Amd.2.
5.3.3 The 3DGC subgroup recommends the publication of all Scene Partitioning related technologies in Part 11 of ISO/IEC 14496 and its removal from Part 16, hence conducting the removal of Part 16 Amd.4.
5.3.4 The 3DGC subgroup thanks Samsung AIT for the creation and maintenance of the first version of the MPEG-3DGC web site and also thanks Orange Labs for taking over this project.
5.4 Part 25 3D Graphics Compression Model
5.4.1 The 3DGC subgroup recommends approval of the following documents No. Title TBP Available 14496-25 3D Graphics Compression Model 9652 Study of CD of ISO/IEC 14496-25 No 08/01/18
170 5.5 Establishment of 3DGC Ad-Hoc Groups N9661 AHG on 3DGC documents and software maintenance Mandate: 1. Coordinate 3DGC related conformance and reference software 2. Maintain and edit 3DGC documents 3. Coordinate editing of the www.mpeg-3dgc.com web-site Chairmen: Patrick Gioia Francisco Morán Burgos Duration: Until 84th Meeting Meetings Sunday before 84th meeting Reflector: mpeg-3dgc AT gti. ssr. upm. es Subscribe: http://www.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
6 Closing of the Meeting See you in Archamps.
171