<<

3

Table of Contents

Guide Map...... 5 Schedule at a Glance...... 9 Monday, July 8, 2019...... 9 Tuesday, July 9, 2019...... 10 Wednesday, July 10, 2019...... 11 Thursday, July 11, 2019 ...... 12 Friday, July 12, 2019...... 13 Welcome Message from the General Chairs...... 14 Welcome Message from the Technical Program Committee Chairs...... 16 Organizing Committee...... 17 Keynote...... 24 K-01: Neural Circuit Plasticity: From Brain Research to Machine Learning and Back...... 24 K-02: AI Ethics: From Principles to Practices...... 25 K-03: Multimedia Driven Precise Medicine...... 26 Academic Panel...... 27 Towards an Excellent Academic Career ...... 27 Industry Panel...... 29 From Papers to Products: Bridging the Gap between Multimedia Research and Practical Applications...... 29 Multimedia Rising Star Panel...... 32 Multimedia Star Innovators...... 34 Multimedia Star Innovator Keynote Highlights...... 34 Multimedia Star Innovator Keynotes...... 36 Grand Challenges...... 37 Grand Challenge Highlights...... 37 G-01: Short Video Understanding Challenge -- Recommending All You Want to See...... 38 G-02: Grand Challenges of 106-p Facial Landmark Localization...... 40 G-03: Learning-Based Image Inpainting...... 42 G-04: Saliency4ASD: Visual attention modeling for Autism Spectrum Disorder...... 44 Tutorials...... 46 T-01: Big Data Intelligence: From Correlation Discovery to Casual Reasoning ...... 46 T-02: Human Behavior Understanding: From Human-Oriented Analysis to Action Recognition...... 47 T-03: Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning ...... 48 T-04: Visual Search and Question Answering ...... 50 T-05: Object Detection Beyond Mask R-CNN and RetinaNet ...... 52

1 IEEE ICME2019

T-06: Computer Vision for Transportation...... 53 T-07: Causally Regularized Machine Learning ...... 55 T-08: Architecture Design for Deep Neural Networks ...... 57 T-09: Intelligent Multimedia Recommendation ...... 59 Oral Sessions...... 61 Best Paper Session...... 61 O-01: Content Recommendation and Cross-modal Hashing...... 62 O-02: Development of Multimedia Standards and Related Research...... 63 O-03: Classification and Low Shot Learning...... 64 O-04: 3D Media Computing...... 65 O-05: Special Session "Pedestrian Detection, Tracking and Re-identification in Videos"...... 66

O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"...... 67 O-07: 3D and Low Level Vision...... 68 O-08: Object Detection I...... 69 O-09: Emerging Applications of Deep Learning...... 70 O-10: Multimedia Quality Assessment and Enhancement...... 71 O-11: Multimedia for Society and Health...... 72 O-12: Immersive Media...... 73 O-13: 3D and Stereo Computing ...... 74 O-14: Machine Learning Applications in Image and Video Coding I...... 75 O-15: Vision, Language and Text Processing...... 76 O-16: Media Classification and Segmentation II...... 77 O-17: AI for Human Understanding ...... 78 O-18: Image Quality Metrics ...... 79 O-19: Multimedia Recommendations ...... 80 O-20: Search and Retrieval...... 81 O-21: Media Understanding ...... 82 O-22: Super-resolution and Enhancement...... 83 O-23: Pose and Action Recognition II...... 84 O-24: Image and Video Enhancements I ...... 85 O-25: Face and Person Analysis ...... 86 O-26: Media Classification and Segmentation III...... 87 O-27: Image and Video Enhancements II ...... 88 O-28: Multimedia Learning and Adaptation...... 89 O-29: Person (Re-)Identification and People Detection...... 90

O-30: Multimedia and Language II...... 91 O-31: Multimedia Communications and Localization ...... 92 O-32: Multimedia Security, Privacy and Forensics II...... 93

2 O-33: Multimedia Sensing and Signal Processing ...... 94 O-34: Detection and Recognition ...... 95 O-35: Multi-modal Media Computing and Human-machine Interaction...... 96 Industry Track...... 97 Poster Sessions...... 98 Poster Session 1 & TMM Poster...... 98 P-01: Emerging Multimedia Applications and Technologies...... 98 P-02: Media Classification and Segmentation I...... 100 P-03: Oral-05 to Oral-12 ...... 102 TMM Poster...... 106 Poster Session 2...... 107

P-04: Multimedia Analysis, Search and Recommendation...... 107 P-05: Pose and Action Recognition I...... 109 P-06: Person and Emotion Understanding...... 111 P-07: Best Papers and Oral-01 to Oral-04...... 113 Poster Session 3 & Demo Session 1...... 116 P-08: Multimedia Creation and Enhancement...... 116 P-09: Multimedia and Vision I...... 118 P-10: Oral-17 to Oral-24...... 120 Demo Session 1...... 124 Poster Session 4 & Demo Session 2...... 125 P-11: Multimedia and Language I...... 125 P-12: Advances in Artificial Intelligence...... 127 P-13: Multimedia Security, Privacy and Forensics I...... 129 P-14: Machine Learning Applications in Image and Video Coding II...... 131 P-15: Multimedia and Vision II...... 133 P-16: Oral-13 to Oral-16...... 135 Demo Session 2...... 138 Poster Session 5 & Grand Challenge...... 139 P-17: Multimedia Understanding and Mixed Reality ...... 139 P-18: Media Classification and Segmentation IV ...... 141 P-19: Oral-29 to Oral-35...... 143 Grand Challenge ...... 147 Poster Session 6...... 149 P-20: Multimedia Communications, Networking and Mobility ...... 149

P-21: Object Detection II...... 151 P-22: Artificial Intelligence for Multimedia...... 152 P-23: Multimedia Quality Assessment and Metrics ...... 154

3 IEEE ICME2019

P-24: Oral-25 to Oral-28...... 155 Workshops...... 158 W-01: Multimedia Services and Technologies for Smart-health(MUST-SH)...... 158 W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)...... 160 W-03: Visual Emotion Analysis: Theories and Applications...... 162 W-04: 1st International Workshop on Big Surveillance Data Analysis and Processing...... 163 W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car...... 165 W-06: Information Theory and Multimedia Computing (ITMC)...... 167 W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)...... 170 W-08: Time-sequenced Multimedia Computing...... 172

W-09: Smart Camera Gigavision ...... 174 ( ) W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding...... 176 W-11: AI Technology for Visual Fashion Computing...... 178 W-12: 2nd IEEE International Workshop on Faces in Multimedia(FacesMM)...... 179 W-13: The Third Workshop on Human Identification in Multimedia (HIM)...... 180 Student Program...... 182 Student Career Lunch...... 182 3MT Competition ...... 182 Social Events...... 184 ICME 2019 Reception...... 184 ICME 2019 Student Career Dinner...... 184 ICME 2019 Banquet...... 184 Side Meetings...... 185 Area Chairs...... 186 Technical Program Committee Members...... 189 Sponsors...... 195 Organizational Sponsors...... 196 Whova Event App User Tutorial...... 197

4 Guide Map

5 IEEE ICME2019

6 7 IEEE ICME2019

8 Schedule at a Glance

T: Tutorial W: Workshop K: Keynote O: Oral P: Poster G: Grand Challenges

Posters of papers presented in the oral sessions of the regular program will be presented on the same day in one of the poster sessions

Monday, July 8, 2019

3E 3G 5A 5F 5H 5I 5J

8:30 T-03: Intelligent W-02: International Image Joint Workshop W-04: 1st W-06: T-01: Big Data W-01: Multimedia Enhancement T-05: Object on Multimedia International Information Intelligence - Services and and Restoration Detection Beyond Artworks Analysis Workshop on Theory and From Correlation Technologies - from Prior Mask R-CNN and and Attractiveness Big Surveillance Multimedia Discovery to Casual for Smart-health Driven Model to RetinaNet Computing in Data Analysis and Computing Reasoning (MUST-SH) Advanced Deep Multimedia Processing (ITMC) Learning (MMArt-ACM)

10:00 Coffee break - Meeting room Foyer (5F)

10:30 T-03: Intelligent W-02: International Image Joint Workshop W-04: 1st W-06: T-01: Big Data W-01: Multimedia Enhancement T-05: Object on Multimedia International Information Intelligence - Services and and Restoration Detection Beyond Artworks Analysis Workshop on Theory and From Correlation Technologies - from Prior Mask R-CNN and and Attractiveness Big Surveillance Multimedia Discovery to Casual for Smart-health Driven Model to RetinaNet Computing in Data Analysis and Computing Reasoning (MUST-SH) Advanced Deep Multimedia Processing (ITMC) Learning (MMArt-ACM)

12:00 Lunch Time

13:30 T-02: Human W-05: W-06: Behavior W-01: Multimedia Multimedia T-04: Visual W-03: Visual Information Understanding: T-06: Computer Services and for Robot, Search and Emotion Analysis: Theory and From Human- Vision for Technologies Unmanned Question Theories and Multimedia Oriented Analysis Transporation for Smart-health Aerial Vehicle Answering Applications Computing to Action (MUST-SH) and Driverless (ITMC) Recognition Car

15:00 Coffee break - Meeting room Foyer (5F)

15:30 T-02: Human W-05: W-06: Behavior W-01: Multimedia Multimedia T-04: Visual W-03: Visual Information Understanding: T-06: Computer Services and for Robot, Search and Emotion Analysis: Theory and From Human- Vision for Technologies for Unmanned Question Theories and Multimedia Oriented Analysis Transporation Smart -health Aerial Vehicle Answering Applications Computing to Action (MUST-SH) and Driverless (ITMC) Recognition Car

17:00

18:00 Welcome Reception - Pearl Hall (7F)

21:00 End of day

9 IEEE ICME2019

Tuesday, July 9, 2019 I & 3rd Floor TMM Poster Poster Session 2: Poster Session 1: Understanding & Segmentation I & and Technologies & and Technologies Recommendation & P-03: O-05 to O-12; P-06: Person and Emotion P-02: Media Classification and P-04: MM Analysis, Search and P-04: MM P-01: Emerging MM Applications MM P-01: Emerging P-05: Pose and Action Recognition P-05: Pose and P-07: Best Papers and O-01 to O-04 (7F) Buffet Grand Lunch Ballroom 5DE O-12: Media Media O-04: 3D Immersive Detection I Computing O-08: Object 5BC O-11: O-11: O-03: Vision Learning and Health Low Level for Society Multimedia Classification O-07: 3D and and Low Shot 3HI O-10: O-02: Retail Quality Session Research Standards Multimedia and Related "Multimedia Empowering Experiences" Technologies Technologies Development Enhancement O-06: Special of Multimedia Assessment and End of day 3CD Learning O-01: Content O-09: Emerging O-09: Emerging Tracking and Re- Tracking Applications of Deep Cross-modal Hashing Recommendation and O-05: Special Session "Pedestrian Detection, Coffee break - Meeting room Foyer (3F) Coffee break - Meeting room Foyer (3F) identification in Videos" identification in Break Break 5H TC meeting 3(ICME SC) 5F TC meeting 2(MMSP TC) 2(MMSP 5J TC meeting 1(TMM SC) Openning Auditorium 3F Best Paper Session From Brain Research to Excellent Academic Excellent Career Machine Learning and Back Academic Panel: Towards an Towards Academic Panel: K-01: Neural Circuit Plasticity: 16:30 16:45 17:00 17:45 15:00 15:30 8:15 8:30 9:30 10:00 11:00 11:15 12:30 13:30 14:00

10 Wednesday, July 10, 2019 & & & II & 3rd Floor Coding II & Language I & P-11: MM and P-11: Demo Session 1 Demo Session 2 Poster Session 3: Poster Session 4: in Image and Video Video in Image and P-08: MM Creation P-13: MM Security, P-13: MM Security, P-10: O-17 to O-24; P-16: O-13 to O-16; and Enhancement & P-12: Advances in AI P-15: MM and Vision Vision P-15: MM and P-14: Applications ML P-09: MM and Vision I Vision P-09: MM and Privacy and Forensics I (7F) Buffet Lunch Grand Ballroom 3B Keynotes Multimedia Star Innovator Industry Track 5DE Retrieval and Video and Video O-24: Image O-16: Media Enhancements I Segmentation II O-20: Search and Classification and 5BC Coffee break - Meeting room Foyer (3F) Processing O-15: Vision, O-15: Vision, Recognition II O-19: Multimedia Recommendations Language and Text Language and Text O-23: Pose and Action Pose and O-23: 3HI Coding I Learning O-18: Image O-22: Super- Enhancement resolution and O-14: Machine Applications in Quality Metrics Image and Video Image and Video Break End of day 3CD O-17: AI for Human O-21: Media O-13: 3D and Understanding Understanding Coffee break - Meeting room Foyer (3F) Stereo Computing Break 5I 3MT 3MT Student Student Student Program: Program: Competition Career Lunch Gala Banquet & Student Career Dinner - Grand Ballroom (7F) 6 5H (TCMC) TC meeting 5 5F MMTC) (ComSoc TC meeting 4 5J (TMM EB) TC meeting Practices Highlights Applications Auditorium 3F Industry Panel: From Papers to Multimedia Research and Practical Multimedia Star Innovator Keynote K-02: AI Ethics: From Principles To To AI Ethics: From Principles K-02: Products: Bridging the Gap between 15:00 15:30 17:00 17:45 8:30 9:30 10:00 11:00 11:15 12:30 13:30 14:00 16:30 16:45 17:30 18:00 21:00

11 IEEE ICME2019

Thursday, July 11, 2019 3rd Floor Mixed Reality & Grand Challenge Poster Session 5: Poster Session 6: P-23: MM Quality P-24: O-25 to O-28 P-22: AI for MM & P-22: P-19: O-29 to O-35; and Segmentation IV & and Segmentation IV P-18: Media Classification Assessment and Metrics & P-21: Object Detection II & Networking and Mobility & P-20: MM Communications, P-17: MM Understanding and (7F) Buffet Grand Lunch Ballroom 3B See G-04: Video Video Disorder Spectrum Modeling Landmark for Autism G-01: Short Challenge -- Localization 106-p Facial G-02: Grand Challenges of Understanding Saliency4ASD: Recommending All You Want to Want All You Visual Attention Visual 5DE O-28: O-32: machine Security, Security, Interaction Adaptation Computing Multimedia Multimedia Privacy and Forensics II and Human- O-35: Multi- Learning and modal Media 5BC and Video and Video O-27: Image O-34: Detection and Recognition and Localization Enhancements II Communications O-31: Multimedia Break End of day II III and 3HI O-30: O-33: Sensing and Signal Processing Multimedia Multimedia O-26: Media Classification Segmentation and Language Break Coffee break - Meeting room Foyer (3F) Coffee break - Meeting room Foyer (3F) 3CD O-29: G-03: Analysis Detection Learning- Inpainting and People and Person O-25: Face Person (Re-) Based Image Identification 5H TC meeting 9 (MSA TC) (MSA 5F TC EB) (IEEE meeting 8 MM-MAG 7 5J OC) (ICME 2019/2020 TC meeting Panel Highlights Auditorium 3F Grand Challenge Precise Medicine Multimedia Rising Star K-03: Multimedia Driven 8:30 9:30 10:00 11:00 11:15 12:30 13:30 14:00 15:00 15:30 16:30 16:45 17:00 17:45

12 Friday, July 12, 2019 5J Multimedia (HIM) Multimedia (HIM) Workshop on Faces in Workshop Multimedia (FacesMM) W-11: AI Technology AI for Technology W-11: Visual Fashion Computing Visual on Human Identification in on Human Identification in W-13: The Third Workshop The Third Workshop W-13: The Third Workshop W-13: W-12: 2nd IEEE International W-12: 5I (Gigavision) (Gigavision) W-09: Smart camera W-09: Smart camera W-09: W-10: Cross-media Big W-10: Cross-media Big W-10: Knowledge Understanding Knowledge Understanding Data Analysis for Semantic Data Analysis for Semantic Data 5F Lunch (MMC) (MMC) End of Conference Workshop on Mobile Workshop on Mobile Workshop W-08: Time-sequenced Time-sequenced W-08: Time-sequenced W-08: Multimedia Computing Multimedia Computing Multimedia Computing Multimedia Computing W-07: 6th IEEE International W-07: 6th IEEE International W-07: Coffee break - Meeting room Foyer (5F) Coffee break - Meeting room Foyer (5F) 5DE Recommendation Recommendation Deep Neural Networks Deep Neural Networks T-09: Intelligent Multimedia T-09: Intelligent Multimedia T-09: T-08: Architecture Design for T-08: Architecture Design for T-08: 5BC Machine Learning Machine Learning T-07: Causally Regularized T-07: Causally Regularized T-07: 8:30 10:00 12:00 15:00 18:00 10:30 13:30 15:30

13 IEEE ICME2019

Welcome Message from the General Chairs

On behalf of the Organizing Committee, it is our great pleasure to welcome you to the 2019 IEEE International Conference on Multimedia and Expo (ICME 2019) and the beautiful city of Shanghai. Shanghai is the financial center of China and a popular tourist destination renowned for its historical landmarks such as The Bund, City God Temple and Yu Garden, as well as the extensive and growing Lujiazui skyline. It has been a real honor and privilege to serve as the General Chairs of this conference. Since 2000, ICME has been the flagship multimedia conference sponsored by four IEEE societies: Circuits and Systems, Communications, Computer, and Signal Processing. It serves as a premier forum to promote the exchange of the latest advances in multimedia technologies, systems, and applications from both the research and development perspectives of the four research communities.

ICME 2019 will enable you to enjoy an outstanding program, exchange your ideas with leading researchers in various disciplines of multimedia, and make new friends in the international science community. Some highlights include three Keynote talks on the latest exciting topics of multimedia, ranging from the fundamental topic of human brain analysis to the fast-growing artificial intelligence (AI) applications; a wide range of tutorials and workshops; the best paper session; aca- demic and industry panel discussions; the newly established multimedia star innovator award with its highlight presentations and keynotes; four grand challenges with over 1,000 participants; industrial programs with very exciting demonstrations and talks; a student-focused program, and other exciting technical and social events. The Technical Program Chairs, Marta Mrak (BBC R&D, UK), Jun Wu (Tongji University, China), Zhu Li (University of Missouri, Kansas City, USA) representing the IEEE Signal Processing Society Multimedia Signal Processing Technical Committee (MMSP), Honggang Wang (University of Massachusetts Dartmouth, USA) representing the IEEE Communications Society Multimedia Communications Technical Committee (MMTC), Lei Zhang (Microsoft, USA) representing the IEEE Circuits and Systems Society Multimedia Systems & Applications Technical Committee (MSA), and Roger Zimmermann (National University of , Singapore) repre- senting the IEEE Computer Society Technical Committee on Multimedia Computing (TCMC), put tremendous effort into the creation of an exciting program which is composed of one third of the around 1,000 submitted papers.

Many individuals and organizations contributed to the success of this conference. We would like to acknowledge the efforts of the Panel Session Chairs, Chang Wen Chen (CUHK-SZ, China / SUNY-Buffalo, USA), Chia-Wen Lin (National Tsing Hua University, Taiwan) and Fernando Pereira (Instituto Superior Técnico, Portugal); the Workshop Chairs, Susanne Boll (University of Oldenburg, Germany), Jingdong Wang (Microsoft Research Asia, China) and Z. Jane Wang (University of British Columbia, Canada); the Tutorial Chairs, Jiebo Luo (University of Rochester, USA) and Zheng-Jun Zha (University of Science and Technology of China, China); the Special Session Chairs, Junwei Han (Northwestern Polytechnical University, China) and Enrico Magli (Politecnico di Torino, Italy); the Grand Challenge Chairs, Gene Cheung (York University, Canada) and Jiaying Liu (Peking University, China); the Award Chairs, Mei-Ling Shyu (University of Miami, USA) and Yonggang Wen (Nanyang Technological University, Singapore); the Industrial Program Chairs, Liang Lin (Sun Yat-Sen University, China), Chonggang Wang (InterDigital, USA) and Xiaoqing Zhu (Cisco, USA); the Student Program Chairs, Weiyao Lin (Shanghai Jiao Tong University, China), Xiaoyan Sun (Microsoft Research Asia, China) and Shaoen Wu (Ball State Universi- ty, USA); the Demo Chairs, Yu-Gang Jiang (Fudan University, China), Cong Shen (University of Science and Technology of China, China) and Dong Tian (InterDigital, USA); the Web Chairs, Wu Liu (JD AI Research, China) and Dalei Wu (University of Tennessee, USA). Together with the Technical Program Committee, they worked diligently to select papers and speakers that met the criteria of high quality and relevance to various fields within the scope of IEEE ICME. It takes time and effort to review a paper carefully, and every member of the Technical Program Committee is to be commended for his or her contri- bution to the success of this conference. The TPC chairs selected six papers as candidates for the Best Paper Award and these were submitted to the Award Committee and will be presented in the Best Paper Session. The winners will be announced during the banquet of ICME 2019 in Shanghai.

We would like to further extend our appreciation to the Local Chairs, Chong Luo (Microsoft Research Asia, China), Hanli Wang (Tongji University, China) and Dan Zeng (Shanghai University, China); the Sponsorship Chairs, Le Dong (Univ of Electronic Science and Technology of China, China), Nian Tong (University of Science and Technology of China, China), Junsong Yuan (State University of New York, Buffalo, USA) and Yongdong Zhang (University of Science and Technology of China, China); the Publication Chairs, Qi Tian (University of Texas at San Antonio, USA), Rui Wang (Tongji University, Chi- na) and Jian Zhang (University of Technology Sydney, Australia); the Publicity Chairs, Wen-Huang Cheng (National Chiao Tung University, Taiwan), Richang Hong (Hefei University of Technology, China), Shiwen Mao (Auburn University, USA) and Shui Yu (University of Technology Sydney, Australia); the Finance Chairs, Chengcui Zhang (University of Alabama at Birmingham, USA) and Dongdong Zhang (Tongji University, China); and the Registration Chairs, Dong Liu (University of Science and Technology of China, China), Haoqi Ren (Tongji University, China) and Liquan Shen (Shanghai University, Chi- na).

The conference would not have been possible without the dedication and the hard work of all members of the Organizing Committee. In addition to members of the Organization Committee, many volunteers have contributed to the success of the

14 conference. Volunteers helped in editing this conference booklet, and helped with local arrangements and on-site setups, and many other important tasks. While it is difficult to list all their names here, we would like to take this opportunity to sincere- ly thank them all.

Special thanks to our keynote speakers, Nozha Boujemaa (MEDIAN Technologies, France), Mu-Ming Poo (Institute of Neu- roscience, Chinese Academy of Sciences, China), and Harry Shum (Microsoft, USA). We greatly value their participation and look forward to their insightful vision and thoughts. Our thanks also go to all invited speakers for tutorials, panels, work- shops, rising star forum, grand challenges, and hands-on expos.

We are very grateful to the academic panelists, Frederic Dufaux (Paris-Sud University, France), Abed El Saddik (University of Ottawa, Canada), Lina Karam (Arizona State University, USA), Jay Kuo (University of Southern California, USA), Yong Lian (University of Singapore, Singapore), Dapeng Wu (University of Florida, USA), and the industry panelists, Xinxin Gao (The Jiangmen, China), Nozha Boujemaa (MEDIAN Technologies, France), Xian-Sheng Hua (DAMO Academy/Al- ibaba Cloud, China), Wenjun Zeng (Microsoft Research Asia, China), Marta Mrak (BBC, UK), Qibin Sun (University of Science and Technology and Founder of Xietong Info-Tech Pte. Ltd). Special thanks to Chang Wen Chen (CUHK-SZ, China / SUNY-Buffalo, USA) who set up the selection procedure of newly established Multimedia Rising , and the pan- elists, Yi-Hsuan Yang (Academia Sinica, Taiwan), Jiwen Lu (Tsinghua University, China), Weiyao Lin (Shanghai Jiao Tong University, China), Lu Fang (Tsinghua University, China).

We are very grateful to members of the Multimedia Star Innovator Award Board, Alex Acero, (Senior Director, Siri at Apple, USA), Nikhil Balram (Sr. Director of Engineering, AR/VR at Google, USA), Hanno Basse (CTO, 20th Century Fox, USA), Achin Bhowmik (CTO & Executive VP Engineering, Starkey Hearing Tech, USA), Nozha Boujemaa (Chief Science & Inno- vation Officer, Median Technologies, France), Baining Guo (Assistant Managing Director, Microsoft Research Asia, China), Ramesh Jain (Professor, UC Irvine, USA), Kevin Jou (CTO, MediaTek, China), Chuen-Chien Lee (SVP, Sony Corporation of America), Shipeng Li (VP, iFlyTek, China), Pieter J. Mosterman (Chief Scientist and Director, The Mathworks, USA), Yong Rui (CTO, Lenovo, China), Anthony Vetro (VP, MERL, USA), Susie Wee (SVP, CISCO, USA), and Bowen Zhou (VP, JD.com, China). The Multimedia Star Innovator Award was initiated this year to recognize pioneers of transformative tech- nologies and business models in areas within the technical scope of IEEE ICME. The Award showcases innovations that have had great impact on human experiences with technology or are anticipated to do so in the near future. We received fourteen nominations, out of which four finalists were selected through voting by the Award Board. The four finalists will be compet- ing on-site at ICME 2019 for the Multimedia Star Innovator Award. Conference attendees can vote for the top winner with the deadline to cast all votes before the banquet. The top-voted finalist will be announced at the banquet.

We are grateful to the strong support of the ICME Steering Committee, the four sponsoring societies and respective Techni- cal Committees. ICME is unique because of the joint support of these four societies, and we are honored to serve as general co-chairs for such a unique interdisciplinary conference. We would also like to thank our industrial sponsors, including Kuaishou, JD.COM, MEGVII, iQIYI, Microsoft, DiDi, Alibaba, Horizon Robotics, Baidu, Lenovo Research, Unisound, SenseTime, and The Jiangmen. Last but not least, we would like to extend our most sincere congratulations to all authors and speakers for a job well done. We would also like to acknowledge the exhibitors for their supports and contributions to the ICME 2019 program.

We look forward to welcoming you in person in Shanghai and we hope that you will enjoy ICME 2019 and the beautiful summer of Shanghai!

General Chairs

Lina J. Karam

Arizona State University, USA

Tao Mei

AI Research of JD.COM, China

Feng Wu

University of Science and Tech of China, China

15 IEEE ICME2019

Welcome Message from the Technical Program Committee Chairs

On behalf of the ICME 2019 Technical Program Committee (TPC), we are delighted to welcome you to Shanghai! During its 20 years of history, ICME has been presenting advances in the field of multimedia from various IEEE research communities, which this year resulted in a record popularity according to the number of submissions to the main conference track. This has been achieved by strong engagement of its four IEEE sponsors: the IEEE Signal Processing Society, IEEE Communication Society, IEEE Circuit and System Society and IEEE Computer Society.

9-11 July 2019 are the core days of this year’s conference. During these three days, the program of each morning is organized into a single track starting with a keynote talk from the most distinguished experts in our communities. Keynote talks will be followed by a comprehensive programme which includes sessions for Best Papers, the Rising Star Program, and many more. Each afternoon will be busy with numerous parallel sessions composed of accepted submissions, including two Special Ses- sions.

One day before and one day after the core conference days, you will have the opportunity to attend workshops which are traditionally held in conjunction with ICME. This year we have the pleasure to bring to you 13 workshops, covering recent developments in visual fashion computing, visual emotion analysis and smart-health technologies, among other emerging topics.

This year is special for ICME. In addition to a notable 20th anniversary, we are pleased to report a record high number of submissions to the conference main track. From more than 1,000 papers submitted to the conference, approx. 30% were accepted (313 papers). The Technical Program Co-chairs recruited 72 Area Chairs who first of all assisted in the recruitment of reviewers to cover 32 distinctive subject areas. This year the selection of subject areas mainly reflected the growing use of machine learning and artificial intelligence in multimedia applications. Out of 32 subject areas, 15 covered various topics of deep learning and artificial intelligence for multimedia, which were then addressed by approx. 1/2 of all submissions. The next most popular topic was Multimedia and Vision, with approx. 1/8 of the submissions to this area.

With a target to have at least three high quality reviews for each paper, 617 reviewers provided on average 3.85 reviews per paper, with 99% of papers receiving 3 or more reviews. Finally, based on the obtained reviews, our Area Chairs provided recommendations for each paper so that the difficult task of making decisions for acceptance could be performed. Finally, the TPC recommended papers for 36 oral and 18 poster sessions. Amongst a number of highly rated manuscripts, 6 of the very best papers have been shortlisted for awards, where the final selection will be decided during the conference in association with the Awards Chairs.

Our gratitude goes to the Area Chairs and the reviewers whose technical expertise and dedication were not only thorough and crucial for the technical assessment of the selection of papers, but also inspirational in making the whole process even more pleasurable. We will recognise those colleagues who made the most valuable contributions with special awards for Area Chairs and reviewers.

Lastly, we thank the General Chairs Tao Mei, Feng Wu and Lina Karam as well as ICME Steering Committee Chair Yap- Peng Tan for their patience and guidance. Many thanks also to all the members of the Organizing Committee for their full support in preparation of the conference. Finally, we would like to thank our authors from all over the world whose valuable and novel contributions are essential for both the continued success of ICME and the advancement of technology for humani- ty.

ICME 2019 Technical Program Committee Co-Chairs Marta Mrak, BBC R&D, UK

Jun Wu, Tongji University, China

Honggang Wang, University of Massachusetts Dartmouth, USA

Roger Zimmermann, National University of Singapore, Singapore

Zhu Li, University of Missouri, Kansas City, USA

Lei Zhang, Microsoft, USA

16 Organizing Committee

General Chairs

Lina J. Karam Tao Mei Feng Wu

Arizona State University, USA JD AI Research, China University of Science and Technology of China, China

Program Chairs

Jun Wu Marta Mrak Zhu Li

Tongji University, China BBC R&D, UK University of Missouri, Kansas City, USA

Honggang Wang Lei Zhang Roger Zimmermann

University of Massachusetts Microsoft, USA National University of Singapore, Dartmouth, USA Singapore

17 IEEE ICME2019

Panel Chairs

Chang Wen Chen Chia-Wen Lin Fernando Pereira

CUHK-SZ, China / National Tsing Hua Instituto Superior Técnico, SUNY-Buffalo, USA Portugal University, Taiwan

Workshop Chairs

Susanne Boll Jingdong Wang Z. Jane Wang

University of Oldenburg, Microsoft Research Asia, University of British Columbia, Germany China Canada

Tutorial Chairs

Jiebo Luo Zheng-Jun Zha

University of Rochester, USA University of Science and Technology of China, China

18 Special Session Chairs

Junwei Han Enrico Magli

Northwestern Polytechnical Univer- Politecnico di Torino, Italy sity, China

Grand Challenges Chairs

Gene Cheung Jiaying Liu

York University, Canada Peking University, China

Award Chairs

Mei-Ling Shyu Yonggang Wen

University of Miami, USA Nanyang Technological University, Singapore

19 IEEE ICME2019

Industrial Program Chairs

Liang Lin Chonggang Wang Xiaoqing Zhu

Sun Yat-Sen University, China InterDigital, USA Cisco, USA

Student Program Chairs

Weiyao Lin Xiaoyan Sun Shaoen Wu

Shanghai Jiao Tong University, China Microsoft Research Asia, Ball State University, USA China

Poster/Demo Chairs

Yu-Gang Jiang Cong Shen Dong Tian

Fudan University, China University of Science and Tech- InterDigital, USA nology of China, China

20 Web Chairs

Wu Liu Dalei Wu

JD AI Research, China University of Tennessee, USA

Local/Event Chairs

Chong Luo Hanli Wang Dan Zeng

Microsoft Research Asia, China Tongji University, China Shanghai University, China

Sponsorship Chairs

Le Dong Nian Tong Junsong Yuan

University of Electronic Science and University of Science and State University of New York, Buffalo, Technology of China, China Technology of China, China USA

21 IEEE ICME2019

Yongdong Zhang

University of Science and Tech- nology of China, China

Publication Chairs

Qi Tian Rui Wang Jian Zhang

University of Texas at San Tongji University, China University of Technology Sydney, Antonio, USA Australia

Publicity Chairs

Wen-Huang Cheng Richang Hong Shiwen Mao

National Chiao Tung University, Hefei University of Auburn University, USA Taiwan Technology, China

22 Shui Yu

University of Technology Sydney, Australia

Finance Chairs

Chengcui Zhang Dongdong Zhang

University of Alabama at Birming- Tongji University, China ham, USA

Registration Chairs

Dong Liu Haoqi Ren Liquan Shen

University of Science and Tongji University, China Shanghai University, China Technology of China, China

23 IEEE ICME2019

Keynote

Tuesday, July 9, 2019

K-01: Neural Circuit Plasticity: From Brain Research to Machine Learning and Back

Time: 8:30 - 9:30 AM

Room: Auditorium 3F

Speaker: Mu-Ming Poo Institute of Neuroscience, Chinese Academy of Sciences & CAS Center for Excellence in Brain Sci- ence and Intelligence Technology, China

Chair: Feng Wu University of Science and Technology of China, China

Abstract

The most important feature of the brain is the plasticity of neural circuits, i.e., the structure and function of neural circuits could be modified by experience. This plasticity is the basis of most of our cognitive functions, such as sensory perception, multisensory integration, learning and memory, pattern recognition, attention and decision-making. In this lecture, I will summarize our current concept of neural circuit plasticity, which represents a major achievement of neuroscience in the past decades. First, the architecture of the neural circuits is shaped by experience through experience-induced sculpting (pruning) of connections, a process most prominent in early brain development and continued to a limited extent in the adult brain. Second, the efficiency of signal transmission in specific neural circuits at the junctions (synapses) between nerve cells (neu- rons) could be modified by experience-induced neural activities in a manner that depends on the pattern (frequency and tim- ing) of electrical spikes in the pre- and postsynaptic neurons. Activity-induced long-term potentiation (LTP) and long-term depression (LTD) of existing synapses are the predominant mechanism underlying learning and memory of the adult brain. Concepts of neural circuit architecture and synaptic plasticity had triggered the emergence of efficient machine learning -al gorithms in the past decades, and will continue to inspire the development of new machine learning methods in the future. Conversely, I will argue that findings in the field of machine learning and artificial neural networks could help to facilitate the elucidation of new features and functions of neural circuits in the brain. This was illustrated by our own discovery of back-propagating LTP and LTD in neural circuits, in a line of research inspired by the back-propagation algorithm used in supervised machine learning. Finally, I will summarize our effort in exploring whether higher cognitive functions such as self-awareness may originate from experience-dependent neural circuit plasticity. We discovered that mirror self-recognition, a hallmark of self-awareness known to be limited to humans and great apes, could be acquired by rhesus monkeys following extensive training for visual-somatosensory or visual-proprioceptive association, thus providing a new experimental system for studying neural circuit mechanism and plasticity underlying self-awareness. Thus, crosstalk between researchers in neuro- science and machine learning will trigger new development in both fields. Mu-Ming Poo Institute of Neuroscience, Chinese Academy of Sciences & CAS Center for Ex- cellence in Brain Science and Intelligence Technology, China

Bio: Mu-ming Poo is the founding and current director of Institute of Neuroscience, Chinese Academy of Sciences, director of CAS Center for Excellence in Brain Science and Intelligence Technology, and Paul Licht Distinguished Professor in Biology Emeritus at University of Cali- fornia, Berkeley. He studied physics at Tsinghua University (Taiwan) and received Ph D in bio- physics from Johns Hopkins University. He had served on the faculty of University of California at Irvine, Yale University, Columbia University, and University of California at San Diego, and University of California, Berkeley. He has made seminal contributions in studying neuronal differentiation, axon guidance and synaptic plasticity. He is a member of Academia Sinica, US National Academy of Sciences, Chinese Academy of Scienc- es, and Academy of Science of Hong Kong. He had received Ameritec Prize, Docteur Honoris Causa from Ecole Normale Supérieure, Paris and from Hong Kong University of Science and Technology (2014), P. R. China International Science & Technology Cooperation Award (2005), Qiushi Distinguished Scientist Award (2011), and Gruber Neuroscience Prize (2016). He is currently on the editorial board of more than 10 academic journals, including Neuron, and serve as the Executive Edi- tor-in-Chief for National Science Review.

24 Wednesday, July 10, 2019

K-02: AI Ethics: From Principles to Practices

Time: 8:30 - 9:30 AM

Room: Auditorium 3F

Speaker: Harry Shum Executive Vice President of Microsoft’s Artificial Intelligence and Research Group, USA

Chair: Lina J. Karam Arizona State University, USA

Abstract

Recent achievements and advancements in AI have outpaced what anyone would have thought imaginable even five to ten years ago. For instance, we are fast approaching human parity across many areas of AI — speech, vision, language and knowledge. But many practitioners building AI technology and deploying AI products have not always thought through the societal implications such as fairness and transparency. In this talk, I will discuss how we can best address these societal chal- lenges before the next AI innovation and development cycle. Up to this point, the answer has centered on principles – guide- lines to help companies and countries navigate the complexities and implications of AI. But principles alone are no longer enough—industry, academia and government need to take actions now to move from principles to practices. I will share what we have been practicing in Microsoft AI and Research from doing research in explainable and interpretable AI, to leveraging useful tools like datasheets and checklists commonly used in other industries, to forming an internal AI ethics committee providing guidelines for shipping AI products, to sharing and learning best practices with other companies through the Part- nership in AI.

Harry Shum Executive Vice President of Microsoft’s Artificial Intelligence and Research Group, USA

Bio: Harry Shum is executive vice president of Microsoft’s Artificial Intelligence (AI) and Research group. He is responsible for driving the company’s overall AI strategy and for- ward-looking research and development efforts spanning infrastructure, services, apps and agents. He oversees AI-focused product groups including Bing and Cortana. He also leads Microsoft Research, one of the world’s premier computer science research organizations, and its integration with the engineering teams across the company. Previously, Dr. Shum served as the corporate vice president responsible for Bing search product development from 2007 to 2013. Prior to his engineering leadership role at Bing and online services, he oversaw the research activities at Microsoft Research Asia and the lab’s collaborations with universities in the Asia Pacific region, and was responsible for the Internet Services Research Center, an applied research organization dedicated to advanced technology investment in search and advertising at Microsoft. Dr. Shum joined Microsoft Research in 1996 as a researcher based in Redmond, Washington. In 1998 he moved to as one of the founding members of Microsoft Research China (later renamed Microsoft Research Asia). There he began a nine-year tenure as a researcher, subsequently moving on to be- come research manager, assistant managing director and managing director of Microsoft Research Asia and a Distinguished Engineer. Dr. Shum is an IEEE Fellow and an ACM Fellow for his contributions to computer vision and computer graph- ics. He received his Ph.D. in robotics from the School of Computer Science at Carnegie Mellon University. In 2017, he was elected to the National Academy of Engineering of the United States.

25 IEEE ICME2019

Thursday, July 11, 2019

K-03: Multimedia Driven Precise Medicine

Time: 8:30 - 9:30 AM

Room: Auditorium 3F

Speaker: Nozha Boujemaa Chief Science and Innovation Officer, MEDIAN Technologies, France

Chair: Tao Mei AI Research of JD.COM, China

Abstract:

Multimedia Research studies have yet multiple applications to empower AI technologies deployment in health and precise medicine domain. For example today, clinical decision-making in oncology is often guided by the results of a biopsy, an inva- sive procedure that is potentially at risk for the patient and does not take into account the tumor context as a whole, because the tumors are in essence heterogeneous. One of the major challenges for personalized and predictive medicine is to identify whether or not a patient will respond to treatment. In immuno-oncology, 80% of patients receive treatments to which they are not responders. Using imaging to perform virtual biopsies, we fully analyze the tumor and its environment, non-invasively from medical imaging modalities such as scanner or MRI. This talk will show some examples of how multimodal informa- tion retrieval can identify certain cancer phenotypes by extracting image signatures and mapping tumor heterogeneity. Deep learning will help predict the patient’s prognosis and select and target patients for clinical trials. The great advantage of AI is ensuring patient well-being, reducing the time to market for drug innovations and informing clinical decisions. AI technology robustness is key to achieve trustworthy and responsible AI services in this context.

Nozha Boujemaa Chief Science and Innovation Officer, MEDIAN Technologies, France

Bio: Nozha Boujemaa is a Key Opinion Leader in the field of Artificial Intelligence and data sciences. She is a Director of Research at Inria (the French National Institute for computer science and applied mathematics). She was the scientific Head of the IMEDIA/Inria Research Group (Large Scale Multimedia Content Search) for 10 years before becoming Director of the Inria Saclay Research Center from 2010 to 2015 and Advisor to the Chairman and CEO of Inria in Data Sciences. In 2017, Nozha Boujemaa founded the DATAIA Institute, an interdisciplinary Institute on Data Sciences, Artificial Intelligence & Society, which she will run until the end of 2018. As an expert in Interactive Visual Content Indexing and Retrieval and in unsupervised & semi-supervised learning, Nozha Boujemaa has contributed to the emergence of next-generation large-scale multimedia search engines. She managed several flagship collaborative projects with French and European indus- trials, is the co-author of over 150 publications in peer-reviewed journals and international conferences and has supervised over 25 PhD and master students.

Nozha Boujemaa is Knight of the National Order of Merit and Member of the Board of Directors of Big Data Value Associ- ation (BDVA), Vice-Chair of the Artificial Intelligence High Level Expert Group (AI HLEG) of the European Commission and member of the AI Group of Experts of the OECD (AIGO). Nozha is also International Advisor for Japan Science and Technology Agency Program “Advanced Core Technologies for Big Data Integration” and Senior Scientific Advisor for “The AI Initiative“ (Harvard Kennedy School). She is President of the Scientific Council of the Institute of Technological Research “SystemX” until the end of 2018.

26 Academic Panel

Tuesday, July 9, 2019

Towards an Excellent Academic Career

Time: 11:15 AM - 12:30 PM

Room: Auditorium 3F

Synopsis:

The panelists of this panel are leading academic scholars who have achieved distinguished status in various areas of multi- media research. They are from well-known universities around the world and shall share with the ICME2019 attendants their precious experiences at different stages of academic career, including their successful strategies working towards an excellent academic career. Their experiences shall not only just applicable to those who are pursuing academic career or who intend to pursue academic career, but also closely relevant for those researchers who are working in research labs and corporate R&D divisions. We all look forward to this exciting panel which will definitely benefit to the growth of our next generation young multimedia researchers.

Moderators:

Chang Wen Chen Fernando Pereira

CUHK-SZ, China / Instituto Superior Técnico, SUNY-Buffalo, USA Portugal

27 IEEE ICME2019

Panelists:

Frederic Dufaux Abed El Saddik Lina Karam Universite Paris-Sud, France University of Ottawa, Canada Arizona State University, USA

Jay Kuo Yong Lian Dapeng Wu University of Southern University of Singapore, University of Florida, USA California, USA Singapore

28 Industry Panel

Wednesday, July 10, 2019

From Papers to Products: Bridging the Gap between Multimedia Research and Practical Applications

Time: 11:15 AM - 12:30 PM Room: Auditorium 3F

Synopsis: The ultimate goal of innovations in any field of engineering is to make an impact in practice. The path from brilliant research ideas to their practical applications, however, is almost never straightforward. At times, the barriers to adoption of novel tech- nologies may even originate from non-technical sources. At ICME 2019, we gather a panel of distinguished panelists to share their personal journeys and perspectives in navigating the landscape of applied research. Our panelists draw from their own distinguished careers to discuss how to best foster innovation in industry, for instance, how to select research topics with both practical relevance and intellectual merit. They will also discuss and debate about potential pitfalls to avoid along the journey of innovation.

Moderator: Xinxin Gao CEO, The Jiangmen, China

Bio: Gao Xinxin Vanessa is the co-founder and CEO of Jiangmen. Jiangmen is a venture capital firm focuses on early stage tech companies’ investment, with an innovative model. Jiangmen Runs high-qual- ity AI Chinese researchers, scientists and tech experts’ tech community in China. Jiangmen also partner with over 50+ global Fortunes 500 companies and China leading companies in Retail, Logistic, Finance, Healthcare, Automobile and more, to connect technology application into business scenarios. Before founding Jiangmen, Gao Xinxin Vanessa was the CEO of Microsoft Ventures China. Before that she worked for Microsoft Research and Microsoft China. Gao Xinxin Vanessa received her master degree from INSEAD, and from Tsinghua University.

Panelists: Nozha Boujemaa Chief Science and Innovation Officer, MEDIAN Technologies, France

Bio: Nozha Boujemaa is a Key Opinion Leader in the field of Artificial Intelligence and data sciences. She is a Director of Research at Inria (the French National Institute for computer science and applied mathematics). She was the scientific Head of the IMEDIA/Inria Research Group (Large Scale Multi- media Content Search) for 10 years before becoming Director of the Inria Saclay Research Center from 2010 to 2015 and Advisor to the Chairman and CEO of Inria in Data Sciences. In 2017, Nozha Bou- jemaa founded the DATAIA Institute, an interdisciplinary Institute on Data Sciences, Artificial Intelli- gence & Society, which she will run until the end of 2018. As an expert in Interactive Visual Content In- dexing and Retrieval and in unsupervised & semi-supervised learning, Nozha Boujemaa has contributed to the emergence of next-generation large-scale multimedia search engines. She managed several flag- ship collaborative projects with French and European industrials, is the co-author of over 150 publications in peer-reviewed journals and international conferences and has supervised over 25 PhD and master students. Nozha Boujemaa is Knight of the National Order of Merit and Member of the Board of Directors of Big Data Value Associ- ation (BDVA), Vice-Chair of the Artificial Intelligence High Level Expert Group (AI HLEG) of the European Commission and member of the AI Group of Experts of the OECD (AIGO). Nozha is also International Advisor for Japan Science and Technology Agency Program “Advanced Core Technologies for Big Data Integration” and Senior Scientific Advisor for “The AI Initiative” (Harvard Kennedy School). She is President of the Scientific Council of the Institute of Technological Research “SystemX” until the end of 2018.

29 IEEE ICME2019

Xian-Sheng Hua Head of AI Center/Distinguished Engineer, DAMO Academy/Alibaba Cloud, China

Bio: Xiansheng Hua is now a Distinguished Engineer/VP of Alibaba DAMO Academy. He received the B.S. and Ph.D. degrees in applied mathematics from Peking University in 1996 and 2001, respectively. He joined Microsoft Research Asia in 2001, as a Researcher. He was a Principal Research and a Development Lead in multimedia search with the Microsoft Search Engine in USA, from 2011 to 2013. He was a Senior Researcher with Microsoft Re- search Redmond from 2013 to 2015. He became a Researcher and the Senior Director of the Alibaba Group in 2015, where he is also leading the Visual Computing Team, Search Divi- sion, Alibaba Cloud, and then DAMO Academy. He is currently a Distinguished Engineer/ Vice President of the Alibaba Group, where he is leading a team working on large-scale visual intelligence on the cloud. He has authored or co-authored more than 200 research papers and has filed more than 90 patents. His research interests include big multimedia data search, advertising, understanding, and mining, pattern recognition, and machine learning. He is an IEEE Fellow and an ACM Distinguished Scientist. He was one of the recipients of the 2008 MIT Technology Review TR35 Young Innovator Award for his outstanding contributions on video search. He was also a recipient of the Best Paper Awards at ACM Multimedia 2007, and the Best Paper Award of the IEEE Transactions on Circuits and Systems for Video Technology in 2014. He served as a Program Co-Chair for IEEE ICME 2012, ACM Multimedia 2012, and IEEE ICME 2013. He will be serving as a General Co-Chair of ACM Multimedia in 2020.

Wenjun Zeng Principal Research Manager, Microsoft Research Asia, China

Bio: Wenjun (Kevin) Zeng is a Principal Research Manager and a member of the Senior Lead- ership Team (SLT) at Microsoft Research Asia. He is a Fellow of the IEEE. He has been leading the video analytics research powering the Microsoft Cognitive Services, Azure Media Analytics Services, Microsoft Office, and Windows Machine Learning (ML) since 2014. He was with Univ. of Missouri from 2003 to 2016, most recently as a Full Professor. Prior to that, he had worked for PacketVideo Corp., Sharp Labs of America, Bell Labs, and Panasonic Technology. He received his B.E., M.S., and Ph.D. degrees from Tsinghua Univ., the Univ. of Notre Dame, and Princeton Univ., respectively. Dr. Zeng is on the Editorial Board of International Journal of Computer Vision. He was an Associate Editor-in-Chief of IEEE Multimedia Magazine, an Associate Editor of IEEE Trans. on Circuits & Systems for Video Technology (TCSVT), IEEE Trans. on Info. Forensics & Security, and IEEE Trans. on Multimedia (TMM), and was on the Steering Committee of IEEE Trans. on Mobile Computing and IEEE TMM. He was a Special Issue Guest Editor for the Proceedings of the IEEE, TMM, ACM TOMCCAP, TCSVT, and IEEE Communications Magazine. He served as the Steering Committee Chair of IEEE ICME in 2010 and 2011, and has served as the General Chair or TPC Chair of several IEEE conferences (e.g., ICME2018, ICIP2017). He was the recipient of several best paper awards.

Marta Mrak Lead Research Engineer at BBC, UK

Bio: Marta Mrak is a Lead R&D Engineer at BBC R&D working on video compression, new con- tent experiences and data analytics. She is also an Honorary Professor at Queen Mary University of London (QMUL), Multimedia and Vision Research Group. Within IEEE, Marta is active in numerous activities, e.g. she serves as General Co-Chair of IEEE ICME 2020, Lead TPC Co-Chair for IEEE ICME 2019 and Vice Chair of Technical Committee on Multimedia Signal Processing.

Qibin Sun Professor at University of Science and Technology and Founder of Xietong Info-Tech Pte. Ltd., China

Bio: Dr. Qibin Sun is currently a Professor at University of Science and Technology of China and the Founder of Xietong Info-Tech Pte. Ltd. He was a Senior Manager in HP, and later a Distin- guished Engineer in Cisco. Qibin owns full-stack experiences from research, productization, to commercialization.

30 Lei Zhang Algorithm Scientist at Kuaishou Technology, China

Bio: Dr. Lei Zhang graduated from Institute of Computing Technology of the Chinese Academy of Sciences in 2015 with a PhD degree. Lei’s research primarily focuses on large-scale multi- media retrieval. He has published 11 IEEE journal articles in the field of computer vision and multimedia research. He is the lead author among 7 of the 11 published papers, which have been widely cited and used. After graduation, Lei joined Huawei’s 2012 Hardware Engineering Institute and worked on developing the mobile phone camera algorithm. He participated in the research of Huawei’s flagship mobile phone core camera algorithms including the first binocular flagship P9 binocular fusion, Mate 9 hybrid zoom, and P20 night scene shooting. His research specialization includes mobile phone camera algorithms, image processing and deep learning. Lei joined Kuaishou in 2018, and is currently engaged in researching target retrieval algorithms. He is also responsible for building the Hangzhou AI team in the Multimedia Understanding Department of Kuashou.

31 IEEE ICME2019

Multimedia Rising Star Panel

Thursday, July 11, 2019

Time: 10:00 - 11:00 AM

Room: Auditorium 3F

Synopsis:

This is a unique panel featuring young and accomplished researchers working in emerging areas of multimedia research. The selection process for these Multimedia Rising Stars has been very competitive. Each of four ICME sponsoring IEEE Societ- ies is asked to nominate up to three candidates who have graduated from their PhD within 10 years. The final four stars have been selected after thorough assessment of each candidate collectively by the Panel Chairs. Multiple factors have been care- fully considered, including their scholastic achievements after their PhD degree, their professional service records, and their proposed presentation topics. These emerging topics range from “Machine Learning for Creative AI Applications in Music” to “Deep Metric Learning for Multimedia Content Understanding”, and from “Large-scale Multimedia Semantic Information Extraction and Coding” to “Computational Light Field Imaging and Intelligent Reconstruction.” This Rising Star panel in- deed shall provide a live feast for multimedia researchers at all levels.

Moderators:

Chang Wen Chen Chia-Wen Lin

CUHK-SZ, China / SUNY-Buf- National Tsing Hua Universi- falo, USA ty, Taiwan

Panelist 1: Yi-Hsuan Yang

Talk Title: Machine Learning for Creative AI Applications in Music

Yi-Hsuan Yang

Academia Sinica, Taiwan

32 Panelist 2: Jiwen Lu

Talk Title: Deep Metric Learning for Multimedia Content Understanding

Jiwen Lu

Tsinghua University, China

Panelist 3: Weiyao Lin

Talk Title: Large-scale multimedia semantic information extraction and coding

Weiyao Lin

Shanghai Jiao Tong University, China

Panelist 4: Lu Fang

Talk Title: Multiscale Camera Array for Future Vision Intelligence

Lu Fang

Tsinghua University, China

33 IEEE ICME2019

Multimedia Star Innovators

Wednesday, July 10, 2019

Multimedia Star Innovator Keynote Highlights

Time: 10:00 - 11:00 AM

Room: Auditorium 3F

Chair: Lina J. Karam Arizona State University, USA

* Note: the onsite voting will start from 11:00AM till 5:45PM through the conference mobile app.

Innovation Award Finalists

Achin Bhowmik Chief Technology Officer & Executive VP Engineering, Starkey Hearing Technologies, USA

Bio: Dr. Achin Bhowmik is the chief technology officer and executive vice president of engi- neering at Starkey Hearing Technologies, a privately-held medical devices business with 6,000 employees and operations in over 100 countries worldwide. In this role, he is responsible for overseeing the company’s technology strategy, product development and engineering depart- ments, and is leading the drive to redefine medical wearable devices with advanced sensors and artificial intelligence technologies.

Prior to joining Starkey, Dr. Bhowmik was vice president and general manager of the Perceptual Computing Group at Intel Corporation. There, he was responsible for the R&D, engineering, operations, and businesses in the areas of 3D sensing and interactive computing, computer vision and artificial intelligence, autonomous robots and drones, and immersive virtual and merged reality devices. Previously, he served as the chief of staff of the Personal Computing Group, Intel’s largest business unit with >$30B annual revenues in 2010.

As an adjunct professor and guest lecturer, Dr. Bhowmik advises graduate research and teaches courses on human-computer interactions and perceptual computing technologies at the University of California, Berkeley, Stanford University, Liquid Crystal Institute of the Kent State University, Kyung Hee University, Seoul, and the Indian Institute of Technology, Gandhi- nagar.

Dr. Bhowmik was elected a Fellow of the Society for Information Display (SID). He serves on the board of advisors for the Fung Institute for Engineering Leadership at UC Berkeley, the executive board for SID, and the board of directors for OpenCV. He also serves on the board of directors and advisors for several technology startup companies. He received the Industrial Distinguished Leader Award from the Asia-Pacific Signal and Information Processing Association. He has over 200 publications, including two books and 34 issued patents.

Touradj Ebrahimi Professor, Swiss Federal Institute of Technology (EPFL), Switzerland

Bio: Touradj Ebrahimi received his M.Sc. and Ph.D., both in Electrical Engineering, from the Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, in 1989 and 1992 re- spectively. In 1993, he was a research engineer at the Corporate Research Laboratories of Sony Corporation in Tokyo, where he conducted research on advanced video compression techniques for storage applications. In 1994, he served as a research consultant at AT&T Bell Laboratories working on very low bitrate video coding. He is currently Professor at EPFL heading its Multi- media Signal Processing Group. He is also the Convenor of JPEG standardization Committee. He was also adjunct Professor with the Center of Quantifiable Quality of Service at Norwegian University of Science and Technology (NTNU)between 2008 and 2012.

Prof. Ebrahimi has been the recipient of various distinctions and awards, such as the IEEE and Swiss national ASE award, the

34 SNF-PROFILE grant for advanced researchers, Four ISO-Certificates for key contributions to MPEG-4 and JPEG 2000, and the best paper award of IEEE Trans. on Consumer Electronics . He became a Fellow of the international society for optical engineering (SPIE) in 2003. Prof. Ebrahimi has initiated more than two dozen National, European and International coop- eration projects with leading companies and research institutes around the world. He is a co-founder of Genista SA, a high- tech start-up company in the field of multimedia quality metrics. In 2002, he founded Emitall SA, start-up active in the area of media security and surveillance. In 2005, he founded EMITALL Surveillance SA, a start-up active in the field of privacy and protection. He is or has been associate Editor with various IEEE, SPIE, and EURASIP journals, such as IEEE Signal Pro- cessing Magazine, IEEE Trans. on Image Processing, IEEE Trans. on Multimedia, EURASIP Image Communication Journal, EURASIP Journal of Applied Signal Processing, SPIE Optical Engineering Magazine. Prof. Ebrahimi is a member of Sci- entific Advisory Board of various start-up and established companies in the general field of Information Technology. He has served as Scientific Expert and Evaluator for Research Funding Agencies such as those of European Commission, The Greek Ministry of Development, The Austrian National Foundation for Scientific Research, The Portuguese Science Foundation, as well as a number of Venture Capital Companies active in the field of Information Technologies and Communication Systems. His research interests include still, moving, and 3D image processing and coding, visual information security (rights protec- tion, watermarking, authentication, data integrity, steganography), new media, and human computer interfaces (smart vision, brain computer interface).

He is the author or the co-author of more than 200 research publications, and holds 14 patents. Prof. Ebrahimi is a member of IEEE, SPIE, ACM and IS&T.

Henrique "Rico" S. Malvar Chief Scientist, Microsoft Research, USA

Bio: Henrique (Rico) Malvar was born in Brazil. He got a PhD from MIT in signal processing in 1986. He joined Microsoft Research in 1997, where he currently is the Chief Scientist. Rico and his teams developed new technologies for multimedia compression used in Windows, Of- fice, Xbox, Skype, and Azure. He contributed to standard formats such as G.722.1 and H.264, and was a main architect for the WMA and JPEG XR formats. He also worked on audio signal enhancement and beamforming technologies, used in Windows, Skype, Kinect, and HoloLens. He has authored or co-authored over 160 technical publications and over 120 issued US pat- ents. He is a Fellow of the IEEEE, and received the Technical Achievement Award from the IEEE Signal Processing Society in 2002. He was elected to the US National Academy of Engineering in 2012.

In 2015, Rico founded the Microsoft Research NExT Enable group, which leverages new multimedia interfaces to develop systems and applications to improve the lives of people with disabilities. The group co-developed, with the Microsoft Win- dows engineering team, new eye tracking software interfaces and the new Eye Control user interfaces, which allows a person to have full control of a device running Windows 10 using only their eye movements. The group also developed and shipped the Soundscape application, which uses 3D audio to help people with low or no vision to build a mental map of points of interest, as they walk about in the world, without the need to look at screens. The Enable group works closely with NGOs dedicated to accessibility, such as Tam Gleason, the ALS Association, Guide Dogs for the Blind, and the Lighthouse for the Blind.

Rajan Patel Rajan Patel, Senior Director, Google, USA

Bio: Rajan Patel is a Senior Director leading Augmented Reality engineering teams. Prior to working on AR, he worked on Google’s Search algorithm, leading teams working to im- prove topicality and freshness of search results.

He received a Ph.D. in from the Biostatistics department at Emory University, where he developed statistical methods to analyze the functional connectivity of the brain using func- tional magnetic resonance imaging (fMRI) data.

35 IEEE ICME2019

Aparna Chennapragada Aparna C, Vice President, Google, USA

Bio: Aparna Chennapragada is the Vice President of Augmented Reality and Google Lens. She also serves on the board of Capital One.

Aparna previously worked as a Senior Director and Technical Assistant to the CEO of Google, helping drive company-wide product efforts. She also led Google Now, a proactive digital assis- tant effort, and worked on many areas in Google Search and YouTube over the years. With over 20 years of experience in the tech industry as a computer scientist and product leader, she is ex- cited about the potential of AI and algorithms to build products that improve everyday life.

Wednesday, July 10, 2019

Multimedia Star Innovator Keynotes

Time: 15:30 - 17:30 PM

Room: 3B

Chair: Lina J. Karam Arizona State University, USA

36 Grand Challenges

Thursday, July 11, 2019

Grand Challenge Highlights

Time: 11:15 AM - 12:30 PM

Room: Auditorium 3F

Chair: Gene Cheung York University, Canada

Jiaying Liu Peking University, China

Schedule:

11:15 - 11:20 Opening Remarks

11:20 - 11:35 Grand Challenge: 106-p Facial Landmark Localization

Overview Hailin Shi AI Platform and Research, JD.com, China

Winner Talk

11:35 - 11:50 Grand Challenge: Learning-Based Image Inpainting

Overview Dong Liu University of Science and Technology of China, China

Winner Talk

11:50 - 12:05 Grand Challenge: Short Video Understanding Challenge -- Recommending All You

Want to See

Overview Changhu Wang Bytedance AI Lab, China

Winner Talk

12:05 - 12:30 Grand Challenge: Saliency4ASD: Visual attention modeling for Autism Spectrum

Disorder

Overview Patrick Le Callet University of Nantes, France

Winner Talk for Track 1

Winner Talk for Track 2

37 IEEE ICME2019

Thursday, July 11, 2019

G-01: Short Video Understanding Challenge -- Recommending All You Want to See

Time: 14:00 - 15:00 PM

Room: 3B

Description:

This challenge provides multi-modal video features, including visual features, text features and audio features, as well as user interactive behavior data, such as click, like, and follow. Each participant needs to model the user’s interest through a video and user interaction behavior data set, and then predict the user’s click behavior on another video dataset. The rank of our challenge accords to the model and predicted results submitted by the participants, based on a predefined score specified in the evaluation criteria. Website: http://ai-lab-challenge.bytedance.com/tce/vc/

Organizers:

Changhu Wang Yi Ma Wei-Ying Ma

Bytedance AI Lab, China University of California, Bytedance AI Lab, China Berkeley, USA

38 14:00 - 14:06 OPENING REMARKS

Dr. Changhu Wang

Bytedance AI Lab, China

14:06 - 14:14 ORAL SESSION:

Enhanced Short Video Understanding by Integrating User Behavior and Multimedia

Content Information

Lin Zhu

Ctrip Travel Network Technology Co., Limited, China

14:14 - 14:22 PREDICTING USER BEHAVIOR USING ITEM2VEC WITH FREQUENC

Chun Tao1, Haocheng Xu2, Kang Yang3, Feng Lu4, Xue Du5

1Nanjing Tech University, China, 2University of Electronic Science and Technology of China, China, 3Sichuan University, China, 4Hisense, China, 5Chongqing Univers -ty of Posts and Telecommunications, China

14:22 - 14:30 TRUNCATED SVD-BASED FEATURE ENGINEERING FOR SHORT VIDEO

UNDERSTANDING AND RECOMMENDATION

Tsun-Hsien Tang1,2, Kuan-Ta Chen2, Hsin-Hsi Chen1,3

1National Taiwan University,Taiwan, 2Academia Sinica, Taiwan, 3MOST Joint Research Center for AI Technology and All Vista Healthcare, Taiwan 14:30 - 14:38 SHORT VIDEO CONTENT UNDERSTANDING AND RECOMMENDATION

BASED ON GRADIENT BOOSTING TREE AND DEEP NETWORK

Yaxi Wu1, Majing Lou2, Zhibin Lian3 1JD, China, 2mininglamp technology, China, 3South China Normal University, China

14:38 - 14:46 ENGINEERING IMPLEMENTATION IN ICME GRAND CHALLENGE

Guanhao Cheng, Huiqin Xiao, Jianwei Li, Dongwei Zhao, Xiaosheng Wu

Netease, China

14:46 - 14:54 BUILDING EFFECTIVE SHORT VIDEO RECOMMENDATION

Yang Liu1, Cheng Lyu1, Zhiyuan Liu1, and Dacheng Tao2

1Southeast University, China, 2the University of Sydney, Australia

14:54 - 15:00 AWARDING SESSION

39 IEEE ICME2019

Thursday, July 11, 2019

G-02: Grand Challenges of 106-p Facial Landmark Localization

Time: 15:30 - 16:30 PM

Room: 3B

Description:

As the deep learning methods have been largely developed in facial landmark localization task, the requirements of practical applications are growing fast. However, for large poses and occlusion, the accuracy of localization needs to be improved. Here, JD AI Research and NLPR, CASIA sincerely invited researchers and developers from academia and industry to partici- pate in this competition and encourage further discussion on technical and application issues.

Website: https://facial-landmarks-localization-challenge.github.io

Organizers:

Hailin Shi Xiaobo Wang Xiangyu Zhu

JD AI Platform and Research, JD AI Platform and Research, Chinese Academy of Sciences, China China China

Yinglu Liu Hao Shen

JD AI Platform and Research, JD AI Platform and Research, China China

40 15:30 -15:40 OPENING REMARKS

Dr. Yinglu Liu

JD AI

15:40 - 15:55 FACIAL LANDMARK LOCALIZATION BASED ON AUTO-STACKED HOURGLASS NETWORK AND EXPECTATION CONSENSUS

Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi

Department of Computer Vision Technology (VIS), Baidu Inc, China

15:55 - 16:10 MULTI-SCALE DENSELY U-NETS REFINE NETWORK FOR FACE ALIGNMENT.

Jun Yu1, Haonian Xie1, Guochen Xie1, Mengyan Li1, Zengfu Wang2

1University of Science and Technology of China, China, 2Institute of Intelligent Machines, Chinese Academy of Sciences, China

16:10 - 16:25 IMPROVED HOURGLASS STRUCTURE FOR HIGH PERFORMANCE FA CIAL LANDMARK DETECTION

Shenqi Lai, Zhenhua Chai, and Xiaoming Wei

Vision and Image Center of Meituan, China

16:25 - 16:30 AWARDS

41 IEEE ICME2019

Thursday, July 11, 2019

G-03: Learning-Based Image Inpainting

Time: 16:45 - 17:45 PM

Room: 3CD

Description:

Image inpainting, also known as image completion, is the process of filling-in the missing areas of an incomplete image so that the completed image is visually plausible. While this task is indispensable in many applications, such as disocclusion, object removal, error concealment, and so on, the task is still regarded very difficult thus far. Traditionally, several different approaches have been proposed for image inpainting, including partial differential equation-based inpainting, constrained tex- ture synthesis, structure propagation, database-assisted, and so on.

In recent years, deep learning has revolutionized the research of image inpainting, and a number of deep models have been designed. Nonetheless, the lack of a public, widely acknowledged dataset has been a significant issue in developing advanced, learning-based inpainting solution.

This challenge is meant to consolidate research efforts about image inpainting using learning, especially deep learning ap- proach. We will prepare two tracks: error concealment (EC) and object removal (OR). In the EC track, we simulate the case of transmission error that incurs missing areas (usually square blocks) in a decoded image. In the OR track, we carefully select some objects in an image to be removed, and produce missing areas with irregular shapes. In both tracks we challenge the researchers to inpaint the incomplete image. The major difference between the two tracks is that, in the first track, we want to recover the missing areas so that the completed image is similar to the original (although this can be very difficult!), and in the second track, we are satisfied as long as the completed image is visually plausible and pleasing.

We are aware of a previous competition in conjunction with ECCV 2018, which also addresses the problem of image (and video) inpainting. Different from that competition, in our challenge we evaluate the quality of completed images by both ob- jective metrics (PSNR, SSIM) and subjective evaluation (MOS).

Website: https://icme19inpainting.github.io/

Organizers:

Dong Liu Ming-Hsuan Yang

University of Science and University of California at Technology of China, China Merced, USA

42 16:45 - 16:55 OPENING REMARKS

Dr. Dong Liu

University of Science and Technology of China, China

16:55 - 17:10 INTERLEAVED ZOOMING NETWORK FOR IMAGE INPAINTING

Sen Liu, Zongyu Guo, Jiale Chen, Tao Yu, Zhibo Chen

University of Science and Technology of China, China

17:10 - 17:25 MSMC-NET: IMAGE INPAINTING USING DEEP MULTI-SCALE AND MULTI-CONNECTION NETWORKS

Miaohui Wang, Xiaoming Chen, Weiqian Chen, Yuan Yuan

Shenzhen University, China

17:25 - 17:40 IMAGE INPAINTING UNDER CHESSBOARD-LIKE MASKING

Shiqi Lin, Jilong Liu, Zhiyuan Zhou, Haoran Zhang, Xueliang Liu

Hefei University of Technology, China

17:40 - 17:45 AWARD CEREMONY

43 IEEE ICME2019

Thursday, July 11, 2019

G-04: Saliency4ASD: Visual attention modeling for Autism Spectrum Disorder

Time: 16:45-17:45 PM

Room: 3B

Description:

The purpose of the Grand Challenge Saliency4ASD is to drive efforts of visual attention modeling community towards a healthcare societal challenge. Gaze features related to saccades and fixations have demonstrated their usefulness in the iden- tification of mental states, cognitive processes and neuropathologies (Tseng et al., 2013; Itti, 2015), notably for people with ASD (Autism Spectrum Disorder).

Website:https://saliency4asd.ls2n.fr

Organizers

Guangtao Zhai Zhaohui Che

Shanghai Jao Tong University, Shanghai Jao Tong University, China China

Jesus Guttirez Patrick Le Callet

University of Nantes, France University of Nantes, France

44 16:45 - 16:55 OPENING REMARKS 16:55 - 17:10 SALIENCY PREDICTION VIA MULTI-LEVEL FEATURES AND DEEP SUPERVISION FOR CHILDREN WITH AUTISM SPECTRUM DOISORDER

Weijie Wei1, Zhi Liu1, Lijin Huang1, Alexis Nebout2, Olivier Le Meur2 1Shanghai University, China, 2 University of Rennes 1, France

VISUAL ATTENTION MODELING FOR AUTISM SPECTRUM DISORDER BY U-NET

Yuming Fang, Hanqin Huang, Boyang Wan, and Yifan Zuo Jiangxi University of Finance and Economics, China

PREDICTING SALIENCY MAPS FOR ASD PEOPLE

Alexis Nebout1, Weijie Wei2, Zhi Liu2, Lijin Huang2, Olivier Le Meur1 1University of Rennes 1, France, 2Shanghai University, China

CLASSIFYING AUTISM SPECTRUM DISORDER BASED ON SCANPATHS AND SALIENCY

Mikhail Startsev, Michael Dorr Technical University of Munich, Germany

EXPLOITING VISUAL BEHAVIOUR FOR AUTISM SPECTRUM DISORDER IDENTIFICATION

Giuliano Arru, Pramit Mazumdar, Federica Battisti Roma Tre University, Italy

SP-ASDNET: CNN-LSTM BASED ASD CLASSIFICATION MODEL USING OBSERVER

SCANPATHS

Yudong Tao, Mei-Ling Shyu University of Miami, USA

PREDICTING AUTISM DIAGNOSIS USING IMAGE WITH FIXATIONS AND SYNTHETIC SAC- CADE PATTERNS

Chongruo Wu1, Sidrah Liaqat2, Sen-ching Cheung2, Chen-Nee Chuah1, Sally Ozonoff1 1University of California, Davis, USA, 2University of Kentucky, USA

17:10 - 17:15 ANNOUNCEMENT OF THE RESULTS 17:15 - 17:30 ORAL PRESENTATION WINNER TRACK 1. 17:30 - 17:45 ORAL PRESENTATION WINNER TRACK 2.

45 IEEE ICME2019

Tutorials

Monday, July 8, 2019

T-01: Big Data Intelligence: From Correlation Discovery to Casual Reasoning

Time: 8:30 AM - 12:00 PM

Room: 3E

Speaker: Fei Wu Zhejiang University, Hangzhou, China

Abstract

The discovery of correlations from large scale of data set is an interested issue nowadays. Artificial intelligence is now head- ing towards how to integrate data-driven learning and knowledge-guided inference to perform better reasoning and decision instead of correlation learning via metric matching. This talk will discuss the potential ways to fuse symbolic AI, data-driven learning and reinforcement learning to support causal reasoning.

Speaker

Fei Wu Zhejiang University, Hangzhou, China Bio: Fei Wu received his B.Sc., M.Sc. and Ph.D. degrees in computer science from Lanzhou University, University of Macau and Zhejiang University in 1996, 1999 and 2002 respectively. From October 2009 to August 2010, Fei Wu was a visiting scholar at Prof. Bin Yu’s group, University of California, Berke- ley. Currently, He is a Qiushi distinguished professor of Zhejiang University at the college of computer science. He is the vice-dean of college of computer science, and the director of Institute of Artificial Intelligence of Zhejiang University. He is the chairman of IEEE CAS Hangzhou-Chapter since Oct, 2018. He is currently the Associate Editor of Multimedia System, the editorial members of Frontiers of Information Technology & Electronic Engineering. He has won various honors such as the Award of National Science Fund for Distinguished Young Scholars of China (2016). His research interests mainly include Artificial Intelligence, Multimedia Analysis and Retrieval and Machine Learning.

46 Monday, July 8, 2019

T-02: Human Behavior Understanding: From Human-Oriented Analysis to Action Recognition

Time: 13:30 - 17:00 PM

Room: 3E

Speakers: Ting Yao JD AI Research, Beijing, China Wu Liu JD AI Research, Beijing, China

Abstract

Analyzing human behaviour in videos is one of the fundamental problems of computer vision and multimedia understanding. The task is very challenging as video is an information-intensive media with large variations and complexities in content. With the development of deep learning techniques, researchers have strived to push the limits of human behaviour under- standing in a wide variety of applications from action recognition to event detection. This tutorial will present recent advanc- es under the umbrella of human behavior understanding, which range from the fundamental problem of how to learn “good” video representations, to the challenges of categorizing video content into human action classes, finally to multimedia event detection and surveillance event detection in complex scenarios.

Speakers

Ting Yao JD AI Research, Beijing, China Bio: Ting Yao is currently a Principal Researcher in Vision and Multimedia Lab at JD AI Research, Beijing, China. His research interests include video understanding, large-scale multimedia search and deep learning. Prior to joining JD AI Research, he was a Researcher with Microsoft Research Asia in Beijing, China. Ting is an active participant of several benchmark evaluations. He is the principal designer of several top-performing multimedia analytic systems in worldwide competitions such as COCO Image Captioning, Visual Domain Adaptation Challenge 2017, ActivityNet Large Scale Ac- tivity Recognition Challenge 2018, 2017 and 2016, THUMOS Action Recognition Challenge 2015, and MSR-Bing Image Retrieval Challenge 2014 and 2013. He is one of the organizers of the MSR Video to Language Challenge 2017 and 2016. For his contributions to Multimedia Search by Self, External and Crowdsourc- ing Knowledge, he was awarded the 2015 SIGMM Outstanding Ph.D. Thesis Award.

Wu Liu JD AI Research, Beijing, China Bio: Wu Liu is a Senior Researcher in JD AI Research, China. He received his Ph.D. degree from the Institute of Computing Technology, Chinese Academy of Science in 2015. His current research inter- ests include video analytics, human behavior analysis, and intelligent video surveillance. He has pub- lished more than 30 papers in prestigious conferences and journals in computer vision and multimedia, including CVPR, ACM MM, IJCAI, AAAI, UBICOMP, IEEE T-MM, IEEE T-CYB, etc. He received Chinese Academy of Sciences Outstanding Ph.D. Thesis Award in 2016, Best Student Paper Awards at ICME in 2016, and the Deans Special Award of Chinese Academy of Sciences in 2015, etc. He is also the founding member of ACM FCA, the guest editor of MTAP and MVA, and the Web Chair of ICME 2019.

47 IEEE ICME2019

Monday, July 8, 2019

T-03: Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning

Time: 8:30 AM - 12:00 PM

Room: 3G

Speakers: Jiaying Liu Peking University, Beijing, China Wenhan Yang National University of Singapore, Singapore Chen Change Loy Nanyang Technological University, Singapore

Abstract

Intelligent image/video editing is a fundamental topic in image processing which has witnessed rapid progress in the last two decades. Due to various degradations in the image and video capturing, transmission and storage, image and video include many undesirable effects, such as low resolution, low light condition, rain streak and rain drop occlusions. The recovery of these degradations is ill-posed. With the wealth of statistic-based methods and learning-based methods, this problem can be unified into the cross-domain transfer, which cover more tasks, such as image stylization. In our tutorial, we will discuss recent progresses of image stylization, rain streak/drop removal, image/video super-resolution, and low light image enhancement. This tutorial covers both traditional statistics based and deep-learning based methods, and contains both biological-driven model, i.e. Retinex model, and data-driven model. An image processing viewpoint that con- siders the popular deep networks as a traditional Maximum-a-Posteriori (MAP) Estimation is provided. The side priors, de- signed by researchers and learned by multi-task learnings, and automatically learned priors, captures by adversarial learning are two kinds of important priors in this framework. Three works under this framework, including single image super-resolu- tion, low light image enhancement, and single image raindrop removal are presented. Single image super-resolution is a classical problem in computer vision. It aims at recovering a high-resolution image from a single low-resolution image. This problem is an underdetermined inverse problem, of which solution is not unique. In this tutorial, we will discuss how we can solve the problem by deep convolutional networks in a data-driven manner. We will review different model variants and important techniques such as adversarial learning for image super-resolution. We will then discuss recent work on hallucinating faces of unconstrained poses and with very low resolution. Finally, the tutorial will discuss challenges of implementing image super-resolution in real-world scenarios.

Speakers

Jiaying Liu Peking University, Beijing, China Bio: Jiaying Liu is currently an Associate Professor with the Institute of Computer Science and Technology, Peking University. She received the Ph.D. degree (Hons.) in computer science from Peking University, Beijing China, 2010. She has authored over 100 technical articles in refereed journals and proceedings, and holds 34 granted patents. Her current research interests include multi- media signal processing, compression, and computer vision. Dr. Liu is a Senior Member of IEEE and CCF. She was a Visiting Scholar with the University of Southern California, Los Angeles, from 2007 to 2008. She was a Visiting Researcher with the Microsoft Research Asia in 2015 supported by the Star Track Young Faculties Award. She has served as a member of Multimedia Systems & Applications Technical Committee (MSA-TC), Visual Signal Processing and Communications Technical Committee (VSPC) and Education and Outreach Tech- nical Committee (EO-TC) in IEEE Circuits and Systems Society, a member of the Image, Video, and Multimedia (IVM) Technical Committee in APSIPA. She has also served as the Technical Program Chair of IEEE VCIP-2019/ACM ICMR- 2021, the Publicity Chair of IEEE ICIP-2019/VCIP-2018, the Grand Challenge Chair of IEEE ICME-2019, and the Area Chair of ICCV-2019. She was the APSIPA Distinguished Lecturer (2016-2017). In addition, Dr. Liu also devotes herself to teaching. She has run MOOC Programming Courses via Coursera/edX/Chi- neseMOOCs, which have been enrolled by more than 60 thousand students. She is also the organizer of the first Chinese

48 MOOC Specialization in Computer Science. She is the youngest recipient of Peking University Outstanding Teaching Award.

Wenhan Yang National University of Singapore, Singapore Bio: Wenhan Yang is a Postdoc research fellow with the Department of Computer Science, City University of Hong Kong. Wenhan Yang received the B.S degree and Ph.D. degree (Hons.) in computer science from Peking University, Beijing, China, in 2012 and 2018. Dr. Yang was a Visiting Scholar with the National University of Singapore, from 2015 to 2016. He has authored over 30 technical articles in refereed journals and proceedings. His current research interests include deep-learning based image processing, bad weather restoration, related applications and theories.

Chen Change Loy Nanyang Technological University, Singapore Bio: Chen Change Loy is a Nanyang Associate Professor with the School of Computer Science and Engineering, Nanyang Technological University, Singapore. He is also an Adjunct Asso- ciate Professor at the Chinese University of Hong Kong. He received his PhD (2010) in Com- puter Science from the Queen Mary University of London. Prior to joining NTU, he served as a Research Assistant Professor at the MMLab of the Chinese University of Hong Kong, from 2013 to 2018. He is the recipient of 2019 Nanyang Associate Professorship (Early Career Award) from Nanyang Technological University. His research interests include computer vision and deep learning, with a focus on face analysis, image processing, and visual surveillance. He has published more than 100 papers in top journals and conferences of computer vision and machine learn- ing. He and his team proposed a number of important methods for image super-resolution including SRCNN, SFTGAN and ESRGAN. As a co-author, his journal paper on SRCNN was selected as the `Most Popular Article’ by IEEE Transactions on Pattern Analysis and Machine Intelligence in 2016. It remains as one of the top 10 articles to date. ESRGAN has been widely used to remaster various classic games such as Half-Life, Resident Evil 2, Morrowind, and Final Fantasy 7. He serves as an Associate Editor of the International Journal of Computer Vision (IJCV) and IET Computer Vision Journal. He also serves/served as the Area Chair of CVPR 2019, BMVC 2019, ECCV 2018, and BMVC 2018. He is a senior member of IEEE.

49 IEEE ICME2019

Monday, July 8, 2019

T-04: Visual Search and Question Answering

Time: 13:30 - 17:00 PM

Room: 3G

Speakers: Lu Jiang Google Cloud AI, Sunnyvale, CA, USA Liangliang Cao University of Massachusetts, Amherst, MA, USA Yannis Kalantidis Facebook AI, California

Abstract

Personal photo and video data are being accumulated at an unprecedented speed. For example, 14 petabytes of personal pho- tos and videos were uploaded to Google Photo1 by 200 million users in 2015, while a tremendous amount of personal photos and videos are also being uploaded to Flickr every day. How to efficiently search and organize such data presents a huge challenge to both academic research and industrial applications. To attack this challenge, this tutorial will review the research efforts in related subjects and showcases of successful industri- al systems. We will discuss traditional visual search methods and the improvement of visual presentations brought by deep neural networks. The instructors will also share their experience of building large-scale fashion search and Flickr similarity search systems and bring insights on the challenges of extending the academic research to industrial applications. This tutorial will discuss the queries and logs of search engines, and analyze how to address the characteristics of personal media search. By leveraging searching techniques to visual question answering, this tutorial will introduce a new task named MemexQA: given a collection of photos or videos from the user, can we automatically answer questions that help users re- cover their memory about events captured in the collection? New datasets and algorithms of MemexQA will be reviewed. We hope MemexQA will shed light on the next generation computer interface of exploding amount of personal photos and videos.

Speakers

Lu Jiang Google Cloud AI, Sunnyvale, CA, USA Bio: Lu Jiang is a research scientist at Google CLoudAI, advised by Dr. Jia Li and Dr. Fei-Fei Li. He received his Ph.D. in Artificial Intelligence (Language Technology) from the Carnegie Mellon Uni- versity in 2017, advised by Dr. Alexander Hauptmann and Dr. Teruko Mitamura. Dr. Tat-Seng Chua and Dr. Louis-Philippe Morency are his thesis advisors. He was an intern scientist in Yahoo Research during the 2016 summer, working with Dr. Liangliang Cao, Dr. Yannis Kalantidis and Sachin Farfade on the personal photo and video search on Flickr. Prior to that, he was an intern in Google Research working with Dr. Paul Natsev and Dr. Balakrishnan Varadarajan on large-scale deep learning on the noisy YouTube-8M dataset. He interned at Mi- crosoft Research Asia in 2010, working with Dr. Qiang Wang and Dr. Dongmei Zhang on data mining. Before that, he was a research assistant at Xi’an Jiaotong University, supervised by Dr. Jun Liu on text mining, information retrieval. Lu’s primary interests lie in the interdisciplinary field of Multimedia, Machine Learning, Computer Vision, Information Retrieval, which, specifically, include video understanding and search, weakly supervised learning, deep learning, cloud machine learning, etc. He regularly serves on the programme committee of premier conferences such as ACM Multimedia, AAAI, and IJCAI. Lu is the recipient of the Yahoo Fellowship, Erasmus Mundus Scholar. He received the best poster award at IEEE Spoken Language Technology and the best Paper nomination at ACM International Conference on Multimedia Re- trieval.

50 Liangliang Cao University of Massachusetts, Amherst, MA, USA Bio: Liangliang Cao is a Staff Research Scientist at Google. He is also affiliated with UMass CICS as a research associate professor. His research interests include AI and large scale data learning, spanning computer vision, language, and speech. Before joining Google, he worked as a co-founder of HelloVe- ra, and earlier a senior scientist at Yahoo Labs and a research staff member at IBM Watson Research Center. He is an associate editor of the Visual Computer and JVIS. He won the 1st place of ImageNet LSVRC Challenge in 2010. He is a recipient of ACM SIGMM Rising Star Award.

Yannis Kalantidis Facebook AI, California

Bio: Yannis Kalantidis is a research scientist at Facebook AI in Menlo Park, California. He grew up in Athens, Greece and lived there till 2015, with brief breaks in Sweden, Spain and the United States. He got his PhD on large-scale search and clustering from the National Technical University of Athens in 2014. He was a postdoc and research scientist at Yahoo Research in San Francisco for two years, lead- ing the visual similarity search project at Flickr and participated in the Visual Genome dataset efforts with Stanford. He is currently conducting research on video understanding, representation learning and modeling of vision and language.

51 IEEE ICME2019

Monday, July 8, 2019

T-05: Object Detection Beyond Mask R-CNN and RetinaNet

Time: 8:30 AM - 12:00 PM

Room: 5A

Speakers: Gang Yu Face++, Beijing, China Yichen Wei Face++, Beijing, China Xiangyu Zhang Face++, Beijing, China

Abstract

Object detection is a fundamental problem in the computer vision society with numerous applications. Recently, as the devel- opment of Mask R-CNN and RetinaNet, the pipeline of the object detection seems to be mature. However, the performance for the current state-of-art object detection is still far from the requirements from the visual applications. In this tutorial, we will delve into the details of the object detection and present the improvements from five aspects: backbone, head, scale, batchsize, and post-processing. For the backbone, we will discuss a novel network called DetNet, which is specifically designed for the detection task. Det- Net preserves the spatial information of the network structure compared with traditional ImageNet pretrained backbone. For the head design, we will introduce Light-Head R-CNN for the fast inference speed and moreover a novel Localization Sen- sitive Head (LSH) will be discussed which decouples the classification and regression tasks into two branches. For the scale issue, we present a novel algorithm called SFace which can address the large scale variation problem in the Face Detection problem. Also, large batch-size detector will be discussed to significantly reduce the number of model training time. Besides, a new dataset called CrowdHuman will be discussed to address the NMS issue during the post-processing stage.

Speakers

Gang Yu Face++, Beijing, China Bio: Gang YU is the team leader for the detection in MEGVII. He graduated from Nanyang Technolog- ical University, Singapore, in 2014. He then joined MEGVII and his research interest focuses on com- puter vision and machine learning, including object detection, segmentation, skeleton, and human action analysis. Gang Yu has obtained the winners of COCO2017 and COCO2018 detection challenge and Keypoint challenge.

Yichen Wei Face++, Beijing, China

Bio: Dr. Wei joined Megvii on July, 2018. Before that, he spent 12 years in Visual Computing group, Mi- crosoft Research Asia. He received my Ph.D degree in Hong Kong University of Science and Technology in 2006, and B.S. degree in Peking University in 2001, respectively.

Dr Wei’s research interests include 3D vision, object recognition, detection, tracking and pose estimation. His Google Scholar citation is about 5,700, h-index is 31. His work has been transferred to Kinect Identi- ty in XBox, Windows Hello, Microsoft Cognitive Service, Bing, Office, and Microsoft XiaoIce, etc.

Xiangyu Zhang Face++, Beijing, China Bio: Xiangyu Zhang is currently the team leader of base model group in MEGVII Research. He received his doctoral degree from Xi’an Jiaotong University in 2017 and then joined MEGVII Technology. His re- search interest mainly focuses on deep learning models for computer vision, including CNN architecture design, network pruning and acceleration, neural architecture search and object detection/segmentation. He got CVPR 2016 best paper award and won a series of vision competitions such as ILSVRC 2015, COCO 2015/2017/2018. The total number of his Google Scholar citations is over 30000.

52 Friday, July 12, 2019

T-06: Computer Vision for Transportation

Time: 13:30 - 17:00 PM

Room: 5A

Speakers: Haifeng Shen AI Labs, Didi Chuxing, China Zhengping Che AI Labs, Didi Chuxing, China Guangyu Li AI Labs, Didi Chuxing, China Yuhong Guo AI Labs, Didi Chuxing & Carleton University, China Jieping Ye Didi Chuxing & University of Michigan, Ann Arbor

Abstract

Computer vision in transportation has recently received increasing attention from both industry and academia due to the popularity of modern mobile transportation platforms and the rapid development of autonomous driving. In this tutorial, we systematically introduce the recent progresses of computer vision techniques and their applications in transportation. Spe- cifically, we will provide a general overview of the key problems, common formulations, existing methodologies and future directions. This tutorial will inspire the audience and facilitate research in computer vision for transportation. The tutorial mainly consists of three parts: Lecture 1: Challenges using object recognition, optical character recognition and face recognition in transportation. • The recent progresses of the object recognition, optical character recognition and face recognition technologies. • The difficulties and problems when applying these technologies in transportation. • The solutions and applications.

Lecture 2: Towards Driving Scenario Understanding. • Object detection, tracking, and segmentation in driving scenarios. • Vision based 3D reconstruction of driving scenario. • Driving behavior modeling and safety risk analysis.

Lecture 3: Applying transfer learning in CV. • The introduction of the recent transfer learning technologies. • The applications of the transfer learning technologies in CV.

Speakers

Haifeng Shen AI Labs, Didi Chuxing, China

Bio: Haifeng Shen is a senior expert algorithm engineer in Didi Chuxing and leads the computer vision group in the AI Labs. He received his Ph.D. degree in signal and information processing from Beijing University of Posts and Telecommunications, in 2006. He has worked at Panasonic, Baidu, and Microsoft. He built the first speech recognition interface for XiaoIce chatbot in Mic- rosoft and his current work focuses on computer vision in transportation. His research interests include computer vision, speech recognition and natural language processing.

53 IEEE ICME2019

Zhengping Che AI Labs, Didi Chuxing, China Bio: Zhengping Che is a senior research scientist at DiDi AI Labs. He received his Ph.D. in Computer Science from the University of Southern California. Before that, he received his B.E. in Computer Science from Pilot CS Class (Yao Class), Tsinghua University. His current research interests lie in machine learning, deep learning and data mining with applications to temporal data and vision data. He has published several papers in ICML, KDD, ICDM, AMIA and other venues and interned at DiDi AI Labs, Mayo Clinic, IBM Research, Google and Hulu.

Guangyu Li AI Labs, Didi Chuxing, China Bio: Guangyu Li is a senior research scientist at DiDi AI Labs. In this role, he works on intelligent vehicles regarding autonomous driving, intelligent cockpit, and IoT systems. Before that, he de- veloped perception algorithms for self-driving trucks at TuSimple, an autonomous truck unicorn. Besides industrial experience, he is also a PhD candidate in University of Southern California. His research interests lie in computer vision, large scale sensor systems, and virtual/augmented/mixed reality with a focus on their applications in modern intelligent transportation.

Yuhong Guo AI Labs, Didi Chuxing & Carleton University, China Bio: Yuhong Guo is a principal research scientist at Didi Chuxing. She is also an associate profes- sor at Carleton University, a faculty affiliate of the Vector Institute, and a Canada Research Chair in Machine Learning. She received her PhD from the University of Alberta, and has previously worked at the Australian National University and Temple University. Her research interests in- clude machine learning, artificial intelligence, computer vision, and natural language processing. She has won paper awards from both IJCAI and AAAI. She has served in the Senior Program Committees of AAAI, IJCAI and ACML, and is currently serving as an Associate Editor for TPA- MI.

Jieping Ye Didi Chuxing, China & University of Michigan, Ann Arbor, USA

Bio: Jieping Ye is Head of DiDi AI Labs, a VP of Didi Chuxing and a DiDi Fellow. He is also a Professor at the University of Michigan, Ann Arbor. His research interests include data mining and machine learning with applications in transportation and biomedicine. He has served as a Senior Program Committee/Area Chair/Program Committee Vice Chair of many conferences including NIPS, ICML, KDD, IJCAI, AAAI, ICDM, and SDM. He serves as an Associate Edi- tor of Data Mining and Knowledge Discovery and IEEE Transactions on Knowledge and Data Engineering. He won the NSF CAREER Award in 2010. His papers have been selected for the outstanding student paper at ICML in 2004, the KDD best research paper runner up in 2013, and the KDD best student paper award in 2014.

54 Friday, July 12, 2019

T-07: Causally Regularized Machine Learning

Time: 8:30 AM - 12:00 PM

Room: 5BC

Speakers: Peng Cui Tsinghua University, Beijing, China Kun Kuang Tsinghua University, Beijing, China Bo Li Tsinghua University, Beijing, China

Abstract

Owing to the popularity of Big Data, abundant multimedia data are accumulated in various domains. At the same time, many machine learning methods are proposed to exploit these data for prediction. These methods have been proved to be success- ful in prediction-oriented applications. However, the lack of interpretability of most predictive algorithms makes them less attractive in many settings, especially those requiring decision making. How to improve the interpretability of learning algo- rithms is of paramount importance for both academic research and real applications? Causal inference, which refers to the process of drawing a conclusion about a causal connection based on the conditions of the occurrence of an effect, is a powerful statistical modeling tool for explanatory analysis. In this tutorial, we focus on caus- ally regularized machine learning, aiming to explore causal knowledge from observational data to improve the explainability and stability of machine learning algorithms. First, we will give some examples on how machine learning algorithms today focus on correlation analysis and prediction, and why those methods are not insufficient for decision making. Then, we will give introduction to causal inference and introduce some recent data-driven approaches to explore causal knowledge from observational data, especially in high dimensional setting. Aiming to bridge the gap between causal inference and machine learning, we will introduce some recently causally regularized machine learning algorithms for improving the stability and interpretability of prediction on multimedia data. Finally, we will discuss future directions of the landscape of open research and challenges in machine learning with causal inference.

Speakers

Peng Cui Tsinghua University, Beijing, China Bio: Peng Cui is an Associate Professor in Tsinghua University. He got his PhD degree from Tsinghua University in 2010. His research interests include network representation learning, social dynamics modeling and human behavior modeling. He has published more than 60 papers in prestigious con- ferences and journals in data mining and multimedia. His recent research won the ICDM 2015 Best Student Paper Award, SIGKDD 2014 Best Paper Finalist, IEEE ICME 2014 Best Paper Award, ACM MM12 Grand Challenge Multimodal Award, and MMM13 Best Paper Award. He is the Area Chair of ICDM 2016, ACM MM 2014-2015, IEEE ICME 2014-2015, ICASSP 2013, Associate Editor of IEEE TKDE, ACM TOMM, Elsevier Journal on Neurocomputing. He was the recipient of ACM China Rising Star Award in 2015.

Kun Kuang Tsinghua University, Beijing, China Bio: Kun Kuang received the B.E. degree from the Department of Computer Science and technology of Beijing Institute of Technology in 2014. He is a fifth- year Ph.D. candidate in the Department of Computer Science and Technology of Tsinghua University. His main research interests including data mining, high dimensional inference and data driven causal model. He has published several papers on data-driven causal inference and high dimensional inference in top data mining and machine learning conferences/journals of the relevant field such as SIGKDD, AAAI, and ICDM etc.

55 IEEE ICME2019

Bo Li Tsinghua University, Beijing, China Bio: Bo Li received a Ph. D degree in Statistics from the University of California, Berkeley, and a bach- elor’s degree in Mathematics from Peking University. He is an Associate Professor at the School of Eco- nomics and Management, Tsinghua University. His research interests are statistical methods for high-di- mensional data, statistical causal inference and data-driven decision making. He has published widely in academic journals across a range of fields including statistics, management science and economics.

56 Friday, July 12, 2019

T-08: Architecture Design for Deep Neural Networks Time: 8:30AM - 12:00PM

Room: 5DE

Speakers: Gao Huang Tsinghua University, Beijing, China Jingdong Wang MSRA, Beijing, China Lingxi Xie Huawei Inc., Beijing, China

Abstract

Recent years have witnessed great success in the deployment of deep learning for various tasks. Neural architecture innova- tion plays an important role in advancing this research direction. From AlexNet and VGG to ResNet and DenseNet, better architecture design has pushed the depth limit of deep models from 7 layers to over one thousand layers. The unprecedent depth endows neural networks with strong representation power. This tutorial will review classical convolutional network architectures, discuss their underlying design principles, and analyze their strengths and weaknesses. Particularly, we will address the recent trend of developing highly efficient light-weighted deep models for practical applications with limited computational resources, e.g., mobile phones and wearable devices. Besides hand designed structures that incorporate human intuition, neural architectures obtained via automatic search have gained great popularity in the recent two years. This newly emerged research direction, usually referred as AutoML, will also be covered in this tutorial.

Speakers

Gao Huang Tsinghua University, Beijing, China Bio: Dr. Gao Huang is an Assistant Professor with the Department of Automation at Tsinghua University. Previously, he was a postdoc with the Department of Computer Science at Cornell University. His re- search interests lie in machine learning and computer vision, with a special focus on deep learning. He has authored more than 30 papers, which collect more than 6000 citations. He is a recipient of the CVPR Best Paper Award (DenseNet), CAA Doctoral Dissertation Award and the Super AI Leader - Pioneer Award.

Jingdong Wang MSRA, Beijing, China Bio: Jingdong Wang is a Senior Researcher with the Visual Computing Group, Microsoft Research, Beijing, China. His areas of current interest include CNN architecture design, human pose estimation, semantic segmentation, person re-identification, large-scale indexing, and salient object detection. He has authored one book and 100+ papers in top conferences and prestigious international journals in computer vision, multimedia, and machine learning. He authored a comprehensive survey on learning to hash in TPAMI. His paper was selected into the Best Paper Finalist at ACM MM 2015. Dr. Wang is an Associate Editor of IEEE TPAMI, IEEE TCSVT and IEEE TMM. He was an Area Chair or a Senior Program Committee Member of top conferences, such as CVPR, ICCV, ECCV, AAAI, IJCAI, and ACM Multimedia. He is an ACM Distinguished Member and a Fellow of the IAPR. His homepage is https://jingdongwang2017.github.io/.

His representative works include deep high-resolution representation learning (HRNet), interleaved group convolutions, su- pervised saliency detection (discriminative regional feature integration, DRFI), neighborhood graph search (NGS) for large scale similarity search, composite quantization for compact coding, the Market-1501 dataset for person re-identification, and so on. He has shipped a dozen of technologies to Microsoft products, including Bing search, Bing Ads, Cognitive service, and XiaoIce Chatbot. His NGS algorithm is a foundational element of many products. He has developed Bing image search color filter using his efficient salient object algorithm. He has developed the first commercial color-sketch image search system.

57 IEEE ICME2019

Lingxi Xie Huawei Inc., Beijing, China Bio: Lingxi Xie is currently a senior researcher at Noah’s Ark Lab, Huawei Inc. He obtained B.E. and Ph.D. in engineering, both from Tsinghua University, in 2010 and 2015, respectively. He also served as a post-doctoral researcher at the CCVL lab from 2015 to 2019, having moved from the University of Cali- fornia, Los Angeles to the Johns Hopkins University. His homepage is http://lingxixie.com/. Lingxi’s research interests lie in computer vision, in particular the application of deep learning models. His research covers image classification, object detection, semantic segmentation and other vision tasks. He is also interested in medical image analysis, especially object segmentation in CT or MRI scans. Lingxi has published over 40 papers in top-tier international conferences and journals. In 2015, he received the outstanding Ph.D. thesis award from Tsinghua University. He is also the winner of the best paper award at ICMR 2015.

58 Friday, July 12, 2019

T-09: Intelligent Multimedia Recommendation

Time: 13:30 - 18:00 PM

Room: 5DE

Speakers: Jialie Shen Queen’s University Belfast, Belfast, United Kingdom Jian Zhang University of Technology Sydney, Sydney, Australia

Abstract

Due to the rapid growth of multimedia big data and related novel applications, intelligent recommendation systems have be- come more and more important in our daily life. During last decades, various multimedia technologies have been developed by different research communities (e.g., multimedia systems, information retrieval, and machine learning). Meanwhile, rec- ommendation techniques have been successfully leveraged by commercial systems (e.g., Amazon, Youtube and Spotify) to assist general users to deal with information overload and provide them high quality contents, interactions and services. While several tutorials and courses were dedicated to multimedia recommendation in the last few years, to the best of our knowledge, this tutorial should be the advanced and comprehensive one focusing on intelligent content analytics and its core applications on recommending various types of media contents. We plan to summarize the research along this direction and provide a good balance between theoretical methodologies and real system development (including several industrial ap- proaches). Core contributions to literature largely include: • Introducing why advanced recommendation system is important for Web scale multimedia retrieval, understand- ing and sharing. • Examining current commercial systems and research prototypes, focusing on comparing the advantages and the disadvantages of the various strategies and schemes for different types of media documents (e.g., image, video, audio and text) and their composition. • Reviewing key challenges and technical issues in building and evaluating modern recommendation systems under different contexts. • Discussing and reviewing various limitations of the current generation of systems. • Make predictions about the road that lies ahead for the scholarly exploration and industrial practice in multimedia and other related communities.

We also plan to have open discussion in this tutorial on several promising research directions with significant technical im- portance and explore potential solutions. Thus, we hope that this study provides an impetus for further research on this im- portant direction.

Speakers

Jialie Shen Queen’s University Belfast, Belfast, UK Bio: Dr. Jialie Shen is a Reader in Computer Science, School of Electronics, Electrical Engi- neering and Computer Science, Queen’s University Belfast (QUB), Belfast, United Kingdom. He received his PhD in Computer Science from the University of New South Wales (UNSW), Australia in the area of large-scale media retrieval and database access methods. Dr. Shen worked as a faculty member at Hong Kong, Singapore, Australia and England and researcher at information retrieval research group (Led by Professor Keith van Rijsbergen), the University of Glasgow, Scotland before moving to the QUB. Dr. Shen’s main research interests include in- formation retrieval, machine learning, multimedia systems and audio/video analytics. His research has been published or is forthcoming in leading journals and international conferences, including ACM SIGIR, ACM Multimedia, IJCAI, AAAI, IEEE Transactions and ACM Transactions.

59 IEEE ICME2019

Jian Zhang University of Technology Sydney, Sydney, Australia Bio: A/Prof. Jian Zhang: Dr. Jian Zhang is an Associate Professor in School of Electrical & Data Engineering, University of Technology Sydney, Australia. He received a PhD in electrical engineering from the University of New South Wales (UNSW), Sydney, Aus- tralia in area of image processing and video communication. From 1997 to 2003, he was with the Visual Information Processing Laboratory, Motorola Labs, Sydney, as a Principal Research Engineer and Research Manager of Visual Communications. From 2004 to July 2011, he was a Principal Researcher and a Project Leader with Data61 (formerly INCTA) Australia and a Conjoint Associate Professor with the School of Computer Science and Engineering, UNSW. He is currently an Associate Professor with the Global Big Data Technologies Centre, School of Electrical and Data Engineering, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney. He is the author or co-author of more than 150 paper publications, book chapters, and six issued US and China patents. His current research interests include social multimedia signal processing, large scale image and vid- eo content analytics, retrieval and mining, 3D based computer Vision and intelligent video surveillance systems. Dr. Zhang was the General Co-Chair of the International Conference on Multimedia and Expo in 2012 and Technical Program Co-Chair of IEEE Visual Communications and Image Processing 2014. Currently, he is an Associated Edi- tors for the IEEE TRANSACTIONS ON MULTIMEDIA and the EURASIP Journal on Image and Video Processing (2016 – now). He was an Associate Editor for the IEEE TRANSCTIONS ON CIRCUITS AND SYSTEMS FOR VID- EO TECHNOLOGY (2006 – 2015).

60 Oral Sessions

Tuesday, July 9, 2019

Best Paper Session

Time: 10:00 - 11:00 AM

Room: 3CD

Chair: Yonggang Wen Nanyang Technological University, Singapore

Marta Mrak BBC, UK

10:00 AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWLEDGE DISTILLATION

Yu Hao1, Yanwei Fu1, Yu-Gang Jiang1,2, Qi Tian3

1Fudan University, China, 2Jilian Technology Group(Video++) ,China, 3Huawei Noah’s Ark Lab, China

10:15 REAL-TIME INDOOR SCENE RECONSTRUCTION WITH RGBD AND INERTIAL IN PUT

Zunjie Zhu1, Feng Xu2, Chenggang Yan1, Xinhong Hao3, Xiangyang Ji2, Yongdong Zhang4, Qionghai Dai2

1Hangzhou Dianzi University, China, 2Tsinghua University, China, 3Beijing Institute of Technology, China, 4University of Science and Technology of China, China

10:30 DOUBLY SEMI-SUPERVISED MULTIMODAL ADVERSARIAL LEARNING FOR CLASSIFICATION, GENERATION AND RETRIEVAL

Changde Du1, Changying Du2, Huiguang He1

1Institute of Automation Chinese Academy of Sciences, China, 2Qihoo 360 Search Lab, China

10:45 TOWARDS DIGITAL RETINA IN SMART CITIES: A MODEL GENERATION, UTILIZATION AND COM- MUNICATION PARADIGM

Yihang Lou1,4, Ling-Yu Duan1,4, Yong Luo1,4, Ziqian Chen1,4, Tongliang Liu2, Shiqi Wang3, Wen Gao1,4

1Peking University, China, 2University of Sydney, Australia, 3City University of Hongkong, China, 4The Peng Cheng Laboratory, Shenzhen, China

61 IEEE ICME2019

Tuesday, July 9, 2019

O-01: Content Recommendation and Cross-modal Hashing

Time: 14:00 - 15:00 PM

Room: 3CD

Chair: Chengcui Zhang University of Alabama at Birmingham, USA

14:00 SDP: AN IMPROVED BASELINE ESTIMATION MODEL BASED ON STANDARD DEVIATION PRO- PORTION

Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang

Northeastern University, China

14:15 CITATION RECOMMENDATION BASED ON WEIGHTED HETEROGENEOUS IN FORMATION NET- WORK CONTAINING SEMANTIC LINKING

Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang

Anhui University, China

14:30 FUSION-SUPERVISED DEEP CROSS-MODAL HASHING

Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang

Shandong Normal University, China

14:45 DOMAIN UNCERTAINTY BASED ON INFORMATION THEORY FORCROSS-MODAL HASH RETRIEV- AL

Wei Chen1, Nan Pu1, Yu Liu2, Erwin M. Bakker1, Michael S. Lew1,

1Leiden University, Holland, 2ESAT-PSI, KU Leuven, Belgium

62 Tuesday, July 9, 2019

O-02: Development of Multimedia Standards and Related Research

Time: 14:00 - 15:00 PM

Room: 3HI

Chair: Cheolkon Jung Xidian University, China

14:00 ADAPTIVE PLANE PROJECTION FOR VIDEO-BASED POINT CLOUD CODING

Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira

Instituto Superior Técnico, Universidade de Lisboa - Instituto de Telecomunicações, Lisboa, Portugal

14:15 FAST CU PARTITIONING ALGORITHM FOR H.266/VVC INTRA-FRAME CODING

Ting Fu1, Hao Zhang 1, Fan Mu1, Huanbang Chen2

1Central South University, China, 2Huawei Base, China

14:30 TWO-STAGE FAST MULTIPLE TRANSFORM SELECTION ALGORITHM FOR VVC INTRA COD- ING

Ting Fu1, Hao Zhang 1, Fan Mu1, Huanbang Chen2

1Central South University, China, 2Huawei Base, China

14:45 HISTORY-BASED MOTION VECTOR PREDICTION FOR FUTURE VIDEO CODING

Junru Li1, Meng Wang2, Li Zhang3, Kai Zhang3, Hongbin Liu3, Shiqi Wang2, Siwei Ma1, Wen Gao1

1Peking University, China, 2City University of Hong Kong, China, 3Bytedance Inc., San Diego CA. 92122 USA, USA

63 IEEE ICME2019

Tuesday, July 9, 2019

O-03: Classification and Low Shot Learning

Time: 14:00 - 15:00 PM

Room: 5BC

Chair: Yu-Gang Jiang Fudan University, China

14:00 AMS-SFE: TOWARDS AN ALIGNMENT OF MANIFOLD STRUCTURES VIA SEMANTIC FEATU- REEXPANSION FOR ZERO-SHOT LEARNING

Jingcai Guo, Song Guo

The Hong Kong Polytechnic University, China

14:15 LOW-SHOT PALMPRINT RECOGNITION BASED ON META-SIAMESE NET WORK

Xuefeng Du1, Dexing Zhong1,2, Pengna Li1

1Xi’an Jiaotong University, China, 2Research Institute of Xi’an Jiaotong University, China

14:30 SR-GAN: SEMANTIC RECTIFYING GENERATIVE ADVERSIAL NETWORK FOR ZERO-SHOT LEARNING

Zihan Ye1,5, Fan Lyu1,2, Linyan Li3, Qiming Fu1,6, Jinchang Ren4, Fuyuan Hu1,7

1Suzhou University, China, 2Tianjin University, China, 3Suzhou Institute of Trade & Commerce, China, 4University of Strathclyde, UK, 5Virtual Reality Key Laboratory of Intelligent Interaction and Application Technology of Suzhou, China, 6Key Laboratory of Intelligent Building Energy Efficiency,China, 7Suzhou Key Laboratory for Big Data and Information Service, China

14:45 COMPARE MORE NUANCED: PAIRWISE ALIGNMENT BILINEAR NETWORK FOR FEW-SHOT FINE-GRAINED LEARNING

Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu

University of Technology Sydney, Australia

64 Tuesday, July 9, 2019

O-04: 3D Media Computing

Time: 14:00 - 15:00 PM

Room: 5DE

Chair: Dan Zeng Shanghai University, China

14:00 FEATURE-AWARE AND CONTENT-WISE DENOISING OF 3D STATIC AND DY NAMIC MESHES USING DEEP AUTOENCODERS

Gerasimos Arvanitis1, Aris S. Lalos2, and Konstantinos Moustakas1

1University of Patras, Greece, 2“ATHENA” Research Center, Greece

14:15 REAL-TIME MONOCULAR VISUAL SLAM BY COMBINING POINTS AND LINES

Xinyu Wei, Jun Huang, Xiaoyuan Ma

Shanghai Advanced Research Institute, China

14:30 F-NUMBER ADAPTATION FOR MAXIMIZING THE SENSOR USAGE OF LIGHT FIELD CAMER- AS

Chuanpu Li, Xin Jin, Junke Li and Qionghai Dai

Shenzhen Key Lab of Broadband Network and Multimedia, China

14:45 BLIND CALIBRATION FOR FOCUSED PLENOPTIC CAMERAS

Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen and Qionghai Dai

Shenzhen Key Lab of Broadband Network and Multimedia, China

65 IEEE ICME2019

Tuesday, July 9, 2019

O-05: Special Session "Pedestrian Detection, Tracking and Re-identification in Videos"

Time: 15:30 - 16:30 PM

Room: 3CD

Chair: Guiguang Ding Tsinghua University, China

Sicheng Zhao University of California, Berkeley, USA

Jungong Han Lancaster University, UK

15:30 PARTICLE SWARM LOSS FOR LIGHTWEIGHT OBJECT DETECTION

Peizhen Zhang1,2,4, Feng Zheng3, Junlong Du2, Jun Zhang2, Xiaowei Guo2, Wei-Shi Zheng1,4

1Sun Yat-sen University, China, 2Youtu Lab, Tencent, China, 3Southern University of Science and Technology, China, 4Key Laboratory of Machine Intelligence and Advanced Computing, China, Ministry of Education, China

15:45 INCORPORATING CATEGORY TAXONOMY IN DEEP REINFORCEMENT LEARNING BASED IMAGE HASHING

Qiang Fu1, Linsen Dong2, Ziyuan Liu2, Yong Luo2, Yonggang Wen2, Ying Li1, Ling-Yu Duan3

1Peking University, China, 2Nanyang Technological University, Singapore, 3Peking University, China

16:00 TRUNCATED GRADIENT CONFIDENCE-WEIGHTED BASED ONLINE LEARNING FOR IMBAL- ANCE STREAMING DATA

Ji Hu1, Chenggang Yan1, Xing Liu1, Jiyong Zhang1, Dongliang Peng1, Yi Yang2

1HangZhou DianZi University, China, 2UTS, Australia

16:15 UAV TARGET TRACKING BY DETECTION VIA DEEP NEURAL NETWORKS

Mohamed A. Kassab1, Ali Maher2, Fathy Elkazzaz3, Zhang Baochang1,4

1Beihang University, China, 2Military Technical college, Egypt, 3Benha University, Egypt, 4Shenzhen Academy of Aerospace Technology, China

66 Tuesday, July 9, 2019

O-06: Special Session "Multimedia Technologies Empowering Retail Experi- ences"

Time: 15:30 - 16:30 PM

Room: 3HI

Chair: Wu Liu JD AI Research, China

Liang Zheng Australian National University, Australia

Yi Yang University of Technology Sydney, Australia

Lexing Xie Australian National University, Australia

15:30 QUARTER-POINT CODEWORD EXPANSION FOR PRODUCT QUANTIZATION

Shan An1,2, Zhibiao Huang1, Guangfu Che1, Xianglong Liu2, Xin Ma3, Yu Chen1

1Department of Data Intelligence, JD.com, China, 2Beihang University, China, 3Shandong University, China

15:45 CONTEXT-AWARE AFFECTIVE GRAPH REASONING FOR EMOTION RECOGNITION

Minghui Zhang, Yumeng Liang, Huadong Ma

Beijing University of Posts and Telecommunications, China

16:00 SPL: EXPLOITING UNLABELED DATA FOR MULTI-LABEL IMAGE CLASSIFICATION

Weibo Zhang1,2, Fuqing Zhu1, Jiao Dai1, Songlin Hu1, Jizhong Han1, Tao Guo1

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2School of Cyber Security, University of Chinese Academy of Sciences, China

16:15 MLTS: A MULTI-LANGUAGE SCENE TEXT SPOTTER

Yu Zhou1, Shancheng Fang2, Hongtao Xie1, Zheng-Jun Zha1, Yongdong Zhang1

1University of Science and Technology of China, China,2Institute of Information Engineering, Chinese Academy of Sciences, China

67 IEEE ICME2019

Tuesday, July 9, 2019

O-07: 3D and Low Level Vision

Time: 15:30 - 16:30 PM

Room: 5BC

Chair: Wei Hu Peking University, China

15:30 UNSUPERVISED MONOCULAR DEPTH ESTIMATION BASED ON DUAL ATTENTION MECHA- NISM AND DEPTH-AWARE LOSS

Xinchen Ye1, Mingliang Zhang1,2, Rui Xu1, Wei Zhong1, Xin Fan1, Zhu Liu1, Jiaao Zhang1

1Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China, 2Dalian University of Technology of Liaoning Province, China

15:45 TOWARDS HIGH-QUALITY INTRINSIC IMAGES IN THE WILD

Gang Fu1, Qing Zhang2, Chunxia Xiao1

1Wuhan University, China, 2Sun Yat-sen University, China

16:00 UNSUPERVISED LEARNING FOR OPTICAL FLOW ESTIMATION USING PYRAMID CONVOLU- TION LSTM

Shuosen Guan1,3, Haoxin Li2,3, Wei-Shi Zheng1,3

1School of Data and Computer Science, Sun Yat-sen University, China, 2School of Electronics and Information Tech- nology, Sun Yat-sen University, China, 3Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China

16:15 MAST: MASK-ACCELERATED SHEARLET TRANSFORM FOR DENSELY-SAMPLED LIGHT FIELD RECONSTRUCTION

Yuan Gao1, Robert Bregovic2, Atanas Gotchev2, Reinhard Koch1

1Kiel University, German, 2Tampere University, Finland

68 Tuesday, July 9, 2019

O-08: Object Detection I

Time: 15:30 - 16:30 PM

Room: 5DE

Chair: Wengang Zhou University of Science and Technology of China, China

15:30 CODA: COUNTING OBJECTS VIA SCALE-AWARE ADVERSARIAL DENSITY ADAPTION

Li Wang1, Yongbo Li2, Xiangyang Xue1

1Fudan University, China, 2Megvii Inc (Face++), China

15:45 PDNET: PRIOR-MODEL GUIDED DEPTH-ENHANCED NETWORK FOR SALIENT OBJECT

Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H Li, Ge Li

SECE, China, Shenzhen Graduate School, China, Peking University, China

16:00 CONTINUOUS SCALE ADAPTION FOR EFFICIENT BOX-BASED SCENE TEXT

Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong

Dalian University of Technology, China

16:15 MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETEC- TION NETWORK

Xiaobao Guo1, Jinxing Li2, Bingzhi Chen1, Guangming Lu1

1Harbin Institute of Technology (Shenzhen), China, 2The Chinese University of Hong Kong (Shenzhen), China

69 IEEE ICME2019

Tuesday, July 9, 2019

O-09: Emerging Applications of Deep Learning

Time: 16:45 - 17:45 PM

Room: 3CD

Chair: Aris Lalos Industrial System Institute, Greece

16:45 DMPR-PS: A NOVEL APPROACH FOR PARKING-SLOT DETECTION USING DIRECTIONAL MARKING-POINT REGRESSION

Junhao Huang1, Lin Zhang1, Ying Shen1, Huijuan Zhang1, Shengjie Zhao1, Yukai Yang2

1Tongji University, China, 2Uppsala University, Sweden

17:00 ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK- AWARE GATED DIS- CRIMINATOR

Yong-Xiang Lin1, Daniel Stanley Tan1, Wen-Huang Cheng2, Kai-Lung Hua1

1National Taiwan University of Science and Technology, Taiwan,2National Chiao Tung University,Taiwan

17:15 STOCHASTIC VIDEO GENERATION WITH DISENTANGLED REPRESENTATIONS

Maomao Li1, Chun Yuan1, Zhihui Lin1,2, Zhuobin Zheng1,2, Yangyang Cheng1,2

1Graduate School at Shenzhen, Tsinghua University, China, 2Tsinghua University, China

17:30 Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION

Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, and Philip S. Yu

Tsinghua University, China

70 Tuesday, July 9, 2019

O-10: Multimedia Quality Assessment and Enhancement

Time: 16:45 - 17:45 PM

Room: 3HI

Chair: Chun-Shien Lu Academia Sinica, China

16:45 ENERGY-BASED RECURRENT MODEL FOR STOCHASTIC MODELING OF MUSIC

Yingru Liu1, Dongliang Xie2, Xin Wang1

1Stony Brook University, USA, 2Peking University, China

17:00 RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS

Huaixuan Zhang1, Yuhai Lan3, Tao Dai1,2, Ruizhi Qiao 4, Ying Xu 1, Yao Yao 1, Shu-Tao Xia1,2

1Graduate School at Shenzhen, Tsinghua University, China, 2PCL Research Center of Networks and Communications, Peng Cheng Laboratory, China, 3Harbin Institute of Technology, China, 4Tencent Youtu Lab, China

17:15 RESIDUAL DILATED NETWORK WITH ATTENTION FOR IMAGE BLIND DENOISING

Guanqun Hou1, Yujiu Yang1, Jing-Hao Xue2

1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK

17:30 COLLABORATIVE DEEP REINFORCEMENT LEARNING FOR IMAGE CROPPING

Zhuopeng Li, Xiaoyan Zhang

Shenzhen University, China

71 IEEE ICME2019

Tuesday, July 9, 2019

O-11: Multimedia for Society and Health

Time: 16:45 - 17:45 PM

Room: 5BC

Chair: Wolfgang Hürst Utrecht University, Holland

16:45 SIMILARITY-AWARE DEEP ADVERSARIAL LEARNING FOR FACIAL AGE ESTIMATION

Penghui Sun, Hao Liu, Xing Wang, Zhenhua Yu1, Suping Wu

Ningxia University, Yinchuan, China

17:00 LEARNING TRANSMISSION FILTERING NETWORK FOR IMAGE-BASED PM2.5 ESTIMATION

Yinghong Liao1, Bin Qiu1, Zhuo Su1, Ruomei Wang1, Xiangjian He2,3

1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia

17:15 VIDEO-BASED EARLY ASD DETECTION VIA TEMPORAL PYRAMID NETWORKS

Yuan Tian, Xiongkou Min, Guangtao Zhai, Zhiyong Gao

Shanghai Jiao Tong Unversity, China

17:30 AUTOMATIC USER CATEGORIZATION THROUGH LARGE TRANSACTION DATA

Ying Zhang, YinJia Zhang, Qinpei Zhao, Weixiong Rao

Tongji University, China

72 Tuesday, July 9, 2019

O-12: Immersive Media

Time: 16:45 - 17:45 PM

Room: 5DE

Chair: Fernando Pereira Instituto Superior Técnico, Portugal

16:45 FEATURE PRESERVING AND UNIFORMITY-CONTROLLABLE POINT CLOUD SIMPLIFICATION ON GRAPH

Junkun Qi, WeiHu, Zongming Guo

Peking University, China

17:00 360SRL: A SEQUENTIAL REINFORCEMENT LEARNING APPROACH FOR ABR TILE-BASED 360 VID- EO STREAMING

Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen

University of Science and Technology of China, China

17:15 CONTENT-AWARE PERSPECTIVE PROJECTION OPTIMIZATION FOR VIEWPORT RENDERING OF 360° IMAGES

Falah Jabar, Joao Ascenso, Maria Paula Queluz

Universidade de Lisboa, Portugal

17:30 AN AR BENCHMARK SYSTEM FOR INDOOR PLANAR OBJECT TRACKING

Ziming Wu1, Jiabin Guo2, Shuangli Zhang2, Chen Zhao2, Xiaojuan Ma1

1Hong Kong University of Science and Technology, China, 2Netease AR, China

73 IEEE ICME2019

Wednesday, July 10, 2019

O-13: 3D and Stereo Computing

Time: 14:00 - 15:00 PM

Room: 3CD

Chair: Joao Ascenso Instituto Superior Técnico, Portugal

14:00 GLOBAL AS-CONFORMAL-AS-POSSIBLE NON-RIGID REGISTRATION OF MULTI-VIEW SCANS

Zhenchao Wu1, Kun Li1, Yu-Kun Lai2, Jingyu Yang1

1Tianjin University, China, 2Cardiff University, UK

14:15 A LIGHT-WEIGHTED NETWORK FOR FACIAL LAND MARK DETECTION VIA COMBINED HEAT- MAP AND COORDINATE REGRESSION

Zhengning Wang1, Longfei Feng1, Fanwei Zeng1, Guang Hu1, Xiang Zhang1, Xia Lv1, Fengjun Zhang2

1University of Electronic Science and Techonogy of China, China, 2No.30 Institute of CETC, China

14:30 LIGHT WEIGHT STEREO MATCHING VIA DEEP EXTRACTION AND INTEGRATION OF LOW AND HIGH LEVEL INFORMATION

Xianzhe Xu1, Yonghong Hou1, Pichao Wang2, Zhongyu Jiang1, Wanqing Li3

1Tianjin University, China, 2Alibaba Group (U.S.) Inc., 3University of Wollongong, Australia

14:45 JUSTLOOKUP: ONE MILLISECOND DEEP FEATURE EXTRACTION FOR POINT CLOUDS BY LOOK- UP TABLES

Hongxin Lin1,2, Zelin Xiao1,2, Yang Tan1,2, Hongyang Chao1, Shengyong Ding1

1Sun Yat-sen University, China, 2Pixtalks Tech, China

74 Wednesday, July 10, 2019

O-14: Machine Learning Applications in Image and Video Coding I

Time: 14:00 - 15:00 PM

Room: 3HI

Chair: Frederic Dufaux Universite Paris-Saclay, France

14:00 MULTIPLE GRAPH CONVOLUTIONAL NETWORKS FOR CO-SALIENCY DETECTION

Bo Jiang1, Xingyue Jiang1, Jin Tang1, Bin Luo1, Shilei Huang2

1Anhui University, China, 2PKU-HKUST Shenzhen Hong Kong Institution, China

14:15 QUANNET: JOINT IMAGE COMPRESSION AND CLASSIFICATION OVER CHANNELS WITH LIMIT- ED BANDWIDTH

Lahiru Dulanjana Chamain Hewa Gamage1, Sen-ching S Cheung2, Zhi Ding1

1University of Califirnia Davis, USA, 2University of Kentucky, USA

14:30 HIGH EFFICIENCY LIGHT FIELD COMPRESSION VIA VIRTUAL REFERENCE AND HIERARCHI- CAL MV-HEVC

Jiawen Gu, Bichuang Guo, Jiangtao Wen

Tsinghua University, China

14:45 SELF-PACED SUBSPACE CLUSTERING

Youfa Liu, Bo Du, Lefei Zhang

Wuhan University, China

75 IEEE ICME2019

Wednesday, July 10, 2019

O-15: Vision, Language and Text Processing

Time: 14:00 - 15:00 PM

Room: 5BC

Chair: Jiande Sun Shandong University, China

14:00 COLLOQUIAL IMAGE CAPTIONING

Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji

Xiamen University, China

14:15 IMPROVING CAPTIONING FOR LOW-RESOURCE LANGUAGES BY CYCLE CONSISTENCY

Yike Wu1, Shiwan Zhao2, Jia Chen3, Yinng Zhang1, Xiaojie Yuan1, Zhong Su2

1Nankai University, China, 2IBM Research, China, 3Carnegie Mellon University, USA

14:30 FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION

Zhuo Lei1,2, Chao Zhang1,2, Qian Zhang2, Guoping Qiu3,4

1International Doctoral Innovation Center, UK 2The University of Nottingham Ningbo China, China, 3Shenzhen Uni- versity, China, 4University of Nottingham, UK

14:45 CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENC ED STYLE EMBEDDING

Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong

Wuhan University of Technology, China

76 Wednesday, July 10, 2019

O-16: Media Classification and Segmentation II

Time: 14:00 - 15:00 PM

Room: 5DE

Chair: Zhu Li University of Missouri, USA

14:00 QUERY-FREE EMBEDDING ATTACK AGAINST DEEP LEARNING

Yujia Liu, Weiming Zhang, Nenghai Yu

University of Science and Technology of China, China

14:15 GRAPH ATTENTION NEURAL NETWORKS FOR POINT CLOUD RECOGNITION

Zongmin li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li

China University of Petroleum (East China), China

14:30 MAXIMAL CORRELATION EMBEDDING NETWORK FOR MULTILABEL LEARNING WITH MISS- ING LABELS

Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang

Tsinghua university, China

14:45 SELF-ADAPTION MULTI-CLASSIFIER FUSION NETWORKS FOR IMAGE RECOGNITION

Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang

Dalian University of Technology, China

77 IEEE ICME2019

Wednesday, July 10, 2019

O-17: AI for Human Understanding

Time: 15:30 - 16:30 PM

Room: 3CD

Chair: Bin Liu University of Science and Technology of China, China

15:30 VIDEO EMOTION RECOGNITION WITH CONCEPT SELECTION

Baohan Xu1, Yingbin Zheng2, Hao Ye2, Caili Wu3, Heng Wang1, Gufei Sun1

1Zhongan Technology, China, 2Videt Tech, USA, 3East China Normal University, China

15:45 GRAPH CONVOLUTIONAL LSTM MODEL FOR SKELETON-BASED ACTION RECOGNITION

Han Zhang, Yonghong Song, Yuanlin Zhang

Xi’an Jiaotong University, China

16:00 LEARNING RECURRENT STRUCTURE-GUIDED ATTENTION NETWORK FOR MULTI-PERSON POSE ESTIMATION

Zhongwei Qiu1, Kai Qiu2, Jianlong Fu2, Dongmei Fu1

1University of Science and Technology Beijing, China, 2Microsoft Reasearch, China

16:15 PCPCAD: PROPOSAL COMPLEMENTARY ACTION DETECTOR

Zhenying Fang1, Suguo Zhu1, Jun Yu1, Qi Tian2,3

1Hangzhou Dianzi University, China, 2Huawei Noah’s Ark Lab, China, 3The University of Texas at San Antonio, USA

78 Wednesday, July 10, 2019

O-18: Image Quality Metrics

Time: 15:30 - 16:30 PM Room: 3HI Chair: Patrick Le Callet Universite de Nantes, France

15:30 PERSONALITY DRIVEN MULTI-TASK LEARNING FOR IMAGE AESTHETIC ASSESSMENT

Leida Li1,2, Hancheng Zhu2, Sicheng Zhao3, Guiguang Ding4, Hongyan Jiang2, Allen Tan5

1Xidian University, China, 2China University of Mining and Technology, China, 3University of California Berkeley, USA, 4Tsinghua University, China, 5Tencent, China

15:45 VIDEO QUALITY TEMPORAL POOLING USING A VISIBILITY MEASURE

Chen Bai, Amy R. Reibman

Purdue University, USA 16:00 IMAGE QUALITY ASSESSMENT OF MULTI-EXPOSURE IMAGE FUSION FOR BOTH STATIC AND DYNAMIC SCENES

Yuming Fang1, Yan Zeng1, Hanwei Zhu1, Guangtao Zhai2

1Jiangxi University of Finance and Economics, China, 2Shanghai Jiao Tong University, China 16:15 NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON LOCAL TO GLOBAL FEATURE REGRESSION

Sumei Li, Jianwei Xue, Yongtian Han

Tianjin University, China

79 IEEE ICME2019

Wednesday, July 10, 2019

O-19: Multimedia Recommendations

Time: 15:30 - 16:30 PM

Room: 5BC

Chair: Rui Wang Tongji University, China

15:30 HERDING EFFECT BASED ATTENTION FOR PERSONALIZED TIME-SYNC VIDEO RECOMMENDA- TION

Wenmian Yang1,2, Wenyuan Gao1, Xiaojie Zhou1, Weijia Jia1,2, Shaohua Zhang1,2, Yutao Luo1

1Shanghai JiaoTong University, China, 2University of Macau, China

15:45 SEQUENTIAL BEHAVIOR MODELING FOR NEXT MICRO-VIDEO RECOMMENDATION WITH COL- LABORATIVE TRANSFORMER

Shang Liu, Zhenzhong Chen

Wuhan University, China

16:00 BUTTONTIPS: DESIGNING WEB BUTTONS WITH SUGGESTIONS

Dawei Liu, Ying Cao, Rynson W.H. Lau, Antoni B. Chan

City University of Hongkong, China

16:15 KNOWING USER BETTER: MICRO-VIDEO RECOMMENDER SYSTEM BY JOINTLY OPTIMIZING TO CLICK-THROUGH AND PLAYTIME

Shengjie Ma, Zhengjun Zha, Feng Wu

University of Science and Technology of China, China

80 Wednesday, July 10, 2019

O-20: Search and Retrieval

Time: 15:30 - 16:30 PM

Room: 5DE

Chair: Jianquan Liu NEC Corporation, Japan

15:30 ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MOD- AL SIMILARITIES

Xin Wen1, Zhizhong Han1,2, Xinyu Yin1, Yu-Shen Liu1

1Tsinghua University, China, 2University of Maryland, USA

15:45 SEMI-SUPERVISED COMPATIBILITY LEARNING ACROSS CATEGORIES FOR CLOTHING MATCH- ING

Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang

Chinese Academy of Sciences, China

16:00 ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH

Kevin Lin1, Fan Yang2, Qiaosong Wang2, Robinson Piramuthu2

1University of Washington, USA, 2eBay Inc, USA

16:15 A MASK BASED DEEP RANKING NEURAL NETWORK FOR PERSON RETRIEVAL

Lei Qi1, Jing Huo1, Lei Wang2, Yinghuan Shi1, Yang Gao1

1Nanjing University, China, 2University of Wollongong, Australia

81 IEEE ICME2019

Wednesday, July 10, 2019

O-21: Media Understanding

Time: 16:45 - 17:45 PM

Room: 3CD

Chair: Lingyu Duan Peking University, China

16:45 DISCO: DEPTH INFERENCE FROM STEREO USING CONTEXT

Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluriy, Rituparna Sarkar, Pankaj Bajpai

Samsung Research Institute Bangalore, India

17:00 PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERA- TION

Yunian Chen1, Yanjie Wang3, Yang Zhang1, Yanwen Guo1,2

1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, Chi- na, 3Zhejiang University, China

17:15 UNTARGETED ADVERSARIAL ATTACK VIA EXPANDING THE SEMANTIC GAP

Aming Wu1, Yahong Han1, Quanxin Zhang2, Xiaohui Kuang3

1Tianjin University, China, 2Beijing Institute of Technology, China ,3National Key Laboratory of Science and Technol- ogy on Information System Security, China

17:30 LEARNING GOAL-ORIENTED VISUAL DIALOG AGENTS: IMITATING AND SURPASSING ANALYTIC EXPERTS

Yen-Wei Chang, Wen-Hsiao Peng

National Chiao Tung University, Taiwan

82 Wednesday, July 10, 2019

O-22: Super-resolution and Enhancement

Time: 16:45 - 17:45 PM

Room: 3HI

Chair: Ge Li Peking University, China

16:45 GAN-BASED MULTI-LEVEL MAPPING NETWORK FOR SATELLITE IMAGERY SUPER-RESOLU- TION

Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu

Wuhan University, China

17:00 QUALITY-GATED CONVOLUTIONAL LSTM FOR ENHANCING COMPRESSED VIDEO

Ren Yang3, Xiaoyan Sun1, Mai Xu2 and Wenjun Zeng1

1Microsoft Research, USA, 2Beihang University, China, 3ETH Zürich, Switzerland

17:15 COMPOUNDED LAYER-PRIOR UNROLLING: A UNIFIED TRANSMISSION-BASED IMAGE EN- HANCEMENT FRAMEWORK

Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo

Dalian University of Technology, China

17:30 DEEP PYRAMID VARIATION LEARNING FOR IMAGE INTERPOLATION

Fu Qiang, Wenhan Yang, Ying Li, and Jiaying Liu

Peking University, China

83 IEEE ICME2019

Wednesday, July 10, 2019

O-23: Pose and Action Recognition II

Time: 16:45 - 17:45 PM

Room: 5BC

Chair: Sheng Tang Institute of Computing Technology, Chinese Academy of Sciences, China

16:45 CLOTHES KEYPOINTS LOCALIZATION AND ATTRIBUTE RECOGNITION VIA PRIOR KNOWL- EDGE

Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang

Shanghai Jiao Tong University, China

17:00 SPATIO-TEMPORAL MULTI-FACTOR DISCRIMINANT ANALYSIS FOR INDIVIDUAL IDENTIFICA- TION

Yong Su, Zhiyong Feng

Tianjin University, China

17:15 CHANNEL-WISE TEMPORAL ATTENTION NETWORK FOR VIDEO ACTION RECOGNITION

Jianjun Lei1, Yalong Jia1, Bo Peng1, Qingming Huang2

1Tianjin University, China, 2University of Chinese Academy of Sciences, China

17:30 LOCALIZATION GUIDED FIGHT ACTION DETECTION IN SURVEILLANCE VIDEOS

Qichao Xu1, John See2, Weiyao Lin1

1Shanghai Jiao Tong University, China, 2Multimedia University, Malaysia

84 Wednesday, July 10, 2019

O-24: Image and Video Enhancements I

Time: 16:45 - 17:45 PM

Room: 5DE

Chair: Miaohui Wang Shenzhen University, China

16:45 RECURSIVE MULTI-STAGE UPSCALING NETWORK WITH DISCRIMINATIVE FUSION FOR SU- PER-RESOLUTION

Yue Lu1, Zhuqing Jiang1, Guodong Ju2, Liangheng Shen2, Aidong Men1

1Beijing University of Posts and Telecommunications, China, 2GuangDong TUS-TuWei Technology Co., Ltd, China

17:00 IMPROVING IMAGE SUPER-RESOLUTION VIA FEATURE RE-BALANCING FUSION

Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu

Xidian University, China

17:15 DIFFICULTY-AWARE IMAGE SUPER RESOLUTION VIA DEEP ADAPTIVE DUAL-NETWORK

Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen

Sun Yat-sen University, China

17:30 DENSE-CONNECTED RESIDUAL NETWORK FOR VIDEO SUPER-RESOLUTION

Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang and Dou Jin

Tianjin University, China

85 IEEE ICME2019

Thursday, July 11, 2019

O-25: Face and Person Analysis

Time: 14:00 - 15:00 PM

Room: 3CD

Chair: Hailin Shi JD AI Research, China

14:00 DYNAMIC CASCADED REGRESSION NETWORK WITH REINFORCEMENT LEARNING FOR RO- BUST FACE ALIGNMENT

Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li

University of Science and Technology of China, China

14:15 DEEP LEARNING FACE HALLUCINATION VIA ATTRIBUTES TRANSFER AND ENHANCEMENT

Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie and Jun Yu

University of Science and Technology of China, China

14:30 EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS USING MULTI-HYPERGRAPH NEU- RAL NETWORKS

Junjie Zhu1, Xibin Zhao1, Han Hu2, Yue Gao1

1Tsinghua University, China, 2Beijing Institute of Technology, China

14:45 GPS: GROUP PEOPLE SEGMENTATION WITH DETAILED PART INFERENCE

Yue Liao1, Tianrui Hui1, Chen Gao1, Si Liu2, Yao Sun3, Hefei Ling4, Bo Li2

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2Beihang University, China, 3iie, China, 4Huazhong University of Science and Technology, China

86 Thursday, July 11, 2019

O-26: Media Classification and Segmentation III

Time: 14:00 - 15:00 PM

Room: 3HI

Chair: Chenggang Yan Hangzhou Dianzi University, China

14:00 MULTI-LABEL IMAGE RECOGNITION WITH JOINT CLASS-AWARE MAP DISENTANGLING AND LABEL CORRELATION EMBEDDING

Zhao-Min Chen1,2, Xiu-Shen Wei2, Xin Jin2, Yanwen Guo1,3

1Nanjing University, China, 2Megvii Technology, China, 3Science and Technology on Information Systems Engineer- ing Laboraty, China

14:15 REAL TIME COMPRESSED VIDEO OBJECT SEGMENTATION

Zhentao Tan, Bin Liu, Weihai Li, Nenghai Yu

University of Science and Technology of China, China

14:30 ACCURATE AND FAST FINE-GRAINED IMAGE CLASSIFICATION VIA DISCRIMINATIVE LEARN- ING

Zhihui Wang1, Shijie Wang1, Pengbo Zhang1, Haojie Li1, Bo Liu2

1Dalian University of Technology, China, 2Shanghai Jiao Tong University, China

14:45 POSE2BODY: POSE-GUIDED HUMAN PARTS SEGMENTATION

Zhong Li1, Xin Chen2, Wangyiteng Zhou2, Yingliang Zhang2, Jingyi Yu2

1University of Delaware, USA, 2ShanghaiTech University, China

87 IEEE ICME2019

Thursday, July 11, 2019

O-27: Image and Video Enhancements II

Time: 14:00-15:00 PM

Room: 5BC

Chair: Ce Zhu University of Electronic Science & Technology of China, China

14:00 RESIDUAL MAGNIFIER: A DENSE INFORMATION FLOW NETWORK FOR SUPER RESOLUTION

Zhan Shu1, Mengcheng Cheng1, Biao Yang1, Zhuo Su1, Xiangjian He2,3

1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia

14:15 EVERYONE IS A CARTOONIST: SELFIE CARTOONIZATION WITH ATTENTIVE ADVERSARIAL NETWORKS

Xinyu Li, Wei Zhang, Tong Shen, Tao Mei

JD AI Research, China

14:30 SCALE-AWARE DEEP NETWORK WITH HOLE CONVOLUTION FOR BLIND MOTION DEBLURRING

Jichun Li, Ke Li, Bo Yan

Fudan University, China

14:45 REMOVING RAIN IN VIDEOS: A LARGE-SCALE DATABASE AND A TWO-STREAM CONVLSTM AP- PROACH

Tie Liu, Mai Xu and Zulin Wang

Beihang University, China

88 Thursday, July 11, 2019

O-28: Multimedia Learning and Adaptation

Time: 14:00 - 15:00 PM

Room: 5DE

Chair: Song Li Shanghai Jiao Tong University, China

14:00 TOWARDS QOS-AWARE CLOUD LIVE TRANSCODING: A DEEP REINFORCEMENT LEARNING AP- PROACH

Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang

Tsinghua University, China

14:15 HIGH SPEED RECURRENT REGRESSION NETWORK FOR VISUAL TRACKING

Ding Ma, Xiangqian Wu

Harbin Institute of Technology, China

14:30 PAAE: A UNIFIED FRAMEWORK FOR PREDICTING ANCHOR LINKS WITH ADVERSARIAL EM- BEDDING

Yanmin Shang1, Zhezhou Kang1, Yanan Cao1, Dongjie Zhang1, Yangxi Li2, Yang Li3, Yanbing Liu1

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2National Computer network Emergency Response technical Team,China, 3State Information Center, China

14:45 MANIFOLD ALIGNMENT AND DISTRIBUTION ADAPTATION FOR UNSUPERVISED DOMAIN ADAP- TATION

Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying

Shanghai University, China

89 IEEE ICME2019

Thursday, July 11, 2019

O-29: Person (Re-)Identification and People Detection

Time: 15:30 - 16:30 PM

Room: 3CD

Chair: Bingpeng Ma University of Science and Technology of China, China

15:30 PEDESTRIAN RE-IDENTIFICATION BASED ON TREE BRANCH NETWORK WITH LOCAL AND GLOBAL LEARNING

Hui Li1, Meng Yang24, Zhihui Lai1, Weishi Zheng2, Zitong Yu3

1Shenzhen University, China, 2Sun Yat-sen University, China, 3University of Oulu, Finland, 4Key Laboratory of Ma- chine Intelligence and Advanced Computing(SYSU), Ministry of Education, China

15:45 ADVERSARIAL BINARY CODING FOR EFFICIENT PERSON RE-IDENTIFICATION

Zheng Liu1, Jie Qin2, Annan Li1, Yunhong Wang1, and Luc Van Gool3

1Beihang University, China, 2Inception Institute of Artificial Intelligence, UAE, 3Computer Vision Laboratory, ETH Zurich, Switzerland

16:00 PERSON RE-IDENTIFICATION WITH GRADUAL BACKGROUND SUPPRESSION

Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao

Xidian University, China

16:15 MULTI-BRANCH CONTEXT-AWARE NETWORK FOR PERSON RE-IDENTIFICATION

Yingxin Zhu1, Xiaoqiang Guo2, Jianlei Liu1, Zhuqing Jiang1

1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, Beijing, China

90 Thursday, July 11, 2019

O-30: Multimedia and Language II

Time: 15:30 - 16:30 PM

Room: 3HI

Chair: Annan Li Beijing University of Aeronautics and Astronautics, China

15:30 POST-PROCESSING OF WORD REPRESENTATIONS VIA VARIANCE NORMALIZATION AND DY- NAMIC EMBEDDING

Bin Wang1, Fenxiao Chen1, Angela Wang2 and C.-C. Jay Kuo1

1University of Southern California, USA, 2University of California, Berkeley, USA

15:45 MULTI-MODAL LANGUAGE ANALYSIS WITH HIERARCHICAL INTERACTION-LEVEL AND SELEC- TION-LEVEL ATTENTION

Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou

Soochow University, China

16:00 MODELING THE CLAUSE-LEVEL STRUCTURE TO MULTIMODAL SENTIMENT ANALYSIS VIA REINFORCEMENT LEARNING

Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou

Soochow University, China

16:15 TWICE OPPORTUNITY KNOCKS SYNTACTIC AMBIGUITY: A VISUAL QUESTION ANSWERING MODEL WITH YES/NO FEEDBACK

Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin

Tianjin Polytechnic University, China

91 IEEE ICME2019

Thursday, July 11, 2019

O-31: Multimedia Communications and Localization

Time: 15:30 - 16:30 PM

Room: 5BC

Chair: Sanjeev Mehrotra Microsoft, USA

15:30 GEOCAPSNET: GROUND TO AERIAL VIEW IMAGE GEO-LOCALIZATION USING CAPSULE NET- WORK

Bin Sun1, Chen Chen2, Yingying Zhu1, Jianmin Jiang1

1Shenzhen University, China, 2University of North Carolina at Charlotte, USA

15:45 IMPROVING ROBUSTNESS OF DASH AGAINST NETWORK UNCERTAINTY

Bo Wang1,2, Fengyuan Ren1,2

1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China

16:00 HYBRID CONTROL-BASED ABR: TOWARDS LOW-DELAY LIVE STREAMING

Bo Wang1,2, Fengyuan Ren1,2, Chao Zhou3

1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China, 3Beijing Kuaishou Technology Co., Ltd, China

16:15 TAXI ORIGIN-DESTINATION DEMAND PREDICTION WITH CONTEXTUALIZED SPATIAL-TEMPO- RAL NETWORK

Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin

Sun Yat-sen University, China

92 Thursday, July 11, 2019

O-32: Multimedia Security, Privacy and Forensics II

Time: 15:30 - 16:30 PM

Room: 5DE

Chair: Wen Ji Institute of Computing Technology, Chinese Academy of Sciences, China

15:30 FAST IMAGE CLUSTERING BASED ON CAMERA FINGERPRINT ORDERING

Sahib Khan, Tiziano Bianchi

Politecnico di Torino, Italy

15:45 ENFORCING ACCESS CONTROL IN DISTRIBUTED VERSION CONTROL SYSTEMS

Xin Xu1,2, Quanwei Cai1,2, Jingqiang Lin1,2, Shiran Pan1,2, Liangqin Ren1,2

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sci- ences, China

16:00 ATTRIBUTE-BASED ACCOUNTABLE ACCESS CONTROL FOR MULTIMEDIA CONTENT WITH IN-NETWORK CACHING

Peixuan He1, Kaiping Xue1, Jie Xu1, Qiudong Xia1, Jianqing Liu2, Hao Yue3

1University of Science and Technology of China, China, 2University of Alabama in Huntsville, USA, 3San Francisco State University, USA

16:15 PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY

Liyue Fan

University at Albany, State University of New York, USA

93 IEEE ICME2019

Thursday, July 11, 2019

O-33: Multimedia Sensing and Signal Processing

Time: 16:45 - 17:45 PM

Room: 3HI

Chair: Zhi Jin Sun Yat-sen University, China

16:45 JOINTLY SOLVING DEBLURRING AND SUPER-RESOLUTION PROBLEMS WITH DUAL SUPER- VISED NETWORK

Zhenwen Liang, Dongyang Zhang, Jie Shao

University of Electronic Science and Technology of China, China

17:00 TWO-STAGED ACOUSTIC MODELING ADAPTION FOR ROBUST SPEECH RECOGNITION BY THE EXAMPLE OF GERMAN ORAL HISTORY INTERVIEWS

Michael Gref1,2, Christoph Schmidt1, Sven Behnke1,3, Joachim Köhler1

1Fraunhofer Institute for Intelligent Analysis and Information Systems, Germany, 2Niederrhein University of Applied Sciences, Germany, 3University of Bonn, Germany

17:15 AN ADAPTIVE AFFINITY GRAPH WITH SUBSPACE PURSUIT FOR NATURAL IMAGE SEGMENTA- TION

Yang Zhang1, Huiming Zhang1, Yanwen Guo1, Kai Lin2, Jingwu He1

1Nanjing University, China, 2Hubei University of Technology, China

17:30 PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIR- CULAR MICROPHONE ARRAY

Li He, Yi Zhou, Hongqing Liu

Chongqing University of Posts and Telecommunications, China

94 Thursday, July 11, 2019

O-34: Detection and Recognition

Time: 16:45 - 17:45 PM

Room: 5BC

Chair: Lifang Wu Beijing University of Technology, China

16:45 LOCALITY-CONSTRAINED SPATIAL TRANSFORMER NETWORK FOR VIDEO CROWD COUNTING

Yanyan Fang1, Biyun Zhan1, Wandi Cai1, Shenghua Gao2, Bo Hu1

1Fudan University, China, 2ShanghaiTech University, China

17:00 SPATIAL-AWARE NON-LOCAL ATTENTION FOR FASHION LANDMARK DETECTION

Yixin Li1, Shengqin Tang2, Yun Ye3, Jinwen Ma1

1Peking University, China, 2Xi’an Jiaotong University, China, 3JD AI Research, China

17:15 RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION

Wu Zheng1,2, Lin Li1,2, Zhaoxiang Zhang1,2, Yan Huang1,2, Liang Wang1,2

1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

17:30 MULTI-VIEW LEARNING FOR VEHICLE RE-IDENTIFICATION

Weipeng Lin1, Yidong Li1, Xiaoliang Yang1, Peixi Peng2, Junliang Xing2

1Beijing Jiaotong University, China, 2Institute of Automation, Chinese Academy of Sciences, China

95 IEEE ICME2019

Thursday, July 11, 2019

O-35: Multi-modal Media Computing and Human-machine Interaction

Time: 16:45 - 17:45 PM

Room: 5DE

Chair: Sanghoon Lee Yonsei University, Korea

16:45 MANY COULD BE BETTER THAN ALL: A NOVEL INSTANCE-ORIENTED ALGORITHMFOR MULTI-MODAL MULTI-LABEL PROBLEM

Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang

Nanjing University, China

17:00 AFFECTIVE VIDEO CONTENT ANALYSES BY USING CROSS-MODAL EMBEDDING LEARNING FEA- TURES

Benchao Li1,3, Zhenzhong Chen2, Shan Li3, WeiShi Zheng1,4

1Sun Yat-Sen University, China, 2Wuhan University, China, 3Tencent, America, 4Key Laboratory of Machine Intelli- gence and Advanced Computing, Ministry of Education, China

17:15 LEARNING A 3D GAZE ESTIMATOR WITH IMPROVED ITRACKER COMBINED WITH BIDIREC- TIONAL LSTM

Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen

Zhejiang University of Technology, China

17:30 DETECTION OF OCCLUDED ROAD SIGNS ON AUTONOMOUS DRIVING VEHICLES

Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang

University of North Texas, USA

96 Industry Track

Wednesday, July 10, 2019

Time: 14:00 - 15:00 PM

Room: 3B

Chair: Guanbin Li Sun Yat-sen University, China

LOCALIZING ADVERTS IN OUTDOOR SCENES

Soumyabrata Dev

The ADAPT SFI Research Centre, Ireland

HIERARCHICAL RECURSIVE NETWORK FOR SINGLE IMAGE SUPER RESOLUTION

Minglan Su1, Shenqi Lai2, Zhenhua Chai2, Xiaoming Wei2, Yong Liu1

1University of Posts and Telecommunications, China, 2Meituandianping Group, China

PARALLEL VOLUME RENDERING METHOD FOR OUT-OF-CORE NON-UNIFORMLY PARTITIONED DATASETS

Jian Xue1, Xiaoye Zhu1, Ke Lu1, Yutong Kou2

1University of Chinese Academy of Sciences, China, 2Huazhong University of Science & Technology, China

VEHICLE RE-IDENTIFICATION WITH REFINED PART MODEL

Xingan Ma1, Kuan Zhu1, Haiyun Guo2, Jinqiao Wang1, Min Huang1, Qinghai Miao1

1University of Chinese Academy of Sciences, China, 2Institute of Automation, Chinese Academy of Sciences, China

97 IEEE ICME2019

Poster Sessions Poster Session 1 & TMM Poster

Tuesday, July 9, 2019

P-01: Emerging Multimedia Applications and Technologies

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Chun Yuan Tsinghua University, China

[ID:1] PAY BY SHOWING YOUR PALM: A STUDY OF PALMPRINT VERIFICATION ON MOBILE PLATFORMS

Yingyi Zhang1, Lin Zhang1, Xiao Liu1, Shengjie Zhao1, Ying Shen1, Yukai Yang2

1Tongji University, China, 2Uppsala University, Sweden

[ID:2] REGULARIZE NETWORK SKIP CONNECTIONS BY GATING MECHANISMS FOR ELECTRON MICROSCO- PY IMAGE SEGMENTATION

Yuze Guo, Wenjing Huang, Yajing Chen, Shikui Tu

Shanghai Jiao Tong University, China

[ID:3] CROSS MODALITY ALIGNMENT OF MEDICAL VOLUMES USING SPATIO-SEMANTIC ATTENTIVE CY- CLE-GAN

Xiaohui Lin1, Yi Xu1, Mingda Wang1, Bingbing Ni1, Xiaokang Yang1, Guangyu Tao2, Xiaodan Ye2

1Shanghai Jiao Tong University, China, 2Shanghai Chest Hospital, China

[ID:4] A NEW APPROACH TO AUTOMATIC CLOTHING MATTING FROM MANNEQUINS

Bin Yuan1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1

1Tsinghua University Graduate School at Shenzhen, China, 2University College London, UK

[ID:5] CLUSTERING AND DYNAMIC SAMPLING BASED UNSUPERVISED DOMAIN ADAPTATION FOR PERSON RE-IDENTIFICATION

Jinlin Wu1,2, Shengcai, Liao3, Zhen Lei1,2, Xiaobo Wang4, Yang Yang1,2, Stan Z. Li1,2

1Institute of Automation Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Inception Institute of Artificial Intelligence, China,4 JD AI Research, China

[ID:6] SEMANTIC-EMBEDDING AND SHAPE-AWARE U-NET FOR ULTRASOUND EYEBALL SEGMENTATION

Fanchao Lin, Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang

98 University of Science and Technology of China, China

[ID:7] LECTURE2NOTE: AUTOMATIC GENERATION OF LECTURE NOTES FROM SLIDE-BASED EDUCATIONAL VIDEOS

Chengpei Xu1, Ruomei Wang1, Shujin Lin1, Xiaonan Luo2, Baoquan Zhao2, Lijie Shao1, Mengqiu Hu1

1Sun Yat-sen University, China, 2Guilin University of Electronic Technology, China

[ID:8] DATA-ADAPTIVE PACKING METHOD FOR COMPRESSION OF DYNAMIC POINT CLOUD SEQUENCES

Jianqiang Liu, Jian Yao, Jingmin Tu, Junhao Cheng

Wuhan University, China

[ID:9] SEMANTIC GAN: APPLICATION FOR CROSS-DOMAIN IMAGE STYLE TRANSFER

Pengfei Li, Meng Yang

Sun Yat-sen University, China

[ID:10] IMPROVING EXTREME LOW-LIGHT IMAGE DENOISING VIA RESIDUAL LEARNING

Paras Maharjan1, Li Li1, Zhu Li1, Ning Xu2, Chongyang Ma3, Yue Li4

1University of Missouri-Kansas City, USA, 2Amazon Go, USA, 3Kwai Inc., China, 4University of Science and Technology of China, China

[ID:11] USER PROFILING WITH CAMPUS WI-FI ACCESS TRACE AND NETWORK TRAFFIC

Yang Gao, Jun Tao, Li Zeng, Xiaoming Fang, Qian Fang, Xiaoyan Li

Southeast University, China

[ID:12] A NEW VISUAL INTERFACE FOR SEARCHING AND NAVIGATING SLIDE-BASED LECTURE VIDEOS

Baoquan Zhao1, Songhua Xu2, Shujin Lin3, Ruomei Wang3 and Xiaonan Luo1

1Guilin University of Electronic Technology, China, 2University of South Carolina, Columbia, USA, 3Sun Yat-sen University, China

99 IEEE ICME2019

Tuesday, July 9, 2019

P-02: Media Classification and Segmentation I

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Toshihiko Yamasaki University of Tokyo, Japan

[ID:13] SELF-ATTENTIVE NETWORKS FOR ONE-SHOT IMAGE RECOGNITION

Pin Fang1, Yisen Wang2, Yuan Luo1

1Shanghai Jiao Tong University, China, 2JD AI Research, China

[ID:14] TREE-STRUCTURED KRONECKER CONVOLUTIONAL NETWORK FOR SEMANTIC SEGMENTATION

Tianyi Wu1,2, Sheng Tang1, Rui Zhang1,2, Juan Cao1, Jintao Li1

1Institute of Computing Technology, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

[ID:15] PART-BASED CONVOLUTIONAL NETWORK FOR IMBALANCED AGE ESTIMATION

Yixin Zhu, Jun-Yong Zhu, Wei-Shi Zheng

Sun Yat-sen University, China

[ID:16] LEARNING TO DISTINGUISH: A GENERAL METHOD TO IMPROVE COMPARE-BASED ONE-SHOT LEARNING FRAMEWORKS FOR SIMILAR CLASSES

Qiuzheng Chen, Ruoyu Yang

Nanjing University, China

[ID:17] PREDICTABILITY ANALYZING: DEEP REINFORCEMENT LEARNING FOR EARLY ACTION RECOGNI- TION

Xiaokai Chen1,2, Ke Gao1, Juan Cao1

1Institute of Computing Technology, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

[ID:18] A FAST END-TO-END METHOD WITH STYLE TRANSFER FOR ROOM LAYOUT ESTIMATION

Junming Chen, Jie Shao, Dongyang Zhang, Xuehui Wu

University of Electronic Science and Technology of China, China

[ID:19] RESOLVING INTRA-CLASS IMBALANCE FOR GAN-BASED IMAGE AUGMENTATION

Lijyun Huang, Kate Ching-Ju Lin, Yu-Chee Tseng

100 National Chiao Tung University, Taiwan

[ID:20] END-TO-END PANOPTIC SEGMENTATION WITH PIXEL-LEVEL NON-OVERLAPPING EMBEDDING

Weitong Zhang1,2,4, Qieshi Zhang1,2, Jun Cheng1,2, Cong Bai3, Pengyi Hao3

1Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China, 2The Chinese University of Hong Kong, China, 3Zhejiang University of Technology, China 4Shaanxi Normal University, China

[ID:21] ROBUST EMBEDDING FRAMEWORK WITH DYNAMIC HYPERGRAPH FUSION FOR MULTI-LABEL CLASSIFICATION

Kaixiang Wang

Nanjing Normal University, China

101 IEEE ICME2019

Tuesday, July 9, 2019

P-03: Oral-05 to Oral-12

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Jian Zhang University of Technology Sydney, Australia

[ID:22] PARTICLE SWARM LOSS FOR LIGHTWEIGHT OBJECT DETECTION

Peizhen Zhang1,2, Feng Zheng3, Junlong Du2, Jun Zhang2, Xiaowei Guo2, Wei-Shi Zheng1

1Sun Yat-sen University, China, 2Youtu Lab, Tencent, China, 3Southern University of Science and Technology, China

[ID:23] INCORPORATING CATEGORY TAXONOMY IN DEEP REINFORCEMENT LEARNING BASED IMAGE HASHING

Qiang Fu1, Linsen Dong2, Ziyuan Liu2, Yong Luo2, Yonggang Wen2, Ying Li1, Ling-Yu Duan3

1Peking University, China, 2Nanyang Technological University, Singapore, 3Peking University, China

[ID:24] TRUNCATED GRADIENT CONFIDENCE-WEIGHTED BASED ONLINE LEARNING FOR IMBALANCE STREAMING DATA

Ji Hu1, Chenggang Yan1, Xing Liu1, Jiyong Zhang1, Dongliang Peng1, Yi Yang2

1HangZhou DianZi University, China, 2UTS, Australia

[ID:25] UAV TARGET TRACKING BY DETECTION VIA DEEP NEURAL NETWORKS

Mohamed A. Kassab1, Ali Maher2, Fathy Elkazzaz3, Zhang Baochang1,4

1Beihang University, China, 2Military Technical college, Egypt, 3 Benha University, Egypt, 4Shenzhen Academy of Aerospace Technology, China

[ID:26] QUARTER-POINT CODEWORD EXPANSION FOR PRODUCT QUANTIZATION

Shan An1,2, Zhibiao Huang1, Guangfu Che1, Xianglong Liu2, Xin Ma3, Yu Chen1

1Department of Data Intelligence, JD.com, China, 2Beihang University, China, 3Shandong University, China

[ID:27] CONTEXT-AWARE AFFECTIVE GRAPH REASONING FOR EMOTION RECOGNITION

Minghui Zhang, Yumeng Liang, Huadong Ma

Beijing University of Posts and Telecommunications, China

[ID:28] SPL: EXPLOITING UNLABELED DATA FOR MULTI-LABEL IMAGE CLASSIFICATION

102 Weibo Zhang1,2, Fuqing Zhu1, Jiao Dai1, Songlin Hu1, Jizhong Han1, Tao Guo1

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2School of Cyber Security, University of Chinese Academy of Sciences, China

[ID:29] MLTS: A MULTI-LANGUAGE SCENE TEXT SPOTTER

Yu Zhou1, Shancheng Fang2, Hongtao Xie1, Zheng-Jun Zha1, Yongdong Zhang1

1University of Science and Technology of China, China, 2Institute of Information Engineering, Chinese Academy of Sciences, China

[ID:30] UNSUPERVISED MONOCULAR DEPTH ESTIMATION BASED ON DUAL ATTENTION MECHANISM AND DEPTH-AWARE LOSS

Xinchen Ye1, Mingliang Zhang1,2, Rui Xu1, Wei Zhong1, Xin Fan1, Zhu Liu1, Jiaao Zhang1

1Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China, 2Dalian University of Technolo- gy of Liaoning Province, China

[ID:31] TOWARDS HIGH-QUALITY INTRINSIC IMAGES IN THE WILD

Gang Fu1, Qing Zhang2, Chunxia Xiao1

1Wuhan University, China, 2Sun Yat-sen University, China

[ID:32] UNSUPERVISED LEARNING FOR OPTICAL FLOW ESTIMATION USING PYRAMID CONVOLUTION LSTM

Shuosen Guan1,3, Haoxin Li2,3, Wei-Shi Zheng1,3

1School of Data and Computer Science, Sun Yat-sen University, China, 2School of Electronics and Information Technology, Sun Yat-sen University, China, 3Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China

[ID:33] MAST: MASK-ACCELERATED SHEARLET TRANSFORM FOR DENSELY-SAMPLED LIGHT FIELD RECONSTRUCTION

Yuan Gao1, Robert Bregovic2, Atanas Gotchev2, Reinhard Koch1

1Kiel University, Germany, 2Tampere University, Finland

[ID:34] CODA: COUNTING OBJECTS VIA SCALE-AWARE ADVERSARIAL DENSITY ADAPTION

Li Wang1, Yongbo Li2, Xiangyang Xue1

1Fudan University, China, 2Megvii Inc (Face++), China

[ID:35] PDNET: PRIOR-MODEL GUIDED DEPTH-ENHANCED NETWORK FOR SALIENT OBJECT

Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H Li, Ge Li

SECE, Shenzhen Graduate School, Peking University, China

103 IEEE ICME2019

[ID:36] CONTINUOUS SCALE ADAPTION FOR EFFICIENT BOX-BASED SCENE TEXT

Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong

Dalian University of Technology, China

[ID:37] MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK

Xiaobao Guo1, Jinxing Li2, Bingzhi Chen1, Guangming Lu1

1Harbin Institute of Technology (Shenzhen), China, 2The Chinese University of Hong Kong(Shenzhen), China

[ID:38] DMPR-PS: A NOVEL APPROACH FOR PARKING-SLOT DETECTION USING DIRECTIONAL MARK- ING-POINT REGRESSION

Junhao Huang1, Lin Zhang1, Ying Shen1, Huijuan Zhang1, Shengjie Zhao1, Yukai Yang2

1Tongji University, China, 2Uppsala University, Sweden

[ID:39] ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK-AWARE GATED DISCRIM- INATOR

Yong-Xiang Lin1, Daniel Stanley Tan1, Wen-Huang Cheng2, Kai-Lung Hua1

1National Taiwan University of Science and Technology, Taiwan, 2National Chiao Tung University, Taiwan

[ID:40] STOCHASTIC VIDEO GENERATION WITH DISENTANGLED REPRESENTATIONS

Maomao Li1, Chun Yuan1, Zhihui Lin1,2, Zhuobin Zheng1,2, Yangyang Cheng1,2

1Graduate School at Shenzhen, Tsinghua University, China, 2Tsinghua University, China

[ID:41] Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION

Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, and Philip S. Yu

Tsinghua University, China

[ID:42] ENERGY-BASED RECURRENT MODEL FOR STOCHASTIC MODELING OF MUSIC

Yingru Liu1, Dongliang Xie2, Xin Wang1

1Stony Brook University, USA, 2Beijing University of Posts and Telecommunications

[ID:43] RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS

Huaixuan Zhang1, Yuhai Lan3, Tao Dai1,2, Ruizhi Qiao4, Ying Xu1, Yao Yao1, Shu-Tao Xia1,2

1Graduate School at Shenzhen, Tsinghua University, China, 2PCL Research Center of Networks and Communications, Peng Cheng Laboratory, China, 3Harbin Institute of Technology, China, 4Tencent Youtu Lab, China

104 [ID:44] RESIDUAL DILATED NETWORK WITH ATTENTION FOR IMAGE BLIND DENOISING

Guanqun Hou1, Yujiu Yang1, Jing-Hao Xue2

1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK

[ID:45] COLLABORATIVE DEEP REINFORCEMENT LEARNING FOR IMAGE CROPPING

Zhuopeng Li, Xiaoyan Zhang

Shenzhen University, China

[ID:46] SIMILARITY-AWARE DEEP ADVERSARIAL LEARNING FOR FACIAL AGE ESTIMATION

Penghui Sun, Hao Liu, Xing Wang, Zhenhua Yu1, Suping Wu

Ningxia University, China

[ID:47] LEARNING TRANSMISSION FILTERING NETWORK FOR IMAGE-BASED PM2.5 ESTIMATION

Yinghong Liao1, Bin Qiu1, Zhuo Su1, Ruomei Wang1, Xiangjian He2,3

1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia

[ID:48] VIDEO-BASED EARLY ASD DETECTION VIA TEMPORAL PYRAMID NETWORKS

Yuan Tian, Xiongkou Min, Guangtao Zhai, Zhiyong Gao

Shanghai Jiao Tong Unversity, China

[ID:49] AUTOMATIC USER CATEGORIZATION THROUGH LARGE TRANSACTION DATA

Ying Zhang, YinJia Zhang, Qinpei Zhao, Weixiong Rao

Tongji University, China

[ID:50] FEATURE PRESERVING AND UNIFORMITY-CONTROLLABLE POINT CLOUD SIMPLIFICATION ON GRAPH

Junkun Qi, WeiHu, Zongming Guo

Peking University, China

[ID:51] 360SRL: A SEQUENTIAL REINFORCEMENT LEARNING APPROACH FOR ABR TILE-BASED 360 VIDEO STREAMING

Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen

University of Science and Technology of China, China

105 IEEE ICME2019

[ID:52] CONTENT-AWARE PERSPECTIVE PROJECTI ON OPTIMIZATION FOR VIEWPORT RENDERING OF 360° IMAGES

Falah Jabar, Joao Ascenso, Maria Paula Queluz

Universidade de Lisboa, Portugal

[ID:53] AN AR BENCHMARK SYSTEM FOR INDOOR PLANAR OBJECT TRACKING

Ziming Wu1, Jiabin Guo2, Shuangli Zhang2, Chen Zhao2, Xiaojuan Ma1

1Hong Kong University of Science and Technology, China, 2Netease AR, China

Tuesday, July 9, 2019

TMM Poster

Time: 13:30 - 15:00 PM

Room: 3rd Floor

[ID:54] ENHANCING IMAGE WATERMARKING WITH ADAPTIVE EMBEDDING PARAMETER AND PSNR GUAR- ANTEE

Baoning Niu

Taiyuan University of Technology, China

106 Poster Session 2

Tuesday, July 9, 2019

P-04: Multimedia Analysis, Search and Recommendation

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Kate Ching-Ju Lin National Chiao Tung University, Taiwan

[ID:1] MULTI-SCALE SCENE TEXT DETECTION VIA RESOLUTION TRANSFORM

Peirui Cheng, Weiqiang Wang, Yuanqiang Cai

University of Chinese Academy of Sciences, China

[ID:2] TOWARDS ACCURATE INSTANCE-LEVEL TEXT SPOTTING WITH GUIDED ATTENTION

Haiyan Wang, Xuejian Rong, Yingli Tian

The City College of New York, USA

[ID:3] MULTI-SCALE GEM POOLING WITH N-PAIR CENTER LOSS FOR FINE-GRAINED IMAGE SEARCH

Youming Deng, Xianming Lin, Run Li, Rongrong Ji

Xiamen University, China

[ID:4] SEMI-SUPERVISED SEMANTIC-PRESERVING HASHING FOR EFFICIENT CROSS-MODAL RETRIEVAL

Xingzhi Wang1,2, Xin Liu1,2, Zhikai Hu1, Nannan Wang2, Wentao Fan1, Ji-Xiang Du1

1Huaqiao University, China, 2Xidian University, China

[ID:5] ROBUST MULTI-VIEW HASHING FOR CROSS-MODAL RETRIEVAL

Haitao Wang, Hui Chen, Min Meng, JiGang Wu

Guangdong University of Technology, China

[ID:6] SCENE TEXT RECOGNITION VIA GATED CASCADE ATTENTION

Siwei Wang, Yongtao Wang, Xiaoran Qin, Qijie Zhao, Zhi Tang

Peking University, China

[ID:7] TEXT-ATTENTIONAL CONDITIONAL GENERATIVE ADVERSARIAL NETWORK FOR SUPER-RESOLU- TION OF TEXT IMAGES

107 IEEE ICME2019

Yuyang Wang, Feng Su and Ye Qian

Nanjing University, China

[ID:8] ONLINE LEARNING TO RANK IN A LISTWISE APPROACH FOR INFORMATION RETRIEVAL

Fan Ma1, Haoyun Yang2, Haibing Yin2, Xiaofeng Huang2, Chenggang Yan2, Xiang Meng2

1University of Technology Sydney, Australia, 2Hangzhou Dianzi University, China

108 Tuesday, July 9, 2019

P-05: Pose and Action Recognition I

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Yicong Zhou University of Macau, China

[ID:9] RECOGNIZING MICRO ACTIONS IN VIDEOS: LEARNING MOTION DETAILS VIA SEGMENT-LEVEL TEM- PORAL PYRAMID

Yang Mi1, Song Wang1,2

1University of South Carolina, USA, 2Tianjin University, China

[ID:10] ENTANGLEMENT LOSS FOR CONTEXT-BASED STILL IMAGE ACTION RECOGNITION

Miao Xin1, Shuhang Wang2, Jian Cheng1

1Institute of Automation, Chinese Academy of Sciences, China, 2Harvard University, USA

[ID:11] BI-DIRECTIONAL MESSAGE PASSING BASED SCANET FOR HUMAN POSE ESTIMATION

Lu Zhou, Yingying Chen, Jinqiao Wang, Ming Tang and Hanqing Lu

Institute of Automation, Chinese Academy of Sciences, China

[ID:12] SPATIAL MASK CONVLSTM NETWORK AND INTRA-CLASS JOINT TRAINING METHOD FOR HUMAN ACTION RECOGNITION IN VIDEO

Jingjun Chen, Yonghong Song, Yuanlin Zhang

Xi’an Jiaotong University, China

[ID:13] SELF-ATTENTION GUIDED DEEP FEATURES FOR ACTION RECOGNITION

Renyi Xiao1, Yonghong Hou1, Zihui Guo1, Chuankun Li1, Pichao Wang2, Wanqing Li3

1Tianjin University, China, 2Alibaba Group (U.S.) Inc., USA, 3University of Wollongong, Australia

[ID:14] LEARNING SHAPE-MOTION REPRESENTATIONS FROM GEOMETRIC ALGEBRA SPATIO-TEMPORAL MODEL FOR SKELETON-BASED ACTION RECOGNITION

Yanshan Li1, Rongjie Xia1, Xing Liu1, Qinghua Huang2

1Shenzhen University, China, 2Northwestern Polytechnical University, China

[ID:15] ACPNET: ANCHOR-CENTER BASED PERSON NETWORK FOR HUMAN POSE ESTIMATION AND IN- STANCE SEGMENTATION

109 IEEE ICME2019

Yang Bai, Weiqiang Wang

University of Chinese Academy of Sciences, China

[ID:16] SPATIO-TEMPORAL MULTI-SCALE SOFT QUANTIZATION LEARNING FOR SKELETON-BASED HUMAN ACTION RECOGNITION

Jianyu Yang1, Chen Zhu1, Junsong Yuan2

1Soochow University, China, 2State University of New York at Buffalo, USA

[ID:17] LPHD: A LARGE-SCALE HEAD POSE DATASET FOR RGB IMAGES

Wei Sun1, Yezhao Fan1, Xiongkuo Min1, Shihao Peng1, Siwei Ma2 and Guangtao Zhai1

1Shanghai Jiao Tong University, China, 2Peking University, China

110 Tuesday, July 9, 2019

P-06: Person and Emotion Understanding

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Zheng Wang National Institute of Informatics, Japan

[ID:18] HUMAN-CENTERED EMOTION RECOGNITION IN ANIMATED GIFS

Zhengyuan Yang, Yixuan Zhang, Jiebo Luo

University of Rochester, USA

[ID:19] DEEP SEMI-SUPERVISED PERSON RE-IDENTIFICATION WITH EXTERNAL MEMORY

Qize Yang, Ancong Wu, Wei-Shi Zheng

Sun Yat-sen University, China

[ID:20] CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION

Tanzila Rahman1, Mrigank Rochan2, Yang Wang2

1University of British Columbia, Canada, 2University of Manitoba, Canada

[ID:21] POOLING MAP ADAPTATION IN CONVOLUTIONAL NEURAL NETWORK FOR FACIAL EXPRESSION RECOGNITION

Zhiyuan Li1, Shizhong Han2, Ahmed Shehab Khan1, Jie Cai1, Zibo Meng3, James O’Reilly1, Yan Tong1

1University of South Carolina, USA, 212Sigma Technologies, China, 3Innopeak Technology Inc., USA

[ID:22] FAST PERSON SEARCH PIPELINE

Jianheng Li, Fuhang Liang, Yuanxun Li, Wei-Shi Zheng

Sun Yat-sen University, China

[ID:23] DYNAMIC REGION DIVISION FOR ADAPTIVE LEARNING PEDESTRIAN COUNTING

Gaoqi He1, Zhenwei Ma2, Binhao Huang2, Bin Sheng3, Yubo Yuan2

1East China Normal University, China, 2East China University of Science and Technology, China, 3Shanghai Jiao Tong Uni- versity, China

111 IEEE ICME2019

[ID:24] ANOTHER DIMENSION: TOWARDS MULTI-SUBNET NEURAL NETWORK FOR IMAGE SENTIMENT ANALYSIS

Jing Zhang, Han Sun, Zhe Wang, Tong Ruan

East China University of Science and Technology, China

[ID:25] TWO-STAGE MODEL FOR SOCIAL RELATIONSHIP UNDERSTANDING FROM VIDEOS

Pilin Dai, Jinna Lv, Bin Wu

Beijing University of Posts and Telecommunications, China

[ID:26] FPN++: A SIMPLE BASELINE FOR PEDESTRIAN DETECTION

Junhao Hu, Lei Jin, Shenghua Gao

ShanghaiTech University, China

[ID:27] AN END-TO-END LEARNING APPROACH FOR MULTIMODAL EMOTION RECOGNITION: EXTRACTING COMMON AND PRIVATE INFORMATION

Fei Ma, Wei Zhang, Yang Li, Shao-Lun Huang, Lin Zhang

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

112 Tuesday, July 9, 2019

P-07: Best Papers and Oral-01 to Oral-04

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Xiaoping Zhang Ted Rogers School of Management, Ryerson University, Canada

[ID:28] AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWL- EDGE DISTILLATION

Yu Hao1, Yanwei Fu1, Yu-Gang Jiang1,2, Qi Tian3

1Fudan University, China, 2Jilian Technology Group(Video++) ,China, 3Huawei Noah’s Ark Lab, China

[ID:29] REAL-TIME INDOOR SCENE RECONSTRUCTION WITH RGBD AND INERTIAL INPUT

Zunjie Zhu1, Feng Xu2, Chenggang Yan1, Xinhong Hao3, Xiangyang Ji2, Yongdong Zhang4 ,Qionghai Dai2

1Hangzhou Dianzi University, China, 2Tsinghua University, China, 3Beijing Institute of Technology, China, 4University of Sci- ence and Technology of China, China

[ID:30] DOUBLY SEMI-SUPERVISED MULTIMODAL ADVERSARIAL LEARNING FOR CLASSIFICATION, GENER- ATION AND RETRIEVAL

Changde Du1, Changying Du2, Huiguang He1

1Institute of Automation Chinese Academy of Sciences, China, 2Huawei Noah’s Ark Lab, China

[ID:31] TOWARDS DIGITAL RETINA IN SMART CITIES: A MODEL GENERATION, UTILIZATION AND COMMU- NICATION PARADIGM

Yihang Lou1, Ling-Yu Duan1, Yong Luo1, Ziqian Chen1, Tongliang Liu2, Shiqi Wang3, Wen Gao1

1Peking University, China, 2University of Sydney, Australia, 3City University of Hongkong, China, 4The Peng Cheng Labora- tory, China

[ID:32] SDP: AN IMPROVED BASELINE ESTIMATION MODEL BASED ON STANDARD DEVIATION PROPOR- TION

Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang

Northeastern University, China

[ID:33] CITATION RECOMMENDATION BASED ON WEIGHTED HETEROGENEOUS INFORMATION NETWORK CONTAINING SEMANTIC LINKING

Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang

Anhui University, China

113 IEEE ICME2019

[ID:34] FUSION-SUPERVISED DEEP CROSS-MODAL HASHING

Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang

Shandong Normal University, China

[ID:35] DOMAIN UNCERTAINTY BASED ON INFORMATION THEORY FORCROSS-MODAL HASH RETRIEVAL

Wei Chen1, Nan Pu1, Yu Liu2, Erwin M. Bakker1, Michael S. Lew1

1 Leiden University, Holland, 2 ESAT-PSI, KU Leuven, Belgium

[ID:36] ADAPTIVE PLANE PROJECTION FOR VIDEO-BASED POINT CLOUD CODING

Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira

Instituto Superior Técnico, Universidade de Lisboa - Instituto de Telecomunicações, Lisboa, Portugal

[ID:37] FAST CU PARTITIONING ALGORITHM FOR H.266/VVC INTRA-FRAME CODING

Ting Fu1, Hao Zhang 1, Fan Mu1, Huanbang Chen2

1Central South University, China, 2Huawei Base, China

[ID:38] TWO-STAGE FAST MULTIPLE TRANSFORM SELECTION ALGORITHM FOR VVC INTRA CODING

Ting Fu1, Hao Zhang 1, Fan Mu1, Huanbang Chen2

1Central South University, China, 2Huawei Base, China

[ID:39] HISTORY-BASED MOTION VECTOR PREDICTION FOR FUTURE VIDEO CODING

Junru Li1, Meng Wang2, Li Zhang3, Kai Zhang3, Hongbin Liu3, Shiqi Wang2, Siwei Ma1, Wen Gao1

1Peking University, China, 2City University of Hong Kong, China, 3Bytedance Inc., USA

[ID:40] AMS-SFE: TOWARDS AN ALIGNMENT OF MANIFOLD STRUCTURES VIA SEMANTIC FEATURE EX- PANSION FOR ZERO-SHOT LEARNING

Jingcai Guo, Song Guo

The Hong Kong Polytechnic University, China

[ID:41] LOW-SHOT PALMPRINT RECOGNITION BASED ON META-SIAMESE NETWORK

Xuefeng Du1, Dexing Zhong1,2, Pengna Li1

1Xi’an Jiaotong University, China, 2Research Institute of Xi’an Jiaotong University, China

114 [ID:42] SR-GAN: SEMANTIC RECTIFYING GENERATIVE ADVERSIAL NETWORK FOR ZERO-SHOT LEARNING

Zihan Ye1,5, Fan Lyu1,2, Linyan Li3, Qiming Fu1,6, Jinchang Ren4, Fuyuan Hu1,7

1Suzhou University of Science and Technology, China, 2Tianjin University, China, 3Suzhou Institute of Trade & Commerce, China, 4University of Strathclyde, UK, 5Virtual Reality Key Laboratory of Intelligent Interaction and Application Technology of Suzhou, China, 6Key Laboratory of Intelligent Building Energy Efficiency, China, 7Suzhou Key Laboratory for Big Data and Information Service, China

[ID:43] COMPARE MORE NUANCED: PAIRWISE ALIGNMENT BILINEAR NETWORK FOR FEW-SHOT FINE- GRAINED LEARNING

Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu

University of Technology Sydney, Australia

[ID:44] FEATURE-AWARE AND CONTENT-WISE DENOISING OF 3D STATIC AND DYNAMIC MESHES US- ING DEEP AUTOENCODERS

Gerasimos Arvanitis1, Aris S. Lalos2, and Konstantinos Moustakas1

1University of Patras, Greece, 2"ATHENA" Research Center, Greece

[ID:45] REAL-TIME MONOCULAR VISUAL SLAM BY COMBINING POINTS AND LINES

Xinyu Wei, Jun Huang, Xiaoyuan Ma

Shanghai Advanced Research Institute, China

[ID:46] F-NUMBER ADAPTATION FOR MAXIMIZING THE SENSOR USAGE OF LIGHT FIELD CAMERAS

Chuanpu Li, Xin Jin, Junke Li and Qionghai Dai

Graduate School at Shenzhen, Tsinghua University, China

[ID:47] BLIND CALIBRATION FOR FOCUSED PLENOPTIC CAMERAS

Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen and Qionghai Dai

Graduate School at Shenzhen, Tsinghua University, China

115 IEEE ICME2019

Poster Session 3 & Demo Session 1

Wednesday, July 10, 2019

P-08: Multimedia Creation and Enhancement

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Jing-Hao Xue University College London, UK

[ID:1] BOUNDARY AWARE MULTI-FOCUS IMAGE FUSION USING DEEP NEURAL NETWORK

Haoyu Ma, Juncheng Zhang, Shaojun Liu, Qingmin Liao

Graduate School at Shenzhen, Tsinghua University, China

[ID:2] A MULTI-LEVEL AGGREGATED NETWORK FOR IMAGE RESTORATION

Chenxi Ma, Weimin Tan, Bahetiyaer Bare, and Bo Yan

Fudan University, China

[ID:3] UNSUPERVISED FACIAL IMAGE SYNTHESIS USING TWO-DISCRIMINATOR ADVERSARIAL AUTOEN- CODER NETWORK

Xuehui Wu, Jie Shao, Dongyang Zhang, Junming Chen

University of Electronic Science and Technology of China, China

[ID:4] FACIAL IMAGE INPAINTING USING MULTI-LEVEL GENERATIVE NETWORK

Jie Liu, Cheolkon Jung

Xidian University, China

[ID:5] A VIDEO POST-FILTER DEBLOCKING METHOD BASED ON TEMPORAL BOOSTING RESIDUAL NET- WORKS

Jianyu Wang1, Shaohui Liu1,2, Feng Jiang1,2, Xiaoshuai Sun1, Yongliang Liu3

1Harbin Institute of Technology, China, 2Pengcheng Laboratory, China, 3Alibaba Group, China

[ID:6] DISTILLING WITH RESIDUAL NETWORK FOR SINGLE IMAGE SUPER RESOLUTION

Xiaopeng Sun, Wen Lu, Rui Wang, Furui Bai

Xidian University, China

116 [ID:7] RDGAN: RETINEX DECOMPOSITION BASED ADVERSARIAL LEARNING FOR LOW-LIGHT ENHANCE- MENT

Junyi Wang, Weimin Tan, Xuejing Niu and Bo Yan

Fudan University, China

[ID:8] SINGLE IMAGE DE-RAINING VIA GENERATIVE ADVERSARIAL NETS

Shichao Li, Yonghong Hou, Huanjing Yue, Zihui Guo

Tianjin University, China

[ID:9] SWITCHGAN FOR MULTI-DOMAIN FAICAL IMAGE TRANSLATION

Yuanlue Zhu, Mengchao Bai, Linlin Shen, Zhiwei Wen

Shenzhen University, China

[ID:10] A FEATURE-BASED APPROACH FOR LIGHT FIELD VIDEO ENHANCEMENT

Michele Brizzi, Federica Battisti, Alessandro Neri

Roma Tre University, Italy

117 IEEE ICME2019

Wednesday, July 10, 2019

P-09: Multimedia and Vision I

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Qixiang Ye University of Chinese Academy of Sctiences, China

[ID:11] EASY TRANSFER LEARNING BY EXPLOITING INTRA-DOMAIN STRUCTURES

Jindong Wang1, Yiqiang Chen1, Han Yu2, Meiyu Huang3, Qiang Yang4

1Chinese Academy of Sciences, China, 2Nanyang Technological University, Singapore, 3China Academy of Space Technology, China, 4Hong Kong University of Science and Technology, China

[ID:12] SKELETON-BASED ACTION RECOGNITION WITH SYNCHRONOUS LOCAL AND NON-LOCAL SPA- TIO-TEMPORAL LEARNING AND FREQUENCY ATTENTION

Guyue Hu1,2, Bo Cui1,2, Shan Yu1,2

1Institute of Automation Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

[ID:13] DEEP GEOMETRY EMBEDDING NETWORKS FOR ROBUST FACIAL LANDMARK DETECTION

Meilu Zhu, Daming Shi

Shenzhen University, China

[ID:14] JOINT PROJECTION AND SUBSPACE LEARNING FOR ZERO-SHOT RECOGNITION

Guangzhen Liu, Jiechao Guan, Manli Zhang, Jianhong Zhang, Zihao Wang, Zhiwu Lu

Renmin University of China, China

[ID:15] PART-PRESERVING POSE MANIPULATION FOR PERSON IMAGE SYNTHESIS

Haoye Dong1, Xiaodan Liang1, Chenxing Zhou1, Hanjiang Lai1, Jia Zhu2, Jian Yin1

1Sun Yat-sen University, China, 2South China Normal University, China

[ID:16] BREGMAN-TANIMOTO BASED METHOD FOR CONTRAST PRESERVING DECOLORIZATION

He Chen, Faming Fang

East China Normal University, China

[ID:17] TDCC: TOP-DOWN SEMANTIC AGGREGATION FOR COLOR CONSTANCY

Xiaoqiang Li, Yaqin Zhu, Jiayue Han, Jide Li, Weiqin Tong

118 Shanghai University, China

[ID:18] FROM MARKET TO DISH: MULTI-INGREDIENT IMAGE RECOGNITION FOR PERSONALIZED RECIPE RECOMMENDATION

Lin Zhang1, Jianbo Zhao2, Si Li2, Boxin Shi1,3, Ling-Yu Duan1,3

1Peking University, China, 2Beijing University of Posts and Telecommunications, China, 3Peng Cheng Laboratory, China

[ID:19] IMPROVING OPEN SET DOMAIN ADAPTATION USING IMAGE-TO-IMAGE TRANSLATION

Hongjie Zhang1, Ang Li2, Xu Han1, Zhaoming Chen1, Yang Zhang1, Yanwen Guo1

1Nanjing University, China, 2DeepMind, Mountain View, USA

[ID:20] STRUCTURE GENERATION AND GUIDANCE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION

Chaoqun Wang, Xuejin Chen, Shaobo Min, Feng Wu

University of Science and Technology of China, China

[ID:21] A CONDITIONAL BAYESIAN BLOCK STRUCTURE INFERENCE MODEL FOR OPTIMIZED AV1 ENCOD- ING

Xinyao Chen1, Bichuan Guo1, Minhao Tang1, Yuxing Han2, Jiangtao Wen1

1Tsinghua University, China, 2South China Agriculture University, China

[ID:22] LEARNING TO REMOVE REFLECTIONS FOR TEXT IMAGES

Ce Wang1, Renjie Wan2, Feng Gao3, Boxin Shi1,4, Ling-Yu Duan1,4

1Peking University, China, 2Nanyang Technological University, Singapore,3Tsinghua University, China, 4Peng Cheng Labora- tory, China

119 IEEE ICME2019

Wednesday, July 10, 2019

P-10: Oral-17 to Oral-24

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Roger Zimmermann National University of Singapore, Singapore

[ID:23] VIDEO EMOTION RECOGNITION WITH CONCEPT SELECTION

Baohan Xu1, Yingbin Zheng2, Hao Ye2, Caili Wu3, Heng Wang1, Gufei Sun1

1Zhongan Technology, China, 2Videt Tech, USA, 3East China Normal University, China

[ID:24] GRAPH CONVOLUTIONAL LSTM MODEL FOR SKELETON-BASED ACTION RECOGNITION

Han Zhang, Yonghong Song, Yuanlin Zhang

Xi’an Jiaotong University, China

[ID:25] LEARNING RECURRENT STRUCTURE-GUIDED ATTENTION NETWORK FOR MULTI-PERSON POSE ES- TIMATION

Zhongwei Qiu1, Kai Qiu2, Jianlong Fu2, Dongmei Fu1

1University of Science and Technology Beijing, China, 2Microsoft Reasearch, China

[ID:26] PCPCAD: PROPOSAL COMPLEMENTARY ACTION DETECTOR

Zhenying Fang1, Suguo Zhu1, Jun Yu1, Qi Tian2,3

1Hangzhou Dianzi University, China, 2Huawei Noah’s Ark Lab, China, 3The University of Texas at San Antonio, USA

[ID:27] PERSONALITY DRIVEN MULTI-TASK LEARNING FOR IMAGE AESTHETIC ASSESSMENT

Leida Li1,2, Hancheng Zhu2, Sicheng Zhao3, Guiguang Ding4, Hongyan Jiang2, Allen Tan5

1Xidian University, China, 2China University of Mining and Technology, China, 3University of California Berkeley, USA, 4Ts- inghua University, China, 5Tencent, China

[ID:28] VIDEO QUALITY TEMPORAL POOLING USING A VISIBILITY MEASURE

Chen Bai, Amy R. Reibman

Purdue University, USA

[ID:29] IMAGE QUALITY ASSESSMENT OF MULTI-EXPOSURE IMAGE FUSION FOR BOTH STATIC AND DY- NAMIC SCENES

Yuming Fang1, Yan Zeng1, Hanwei Zhu1, Guangtao Zhai2

1Jiangxi University of Finance and Economics, China, 2Shanghai Jiao Tong University, China

120 [ID:30] NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON LOCAL TO GLOBAL FEA- TURE REGRESSION

Sumei Li, Jianwei Xue, Yongtian Han

Tianjin University, China

[ID:31] HERDING EFFECT BASED ATTENTION FOR PERSONALIZED TIME-SYNC VIDEO RECOMMENDATION

Wenmian Yang1,2, Wenyuan Gao1, Xiaojie Zhou1, Weijia Jia1,2, Shaohua Zhang1,2, Yutao Luo1

1Shanghai JiaoTong University, China, 2University of Macau, China

[ID:32] SEQUENTIAL BEHAVIOR MODELING FOR NEXT MICRO-VIDEO RECOMMENDATION WITH COLLABO- RATIVE TRANSFORMER

Shang Liu, Zhenzhong Chen

Wuhan University, China

[ID:33] BUTTONTIPS: DESIGNING WEB BUTTONS WITH SUGGESTIONS

Dawei Liu, Ying Cao, Rynson W.H. Lau, Antoni B. Chan

City University of Hongkong, China

[ID:34] KNOWING USER BETTER: MICRO-VIDEO RECOMMENDER SYSTEM BY JOINTLY OPTIMIZING TO CLICK-THROUGH AND PLAYTIME

Shengjie Ma, Zhengjun Zha, Feng Wu

University of Science and Technology of China, China

[ID:35] ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MODAL SIMILARITIES

Xin Wen1, Zhizhong Han1,2, Xinyu Yin1, Yu-Shen Liu1

1Tsinghua University, China, 2University of Maryland, USA

[ID:36] SEMI-SUPERVISED COMPATIBILITY LEARNING ACROSS CATEGORIES FOR CLOTHING MATCHING

Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang

Chinese Academy of Sciences, China

[ID:37] ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH

Kevin Lin1, Fan Yang2, Qiaosong Wang2, Robinson Piramuthu2

1University of Washington, USA, 2eBay Inc., USA

[ID:38] A MASK BASED DEEP RANKING NEURAL NETWORK FOR PERSON RETRIEVAL

121 IEEE ICME2019

Lei Qi1, Jing Huo1, Lei Wang2, Yinghuan Shi1, Yang Gao1

1Nanjing University, China, 2University of Wollongong, Australia

[ID:39] DISCO: DEPTH INFERENCE FROM STEREO USING CONTEXT

Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluriy, Rituparna Sarkar, Pankaj Bajpai

Samsung Research Institute Bangalore, India

[ID:40] PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION

Yunian Chen1, Yanjie Wang3, Yang Zhang1, Yanwen Guo1,2

1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, China, 3Zheji- ang University, China

[ID:41] UNTARGETED ADVERSARIAL ATTACK VIA EXPANDING THE SEMANTIC GAP

Aming Wu1, Yahong Han1, Quanxin Zhang2, Xiaohui Kuang3

1Tianjin University, China, 2Beijing Institute of Technology, China, 3National Key Laboratory of Science and Technology on Information System Security, China

[ID:42] LEARNING GOAL-ORIENTED VISUAL DIALOG AGENTS: IMITATING AND SURPASSING ANALYTIC EX- PERTS

Yen-Wei Chang, Wen-Hsiao Peng

National Chiao Tung University, Taiwan

[ID:43] GAN-BASED MULTI-LEVEL MAPPING NETWORK FOR SATELLITE IMAGERY SUPER-RESOLUTION

Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu

Wuhan University, China

[ID:44] QUALITY-GATED CONVOLUTIONAL LSTM FOR ENHANCING COMPRESSED VIDEO

Ren Yang3, Xiaoyan Sun1, Mai Xu2 and Wenjun Zeng1

1Microsoft Research, USA, 2Beihang University, China, 3ETH, Switzerland

[ID:45] COMPOUNDED LAYER-PRIOR UNROLLING: A UNIFIED TRANSMISSION-BASED IMAGE ENHANCE- MENT FRAMEWORK

Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo

Dalian University of Technology, China

[ID:46] DEEP PYRAMID VARIATION LEARNING FOR IMAGE INTERPOLATION

Fu Qiang, Wenhan Yang, Ying Li, and Jiaying Liu

Peking University, China

122 [ID:47] CLOTHES KEYPOINTS LOCALIZATION AND ATTRIBUTE RECOGNITION VIA PRIOR KNOWLEDGE

Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang

Shanghai Jiao Tong University, China

[ID:48] SPATIO-TEMPORAL MULTI-FACTOR DISCRIMINANT ANALYSIS FOR INDIVIDUAL IDENTIFICATION

Yong Su, Zhiyong Feng

Tianjin University, China

[ID:49] CHANNEL-WISE TEMPORAL ATTENTION NETWORK FOR VIDEO ACTION RECOGNITION

Jianjun Lei1, Yalong Jia1,Bo Peng1, Qingming Huang2

1Tianjin University, China, 2University of Chinese Academy of Sciences, China

[ID:50] LOCALIZATION GUIDED FIGHT ACTION DETECTION IN SURVEILLANCE VIDEOS

Qichao Xu1, John See2, Weiyao Lin1

1Shanghai Jiao Tong University, China, 2Multimedia University, Malaysia

[ID:51] RECURSIVE MULTI-STAGE UPSCALING NETWORK WITH DISCRIMINATIVE FUSION FOR SUPER-RES- OLUTION

Yue Lu1, Zhuqing Jiang1, Guodong Ju2, Liangheng Shen2, Aidong Men1

1Beijing University of Posts and Telecommunications, China, 2GuangDong TUS-TuWei Technology Co.,Ltd, China

[ID:52] IMPROVING IMAGE SUPER-RESOLUTION VIA FEATURE RE-BALANCING FUSION

Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu

Xidian University, China

[ID:53] DIFFICULTY-AWARE IMAGE SUPER RESOLUTION VIA DEEP ADAPTIVE DUAL-NETWORK

Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen

Sun Yat-sen University, China

[ID:54] DENSE-CONNECTED RESIDUAL NETWORK FOR VIDEO SUPER-RESOLUTION

Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang and Dou Jin

Tianjin University, China

123 IEEE ICME2019

Wednesday, July 10, 2019

Demo Session 1

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Dong Liu University of Science and Technology of China, China

[ID:55] BASEBALL PLAYER BEHAVIOR RECOGNITION SYSTEM USING MULTIMODAL FEATURES WITH AN AUGMENTED REALITY DISPLAY ON A SMART GLASS

Wei-Chen Yen1, Chih-Chieh Fang2, Shih-Wei Sun1, Kai-Lung Hua3, and Huang-Chia Shih4

1Department of New Media Art, Taipei National University of the Arts, Taiwan,2Graduate Institute of Dance Theory, Taipei National University of the Arts, Taiwan, 3Dept. Computer Science and Information Engineering, Natl. Taiwan Univ. of Sci. and Tech., Taiwan,4Dept. Electrical Engineering, Yuan Ze University, Taiwan

[ID:56] PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY

Liyue Fan

University at Albany SUNY, USA

[ID:57] SMART ADVERTISING IN VIDEOS BASED ON COMPREHENSIVE CONTENT ANALYTICS

Yi Zhang, Fan Luan, Yu-Gang Jiang

Jilian Technology Group (Video++), Shanghai, China

[ID:58] LIVE DEMONSTRATION: HIGH PERFORMANCE FOCUSED PLENOPTIC CAMERA

Chuanpu Li, Xufu Sun, Xin Jin, Qionghai Dai

Graduate School at Shenzhen, Tsinghua University, Shenzhen, China

124 Poster Session 4 & Demo Session 2

Wednesday, July 10, 2019

P-11: Multimedia and Language I

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Liyue Fan University at Albany, USA

[ID:1] DYNAMIC PSEUDO LABEL DECODING FOR CONTINUOUS SIGN LANGUAGE RECOGNITION

Hao Zhou, Wengang Zhou, Houqiang Li

University of Science and Technology of China, China

[ID:2] DECOUPLING LOCALIZATION AND CLASSIFICATION IN SINGLE SHOT TEMPORAL ACTION DETEC- TION

Yupan Huang1, Qi Dai2, Yutong Lu1

1Sun Yat-sen University, China, 2Microsoft Research, USA

[ID:3] IMAGE-TO-TREE: A TREE-STRUCTURED DECODER FOR IMAGE CAPTIONING

Zhiming Ma, Chun Yuan, Yangyang Cheng, Xinrui Zhu

Tsinghua University, China

[ID:4] MULTIMODAL SEMANTIC ATTENTION NETWORK FOR VIDEO CAPTIONING

Liang Sun1, Bing Li2, Chunfeng Yuan2, Zhengjun Zha1, Weiming Hu2

1University of Science and Technology of China, China, 2Chinese Academy of Sciences, China

[ID:5] CONCRETE IMAGE CAPTIONING BY INTEGRATING CONTENT SENSITIVE AND GLOBAL DISCRIMINA- TIVE OBJECTIVE

Jie Wu1, Tianshui Chen1,2, Hefeng Wu1,3, Zhi Yang1, Qing Wang1,2, Liang Lin1,2

1Sun Yat-sen University, China, 2DarkMatter AI Research, United Arab Emirates, 3Guangdong University of Foreign Studies, China

[ID:6] REVNET: BRING REVIEWING INTO VIDEO CAPTIONING FOR A BETTER DESCRIPTION

Huidong Li, Dandan Song, Lejian Liao, Cuimei Peng

Beijing Institute of Technology, China

125 IEEE ICME2019

[ID:7] MULTIMODAL IMAGE CAPTIONING THROUGH COMBINING REINFORCED CROSS ENTROPY LOSS AND STOCHASTIC DEPRECATION

Xi Meng, Hao Kong, Dongqi Tang, Tong Lu

Nanjing University, China

126 Wednesday, July 10, 2019

P-12: Advances in Artificial Intelligence

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Baoning Niu Taiyuan University of Technology, China

[ID:8] INVERSENET: SOLVING INVERSE PROBLEMS WITH SPLITTING NETWORKS

Qi Wei1, Kai Fan2, Wenlin Wang3, Tianhang Zheng4, Amit Chakraborty5, Katherine Heller3, Changyou Chen4, Kui Ren4,6

1J.P. Morgan, USA, 2Alibaba DAMO Academy, China, 3Duke University, USA, 4SUNY at Buffalo, USA, 5Siemens Corporate Technology, China, 6Zhejiang University, China

[ID:9] HIGH-RESOLUTION DRIVING SCENE SYNTHESIS USING STACKED CONDITIONAL GANS AND SPEC- TRAL NORMALIZATION

Shaobo Lin1, Long Chen1, Qin Zou2, Wei Tian3

1Sun Yat-sen University, China, 2Wuhan University, China, 3Karlsruhe Institute of Technology, Germany

[ID:10] ZERO-SHOT LEARNING WITH FEW SEEN CLASS SAMPLES

Yuqi Huo, Jiechao Guan, Jianhong Zhang, Manli Zhang, Ji-Rong Wen, Zhiwu Lu

Renmin University of China, China

[ID:11] ATTENTIONDROP FOR CONVOLUTIONAL NEURAL NETWORKS

Zhihao Ouyang1, Yan Feng1,2, Zihao He1, Tianbo Hao1, Tao Dai1,2, Shu-Tao Xia1,2

1Tsinghua University, China, 2Peng Cheng Laboratory, China

[ID:12] MULTI-VIEW CLUSTERING VIA SIMULTANEOUSLY LEARNING GRAPH REGULARIZED LOW-RANK TENSOR REPRESENTATION AND AFFINITY MATRIX

Yongyong Chen, Xiaolin Xiao, Yicong Zhou

University of Macau, China

[ID:13] DATA AUGMENTATION FOR MONAURAL SINGING VOICE SEPARATION BASED ON VARIATIONAL AU- TOENCODER-GENERATIVE ADVERSARIAL NETWORK

Boxin He1, Shengbei Wang1, Weitao Yuan1, Jianming Wang1, Masashi Unoki2

1Tianjin Polytechnic University, China, 2Japan Advanced Institute of Science and Technology, Japan

127 IEEE ICME2019

[ID:14] TOWARDS BETTER UNCERTAINTY SAMPLING: ACTIVE LEARNING WITH MULTIPLE VIEWS FOR DEEP CONVOLUTIONAL NEURAL NETWORK

Tao He1, Xiaoming Jin1, Guiguang Ding1, Lan Yi2, Chenggang Yan3

1Tsinghua University, China, 2Cisco, USA, 3Hangzhou Dianzi University, China

[ID:15] LOCAL METRIC LEARNING BASED ON ANCHOR POINTS FOR MULTIMEDIA ANALYSIS

Chunbin Gu, Jiajun Bu, Keyue Shi, Zhi Yu, Beidou Wang, Liangcheng Li

Zhejiang University, China

[ID:16] SPARSE REGRESSION-BASED MULTIPLE SEQUENCE ALIGNMENT

Tung Doan1, Takasu Atsuhiro2

1SOKENDAI (The Graduate University for Advanced Studies), Japan, 2National Institute of Informatics, Japan

128 Wednesday, July 10, 2019

P-13: Multimedia Security, Privacy and Forensics I

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Qieshi Zhang Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China

[ID:17] SINGLE IMAGE DERAINING USING A RECURRENT MULTI-SCALE AGGREGATION AND ENHANCE- MENT NETWORK

Youzhao Yang, Hong Lu

Fudan University, China

[ID:18] NEURAL NETWORK BASED PHASE COMPENSATION METHODS ON MONAURAL SPEECH SEPARA- TION

Chunpeng Wang, Jie Zhu

Shanghai Jiao Tong University, China

[ID:19] PALMGAN FOR CROSS-DOMAIN PALMPRINT ECOGNITION

Huikai Shao, Dexing Zhong, Yuhan Li

Xi’an Jiaotong University, China

[ID:20] EVENT-BASED VISION ENHANCED: A JOINT DETECTION FRAMEWORK IN AUTONOMOUS DRIVING

Jianing Li1, Siwei Dong1, Zhaofei Yu1,2, Yonghong Tian1,2, Tiejun Huang1,2

1Peking University, China, 2Pengcheng Laboratory, China

[ID:21] MULTI-DOMAIN EMBEDDING STRATEGIES FOR VIDEO STEGANOGRAPHY BY COMBINING PARTI- TION MODES AND MOTION VECTORS

Liming Zhai, Lina Wang, Yanzhen Ren

Wuhan University, China

[ID:22] Real-Time Indoor 3D Human Imaging Based on MIMO Radar Sensing

Hanqing Guo1, Nan Zhang1, Wenjun Shi1, Saeed AlQarni1, Shaoen Wu1, Honggang Wang2

1Ball State University, USA, 2University of Massachusetts Dartmouth, USA

[ID:23] MULTI-SPEAKERS SPEECH SEPARATION BASED ON MODIFIED ATTRACTOR POINTS ESTIMATION AND GMM CLUSTERING

129 IEEE ICME2019

Shanfa Ke1, Ruimin Hu1, Gang Li1, Tingzhao Wu1, Xiaochen Wang1,2, Zhongyuan Wang1

1Wuhan University, China, 2Collaborative Innovation Center of Geospatial Technolog, China

[ID:24] LEARNING A DEEP CONVOLUTIONAL NETWORK FOR SUBBAND IMAGE DENOISING

Jing Zhao1, Ruiqin Xiong1, Jizheng Xu2, Feng Wu3, Tiejun Huang1

1Peking University, China, 2ByteDance, USA, 3University of Science and Technology of China, China

[ID:25] MULTI-TASK CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION

Zhijie Lin, Sen Jia, Bin Deng

Shenzhen University, China

[ID:26] A RETINA-INSPIRED SAMPLING METHOD FOR VISUAL TEXTURE RECONSTRUCTION

Lin Zhu1, Siwei Dong1, Tiejun Huang1,2, Yonghong Tian1,2

1Peking University, China, 2Pengcheng Laboratory, China

130 Wednesday, July 10, 2019

P-14: Machine Learning Applications in Image and Video Coding II

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Dan Zeng Shanghai University, China

[ID:27] LEARNED SCALABLE IMAGE COMPRESSION WITH BIDIRECTIONAL CONTEXT DISENTANGLEMENT NETWORK

Zhizheng Zhang, Zhibo Chen, Jianxin Lin, Weiping Li

University of Science and Technology of China, China

[ID:28] AN ATTENTION RESIDUAL NEURAL NETWORK WITH RECURRENT GREEDY APPROACH AS LOOP FIL- TER FOR INTER FRAMES

Jiabao Yao, Li Wang, Fangdong Chen, Chaoyi Lin, Shiliang Pu

Hikvision Research Institute, China

[ID:29] BAYESIAN NONNEGATIVE MATRIX FACTORIZATION WITH A TRUNCATED SPIKE-AND-SLAB PRIOR

Yuhang Liu1,2, Wenyong Dong1, Wanjuan Song1, Lei Zhang3

1Wuhan University, China, 2The University of Adelaide, Australia, 3Inception Institute of Artificial Intelligence, United Arab Emirates

[ID:30] ENCODING COMPLEXITY CONTROL FOR LIVE VIDEO APPLICATIONS: AN INTERPRETABLE MACHINE LEARNING APPROACH

Chao Huang, Zongju Peng, Fen Chen, Qiuping Jiang, Xin Cui, Gangyi Jiang

Ningbo University, China

[ID:31] ENHANCED RESIDUAL DENSE INTRINSIC NETWORK FOR INTRINSIC IMAGE DECOMPOSITION

Risheng Liu1,2,3, Cheng Yang1,3, Long Ma1,3, Miao Zhang1,3, Xin Fan1,3, Zhongxuan Luo1,3

1Dalian University of Technology, China, 2Xidian University, China, 3Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China

131 IEEE ICME2019

[ID:32] NON-CONVEX TRANSFER SUBSPACE LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION

Zhipeng Lin1, Zhenyu Zhao1, Tingjin Luo1, Wenjing Yang1, Yongjun Zhang2, Yuhua Tang1

1National University of Defense Technology, China, 2National Innovation Institute of Defense Technology, China

[ID:33] DISCRIMINATIVE GROUP COLLABORATIVE COMPETITIVE REPRESENTATION FOR VISUAL CLASSIFI- CATION

Jianping Gou1, Lei Wang1, Zhang Yi2, Yunhao Yuan3, Weihua Ou4, Qirong Mao1

1Jiangsu University, China, 2Sichuan University, China, 3Yangzhou University, China, 4Guizhou Normal University, China

132 Wednesday, July 10, 2019

P-15: Multimedia and Vision II

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Shao-Yi Chien National Taiwan University, Taiwan

[ID:34] LARGE-SCALE DATASETS FOR GOING DEEPER IN IMAGE UNDERSTANDING

Jiahong Wu1,2, He Zheng3, Bo Zhao4, Yixin Li4, Baoming Yan4, Rui Liang1, Wenjia Wang4, Shipei Zhou4,5, Guosen Lin2, Yan- wei Fu6, Yizhou Wang4, Yonggang Wang1

1Sinovation Ventures, China, 2Ainnovation Technology Ltd, China, 3University of Chinese Academy of Sciences, China, 4Pe- king University, China, 5Carnegie Mellon University, USA, 6Fudan University, China

[ID:35] REVISIT SURROUND-VIEW CAMERA SYSTEM CALIBRATION

Xuan Shao1, Xiao Liu1, Lin Zhang1, Shengjie Zhao1, Ying Shen1, Yukai Yang2

1Tongji University, China, 2Uppsala University, Sweden

[ID:36] DECOUPLING SEMANTIC CONTEXT AND COLOR CORRELATION WITH MULTI-CLASS CROSS BRANCH REGULARIZATION

Vishal Keshav, Tejpratap GVSL

Samsung Research Institute Bangalore, India

[ID:37] CROWD COUNTING VIA MULTI-VIEW SCALE AGGREGATION NETWORKS

Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin

Sun Yat-sen University, China

[ID:38] PASIAM: PREDICTING ATTENTION INSPIRED SIAMESE NETWORK, FOR SPACE-BORNE SATELLITE VIDEO TRACKING

Jia Shao1, Bo Du1, Chen Wu1, Pingkun Yan2

1Wuhan University, China, 2Rensselaer Polytechnic Institute, USA

[ID:39] A RELATION NETWORK EMBEDDED WITH PRIOR FEATURES FOR FEW-SHOT CARICATURE RECOGNI- TION

Wenbo Zheng1,2, Lan Yan2,3, Chao Gou2,4, Wenwen Zhang1,2, Fei-Yue Wang2,4

1Xi’an Jiaotong University, China, 2Chinese Academy of Sciences, China, 3University of Chinese Academy of Sciences, Chi- na, 4Qingdao Academy of Intelligent Industries, China

133 IEEE ICME2019

[ID:40] A SINGLE-SHOT ORIENTED SCENE TEXT DETECTOR WITH LEARNABLE ANCHORS

Fenfen Sheng1,2, Zhineng Chen1, Tao Mei3, Bo Xu1

1Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3JD AI Research, China

[ID:41] CONTEXT-CONSTRAINED ACCURATE CONTOUR EXTRACTION FOR OCCLUSION EDGE DETECTION

Rui Lu1, Menghan Zhou1, Anlong Ming1, Yu Zhou2

1Beijing University of Posts and Telecommunications, China, 2Huazhong University of Science and Technology, China

[ID:42] LEARNING SIMULTANEOUS FACE SUPER-RESOLUTION USING MULTISET PARTIAL LEAST SQUARES

Yun-Hao Yuan1, Jin Li1, Jianping Gou2, Yun Li1, Jipeng Qiang1, Bin Li1

1Yangzhou University, China, 2Jiangsu University, China

[ID:43] CROPPING REGION PROPOSAL NETWORK BASED FRAMEWORK FOR EFFICIENT OBJECT DETECTION ON LARGE SCALE REMOTE SENSING IMAGES

Qifeng Lin, Jianhui Zhao, Qianqian Tong, Guian Zhang, Gang Fu, Zhiyong Yuan

Wuhan University, China

[ID:44] SPECTRAL ANALYSIS NETWORK FOR DEEP REPRESENTATION LEARNING AND IMAGE CLUSTERING

Jinghua Wang1, Adrian Hilton2, Jianmin Jiang1

1Shenzhen University, China, 2University of Surrey, UK

134 Wednesday, July 10, 2019

P-16: Oral-13 to Oral-16

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Lei Zhang Microsoft AI & Research, USA

[ID:45] GLOBAL AS-CONFORMAL-AS-POSSIBLE NON-RIGID REGISTRATION OF MULTI-VIEW SCANS

Zhenchao Wu1, Kun Li1, Yu-Kun Lai2, Jingyu Yang1

1Tianjin University, China, 2Cardiff University, UK

[ID:46] A LIGHT-WEIGHTED NETWORK FOR FACIAL LAND MARK DETECTION VIA COMBINED HEATMAP AND COORDINATE REGRESSION

Zhengning Wang1, Longfei Feng1, Fanwei Zeng1, Guang Hu1, Xiang Zhang1, Xia Lv1, Fengjun Zhang2

1University of Electronic Science and Techonogy of China, China, 2No.30 Institute of CETC, China

[ID:47] LIGHT WEIGHT STEREO MATCHING VIA DEEP EXTRACTION AND INTEGRATION OF LOW AND HIGH LEVEL INFORMATION

Xianzhe Xu1, Yonghong Hou1, Pichao Wang2, Zhongyu Jiang1, Wanqing Li3

1Tianjin University, China, 2Alibaba Group (U.S.) Inc., China, 3University of Wollongong, Australia

[ID:48] JUSTLOOKUP: ONE MILLISECOND DEEP FEATURE EXTRACTION FOR POINT CLOUDS BY LOOKUP TABLES

Hongxin Lin1,2, Zelin Xiao1,2, Yang Tan1,2, Hongyang Chao1, Shengyong Ding1

1Sun Yat-sen University, China, 2Pixtalks Tech, China

[ID:49] MULTIPLE GRAPH CONVOLUTIONAL NETWORKS FOR CO-SALIENCY DETECTION

Bo Jiang1, Xingyue Jiang1, Jin Tang1, Bin Luo1, Shilei Huang2

1Anhui University, China, 2PKU-HKUST Shenzhen Hong Kong Institution, China

[ID:50] QUANNET: JOINT IMAGE COMPRESSION AND CLASSIFICATION OVER CHANNELS WITH LIMITED BANDWIDTH

Lahiru Dulanjana Chamain Hewa Gamage1, Sen-ching S Cheung2, Zhi Ding1

1University of Califirnia Davis, USA,2 University of Kentucky, USA

135 IEEE ICME2019

[ID:51] HIGH EFFICIENCY LIGHT FIELD COMPRESSION VIA VIRTUAL REFERENCE AND HIERARCHICAL MV- HEVC

Jiawen Gu, Bichuang Guo, Jiangtao Wen

Tsinghua University, China

[ID:52] SELF-PACED SUBSPACE CLUSTERING

Youfa Liu, Bo Du, Lefei Zhang

Wuhan University, China

[ID:53] COLLOQUIAL IMAGE CAPTIONING

Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji

Xiamen University, China

[ID:54] IMPROVING CAPTIONING FOR LOW-RESOURCE LANGUAGES BY CYCLE CONSISTENCY

Yike Wu1, Shiwan Zhao2, Jia Chen3, Yinng Zhang1, Xiaojie Yuan1, Zhong Su2

1Nankai University, China, 2IBM Research, USA, 3Carnegie Mellon University, USA

[ID:55] FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION

Zhuo Lei1,2, Chao Zhang1,2, Qian Zhang2, Guoping Qiu3,4

1International Doctoral Innovation Center, China, 2The University of Nottingham Ningbo China, China, 3Shenzhen Universi- ty, China, 4University of Nottingham, UK

[ID:56] CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENC ED STYLE EM- BEDDING

Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong

Wuhan University of Technology, China

[ID:57] QUERY-FREE EMBEDDING ATTACK AGAINST DEEP LEARNING

Yujia Liu, Weiming Zhang, Nenghai Yu

University of Science and Technology of China, China

[ID:58] GRAPH ATTENTION NEURAL NETWORKS FOR POINT CLOUD RECOGNITION

Zongmin li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li

China University of Petroleum (East China), China

136 [ID:59] MAXIMAL CORRELATION EMBEDDING NETWORK FOR MULTILABEL LEARNING WITH MISSING LA- BELS

Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang

Tsinghua University, China

[ID:60] SELF-ADAPTION MULTI-CLASSIFIER FUSION NETWORKS FOR IMAGE RECOGNITION

Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang

Dalian University of Technology, China

137 IEEE ICME2019

Wednesday, July 10, 2019

Demo Session 2

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Dong Liu University of Science and Technology of China, China

[ID:61] DEMONSTRATION OF APPLICATIONS IN COMPUTER VISION AND NLP ON ULTRA POWER-EFFICIENT CNN DOMAIN SPECIFIC ACCELERATOR WITH 9.3TOPS/WATT

Baohua Sun, Lin Yang, Wenhan Zhang, Patrick Dong, Charles Young, Jason Dong, Michael Lin

Gyrfalcon Technology Inc., USA

[ID:62] LIGHT FIELD RECONSTRUCTION USING SHEARLET TRANSFORM IN TENSORFLOW

Yuan Gao1, Reinhard Koch1, Robert Bregovic2, Atanas Gotchev2

1Kiel University, Germany, 2Tampere University, Finland

[ID:63] AUTOMATIC LONG-TERM DECEPTION DETECTION IN GROUP INTERACTION VIDEOS

Chongyang Bai1, Maksim Bolonkin1, Judee Burgoon3, Chao Chen1, Norah Dunbar4, Bharat Singh2, V. S. Subrahmanian1, Zhe Wu2

1Dartmouth College, USA, 2Univerity of Maryland, USA, 3University of Arizona, USA, 4University of California Santa Barba- ra, USA

138 Poster Session 5 & Grand Challenge

Thursday, July 11, 2019

P-17: Multimedia Understanding and Mixed Reality

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Robert Bregovic Tampere University of Technology, Finland

[ID:1] SALIENT OBJECT DETECTION VIA RECURRENTLY AGGREGATING SPATIAL ATTENTION WEIGHTED CROSS-LEVEL DEEP FEATURES

Chang Tang1, Xinzhong Zhu2, Xinwang Liu3, Pichao Wang4

1China University of Geosciences, China, 2Zhejiang Normal University, China, 3National University of Defense Technology, China, 4Alibaba Group (U.S.), China

[ID:2] FAST REGISTRATION FOR CROSS-SOURCE POINT CLOUDS BY USING WEAK REGIONAL AFFINITY AND PIXEL-WISE REFINEMENT

Xiaoshui Huang1, Lixin Fan2, Qiang Wu1, Jian Zhang1,4, Chun Yuan3

1University of Technology Sydney, Australia, 2Nokia Technologies, Finland, 3Graduate School at Shenzhen, Tsinghua Univer- sity, China, 4Peng Cheng Laboratory, China

[ID:3] 3D FACE REPRENTATION AND RECONSTRUCTION WITH MULTI-SCALE GRAPH CONVOLUTIONAL AU- TOENCODERS

Cunkuan Yuan1, Kun Li1, Yu-Kun Lai2, Yebin Liu3, Jingyu Yang1

1Tianjin University, China, 2Cardiff University, UK, 3Tsinghua University, China

[ID:4] VISUAL DIALOG WITH TARGETED OBJECTS

Qiang Wang, Yahong Han

Tianjin University, China

[ID:5] A NEW OBJECT SCENE FLOW ALGORITHM BASED ON SUPPORT POINTS SELECTION AND ROBUST MOVING OBJECT PROPOSAL

Zhengyang Sun1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1

1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK

[ID:6] REFINING PROPOSALS WITH NEIGHBORING CONTEXTS FOR TEMPORAL ACTION DETECTION

Dashan Guo, Wei Li, Ning Xu, Jianhui Sun and Xiangzhong Fang

139 IEEE ICME2019

Shanghai Jiao Tong University, China

[ID:7] A DATA-DRIVEN FRAMEWORK FOR APPEARANCE EDITING OF MEASURED MATERIALS

Yanjun Chen1, Jie Guo1, Bingyang Hu1, Yanwen Guo1,2, Jingui Pan1

1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, China

[ID:8] ACTIVE SEMANTIC LABELING OF STREET VIEW POINT CLOUDS

Yang Zhou1,2, Shuhan Shen1,2, Zhanyi Hu1,2

1NLPR, Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

[ID:9] VIDEO PREDICTION WITH TEMPORAL-SPATIAL ATTENTION MECHANISM AND DEEP PERCEPTUAL SIMILARITY BRANCH

Qian Wu, Wenmin Wang, Xiongtao Chen, Weimian Li

Peking University, China

[ID:10] AUTOMATIC LONG-TERM DECEPTION DETECTION IN GROUP INTERACTION VIDEOS

Chongyang Bai1, Maksim Bolonkin1, Judee Burgoon3, Chao Chen1, Norah Dunbar4, Bharat Singh2, V. S. Subrahmanian1, Zhe Wu2

1Dartmouth College, USA, 2Univerity of Maryland, USA, 3University of Arizona, USA, 4University of California Santa Barba- ra, USA

[ID:11] A NEW ROTATION-INVARIANT DEEP NETWORK FOR 3D OBJECT RECOGNITION

Yachi Zhang1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1

1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK

[ID:12] LOCAL OPTICAL FLOW CONSIDERING OBJECT BOUNDARIES BY ADAPTIVE WINDOW POSITIONING

Andreas Kah, Matthias Narroschke

RheinMain University of Applied Sciences, Germany

[ID:13] MULTI-GRANULARITY REASONING FOR SOCIAL RELATION RECOGNITION FROM IMAGES

Meng Zhang1, Xinchen Liu2, Wu Liu2, Anfu Zhou1, Huadong Ma1, Tao Mei2

1Beijing University of Posts and Telecommunications, China, 2JD AI Research, China

140 Thursday, July 11, 2019

P-18: Media Classification and Segmentation IV

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Jianyu Yang Soochow University, China

[ID:14] MULTI-TIMESCALE CONTEXT ENCODING FOR SCENE PARSING PREDICTION

Xin Chen, Yahong Han

Tianjin University, China

[ID:15] PORTRAIT INSTANCE SEGMENTATION FOR MOBILE DEVICES

Lingyu Zhu1, Tinghuai Wang2, Emre Aksu2, Joni-Kristian Kamarainen1

1Tampere University, Finland, 2Nokia Technologies, Finland

[ID:16] LEARNING TO SEGMENT UNSEEN CATEGORY OBJECTS USING GRADIENT GAUSSIAN ATTENTION

Pengbo Zhang1, Zhihui Wang1, Xinzhu Ma1, Haojie Li1, Jianjun Li2

1Dalian University of Technology, China, 2Hangzhou Dianzi University, China

[ID:17] TEMPORAL SEGMENT CONVOLUTIONAL KERNEL NETWORKS FOR SEQUENCE MODELING OF VID- EOS

Fei Pan1, Yanwen Guo1, Zhicheng Yan2, Jie Guo1

1Nanjing University, China, 2Facebook Research, USA

[ID:18] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION

Shaoshuai Li, Fuyan Liu

Shanghai University, China

[ID:19] SPFUSIONNET: SKETCH SEGMENTATION USING MULTI-MODAL DATA FUSION

Fei Wang1, Shujin Lin1, Hefeng Wu2, Hanhui Li3, Ruomei Wang1, Xiaonan Luo3, Xiangjian He4

1Sun Yat-sen University, China, 2Guangdong University of Foreign Studies, China, 3Guilin University of Electronic Technolo- gy, China, 4University of Technology Sydney, Australia

[ID:20] ADAPTIVE COMPONENT EMBEDDING FOR UNSUPERVISED DOMAIN ADAPTATION

Mengmeng Jing1, Jingjing Li1, Ke Lu1, Jieyan Liu1, Zi Huang2

141 IEEE ICME2019

1University of Electronic Science and Technology of China, China, 2The University of Queensland, Australia

[ID:21] ACOUSTIC SCENE CLASSIFICATION WITH MISMATCHED RECORDING DEVICES USING MIXTURE OF EXPERTS LAYER

Truc Nguyen, Franz Pernkopf

Graz University of Technology, Germany

[ID:22] SELF-REPRESENTATION CONVOLUTIONAL NEURAL NETWORKS

Hongchao Gao1,2, Xi Wang1, Yujia Li1,2, Jizhong Han1, Songlin Hu1, Ruixuan Li3

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Huazhong University of Science and Technology, China

142 Thursday, July 11, 2019

P-19: Oral-29 to Oral-35

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Honggang Wang University of Massachusetts (UMass) Dartmouth, USA

[ID:23] PEDESTRIAN RE-IDENTIFICATION BASED ON TREE BRANCH NETWORK WITH LOCAL AND GLOBAL LEARNING

Hui Li1, Meng Yang2, Zhihui Lai1, Weishi Zheng2, Zitong Yu3

1Shenzhen University, China, 2Sun Yat-sen University, China, 3University of Oulu, Finland

[ID:24] ADVERSARIAL BINARY CODING FOR EFFICIENT PERSON RE-IDENTIFICATION

Zheng Liu1, Jie Qin2, Annan Li1, Yunhong Wang1, and Luc Van Gool3

1Beihang University, China, 2Inception Institute of Artificial Intelligence, United Arab Emirates, 3Computer Vision Laborato- ry, ETH Zurich, Switzerland

[ID:25] PERSON RE-IDENTIFICATION WITH GRADUAL BACKGROUND SUPPRESSION

Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao

Xidian University, China

[ID:26] MULTI-BRANCH CONTEXT-AWARE NETWORK FOR PERSON RE-IDENTIFICATION

Yingxin Zhu1, Xiaoqiang Guo2, Jianlei Liu1, Zhuqing Jiang1

1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China

[ID:27] POST-PROCESSING OF WORD REPRESENTATIONS VIA VARIANCE NORMALIZATION AND DYNAMIC EMBEDDING

Bin Wang1, Fenxiao Chen1, Angela Wang2 and C.-C. Jay Kuo1

1University of Southern California, USA, 2University of California, Berkeley, USA

[ID:28] MULTI-MODAL LANGUAGE ANALYSIS WITH HIERARCHICAL INTERACTION-LEVEL AND SELEC- TION-LEVEL ATTENTION

Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou

Soochow University, China

[ID:29] MODELING THE CLAUSE-LEVEL STRUCTURE TO MULTIMODAL SENTIMENT ANALYSIS VIA REIN-

143 IEEE ICME2019

FORCEMENT LEARNING

Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou

Soochow University, China

[ID:30] TWICE OPPORTUNITY KNOCKS SYNTACTIC AMBIGUITY: A VISUAL QUESTION ANSWERING MODEL WITH YES/NO FEEDBACK

Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin

Tianjin Polytechnic University, China

[ID:31] GEOCAPSNET: GROUND TO AERIAL VIEW IMAGE GEO-LOCALIZATION USING CAPSULE NETWORK

Bin Sun1, Chen Chen2, Yingying Zhu1, Jianmin Jiang1

1Shenzhen University, China, 2University of North Carolina at Charlotte, USA

[ID:32] IMPROVING ROBUSTNESS OF DASH AGAINST NETWORK UNCERTAINTY

Bo Wang1,2, Fengyuan Ren1,2

1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China

[ID:33] HYBRID CONTROL-BASED ABR: TOWARDS LOW-DELAY LIVE STREAMING

Bo Wang1,2, Fengyuan Ren1,2, Chao Zhou3

1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China, 3Beijing Kuaishou Technology Co., Ltd. , China

[ID:34] TAXI ORIGIN-DESTINATION DEMAND PREDICTION WITH CONTEXTUALIZED SPATIAL-TEMPORAL NETWORK

Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin

Sun Yat-sen University, China

[ID:35] FAST IMAGE CLUSTERING BASED ON CAMERA FINGERPRINT ORDERING

Sahib Khan, Tiziano Bianchi

Politecnico di Torino, Italy

[ID:36] ENFORCING ACCESS CONTROL IN DISTRIBUTED VERSION CONTROL SYSTEMS

Xin Xu1,2, Quanwei Cai1,2, Jingqiang Lin1,2, Shiran Pan1,2, Liangqin Ren1,2

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

144 [ID:37] ATTRIBUTE-BASED ACCOUNTABLE ACCESS CONTROL FOR MULTIMEDIA CONTENT WITH IN-NET- WORK CACHING

Peixuan He1, Kaiping Xue1, Jie Xu1, Qiudong Xia1, Jianqing Liu2, Hao Yue3

1University of Science and Technology of China, China, 2University of Alabama in Huntsville, USA, 3San Francisco State University, USA

[ID:38] PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY

Liyue Fan

University at Albany, State University of New York, USA

[ID:39] JOINTLY SOLVING DEBLURRING AND SUPER-RESOLUTION PROBLEMS WITH DUAL SUPERVISED NETWORK

Zhenwen Liang, Dongyang Zhang, Jie Shao

University of Electronic Science and Technology of China, China

[ID:40] TWO-STAGED ACOUSTIC MODELING ADAPTION FOR ROBUST SPEECH RECOGNITION BY THE EX- AMPLE OF GERMAN ORAL HISTORY INTERVIEWS

Michael Gref 1,2, Christoph Schmidt1, Sven Behnke1,3, Joachim Köhler1

1Fraunhofer Institute for Intelligent Analysis and Information Systems, Germany, 2Niederrhein University of Applied Scienc- es, Germany, 3University of Bonn, Germany

[ID:41] AN ADAPTIVE AFFINITY GRAPH WITH SUBSPACE PURSUIT FOR NATURAL IMAGE SEGMENTATION

Yang Zhang1, Huiming Zhang1, Yanwen Guo1, Kai Lin2, Jingwu He1

1Nanjing University, China, 2Hubei University of Technology, China

[ID:42] PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCU- LAR MICROPHONE ARRAY

Li He, Yi Zhou, Hongqing Liu

Chongqing University of Posts and Telecommunications, China

[ID:43] LOCALITY-CONSTRAINED SPATIAL TRANSFORMER NETWORK FOR VIDEO CROWD COUNTING

Yanyan Fang1, Biyun Zhan1, Wandi Cai1, Shenghua Gao2, Bo Hu1

1Fudan University, China, 2ShanghaiTech University, China

[ID:44] SPATIAL-AWARE NON-LOCAL ATTENTION FOR FASHION LANDMARK DETECTION

Yixin Li1, Shengqin Tang2, Yun Ye3, Jinwen Ma1

1Peking University, China, 2Xi’an Jiaotong University, China, 3JD AI Research, China

145 IEEE ICME2019

[ID:45] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION

Wu Zheng1,2, Lin Li1,2, Zhaoxiang Zhang1,2, Yan Huang1,2, Liang Wang1,2

1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

[ID:46] MULTI-VIEW LEARNING FOR VEHICLE RE-IDENTIFICATION

Weipeng Lin1, Yidong Li1, Xiaoliang Yang1, Peixi Peng2, Junliang Xing2

1Beijing Jiaotong University, China, 2Institute of Automation, Chinese Academy of Sciences, China

[ID:47] MANY COULD BE BETTER THAN ALL: A NOVEL INSTANCE-ORIENTED ALGORITHMFOR MULTI-MOD- AL MULTI-LABEL PROBLEM

Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang

Nanjing University, China

[ID:48] AFFECTIVE VIDEO CONTENT ANALYSES BY USING CROSS-MODAL EMBEDDING LEARNING FEA- TURES

Benchao Li1,3, Zhenzhong Chen2, Shan Li3, WeiShi Zheng1,4

1Sun Yat-Sen University, China, 2Wuhan University, China, 3Tencent, China, 4Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, ,China

[ID:49] LEARNING A 3D GAZE ESTIMATOR WITH IMPROVED ITRACKER COMBINED WITH BIDIRECTIONAL LSTM

Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen

Zhejiang University of Technology, China

[ID:50] DETECTION OF OCCLUDED ROAD SIGNS ON AUTONOMOUS DRIVING VEHICLES

Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang

University of North Texas, USA

146 Thursday, July 11, 2019

Grand Challenge

Time: 13:30 - 15:00 PM

Room: 3rd Floor

Chair: Jiaying Liu Peking University, China

[ID:51] SALIENCY PREDICTION VIA MULTI-LEVEL FEATURES AND DEEP SUPERVISION FOR CHILDREN WITH AUTISM SPECTRUM DOISORDER

Weijie Wei1, Zhi Liu1, Lijin Huang1, Alexis Nebout2, Olivier Le Meur2 1Shanghai University, China, 2 University of Rennes 1, France

[ID:52] VISUAL ATTENTION MODELING FOR AUTISM SPECTRUM DISORDER BY U-NET

Yuming Fang, Hanqin Huang, Boyang Wan, and Yifan Zuo Jiangxi University of Finance and Economics, China

[ID:53] PREDICTING SALIENCY MAPS FOR ASD PEOPLE

Alexis Nebout1, Weijie Wei2, Zhi Liu2, Lijin Huang2, Olivier Le Meur1 1University of Rennes 1, France, 2Shanghai University, China

[ID:54] CLASSIFYING AUTISM SPECTRUM DISORDER BASED ON SCANPATHS AND SALIENCY

Mikhail Startsev, Michael Dorr Technical University of Munich, Germany

[ID:55] EXPLOITING VISUAL BEHAVIOUR FOR AUTISM SPECTRUM DISORDER IDENTIFICATION

Giuliano Arru, Pramit Mazumdar, Federica Battisti Roma Tre University, Italy

[ID:56] SP-ASDNET: CNN-LSTM BASED ASD CLASSIFICATION MODEL USING OBSERVER SCANPATHS

Yudong Tao, Mei-Ling Shyu University of Miami, USA

147 IEEE ICME2019

[ID:57] PREDICTING AUTISM DIAGNOSIS USING IMAGE WITH FIXATIONS AND SYNTHETIC SACCADE PATTERNS

Chongruo Wu1, Sidrah Liaqat2, Sen-ching Cheung2, Chen-Nee Chuah1, Sally Ozonoff1 1University of California, Davis, USA, 2University of Kentucky, USA

[ID:58] A SIMPLE BUT USEFUL MODEL FOR CLASSIFYING ASD AND NORMAL VIEWERS USING GAZE DATA AND LINEAR REGRESSION

S. Xu, J. Yan, M. Hu Shanghai Key Laboratory of Multidimensional Information Processing, East China Normal University, China

148 Poster Session 6

Thursday, July 11, 2019

P-20: Multimedia Communications, Networking and Mobility

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Tsung-Jung Liu National Chung Hsing University, Taiwan

[ID:1] TIYUNTSONG: A SELF-PLAY REINFORCEMENT LEARNING APPROACH FOR ABR VIDEO STREAMING

Tianchi Huang, Xin Yao, Chenglei Wu, Rui-Xiao Zhang, Zhengyuan Pang, Lifeng Sun

Tsinghua University, China

[ID:2] EDGE-BOOST: ENHANCING MULTIMEDIA DELIVERY WITH MOBILE EDGE CACHING IN 5G-D2D NET- WORKS

Venkatraman Balasubramanian1, Mu Wang1, Martin Reisslein1, Changqiao Xu2

1Arizona State University, USA, 2University of Posts and Telecommunications, China

[ID:3] 3D MESH BASED INTER-IMAGE PREDICTION FOR IMAGE SET COMPRESSION

Hao Wu1, Xiaoyan Sun2, Jingyu Yang1, Feng Wu3

1Tianjin University, China, 2Microsoft Research Asia, China, 3University of Science and Technology of China, China

[ID:4] FAST INTER MODE PREDICTIONS FOR SHVC

Dayong Wang1, Yu Sun2, Weisheng Li1, Ce Zhu3, Frederic Dufaux4

1Chongqing University of Posts and Telecommunications, China, 2University of Central Arkansas, USA, 3University of Elec- tronic Science and Technology of China, China, 4CNRS - CentraleSupélec – Université Paris-Sud, France

[ID:5] HIT RATIO DRIVEN MOBILE EDGE CACHING SCHEME FOR VIDEO ON DEMAND SERVICES

Xing Chen, Lijun He, Shang Xu, Shibo Hu, Qingzhou Li, Guizhong Liu

Xi’an Jiao Tong University, China

[ID:6] QOE-DRIVEN MOBILE STREAMING: A LOCATION-AWARE APPROACH

Fang Liu, Wei Zhang, Yonggang Wen

Nanyang Technological University, Singapore

[ID:7] ENERGY EFFICIENT TRANSMISSION OF 3D MESHES OVER MMWAVE-BASED MASSIVE MIMO SYS- TEMS

149 IEEE ICME2019

Aris Lalos1, Gerasimos Arvanitis2, Evangelos Vlachos3, Konstantinos Moustakas2

1Industrial System Institute, Greece, 2University of Patras, Greece, 3University of Edinburgh, UK

[ID:8] IDENTIFYING INFLUENTIAL USERS IN MOBILE DEVICE-TO-DEVICE SOCIAL NETWORKS TO PROMOTE OFFLINE MULTIMEDIA CONTENT PROPAGATION

Hao Fan, Xu Tong, Qing Zhang, Tianxiang Zhang, Chenyang Wang and Xiaofei Wang

Tianjin University, China

150 Thursday, July 11, 2019

P-21: Object Detection II

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Ye Luo Tongji University, China

[ID:9] ACCURATE AND EFFICIENT OBJECT DETECTION WITH CONTEXT ENHANCEMENT BLOCK

Yuhao Chen1, Min Zhao1, Xin Tan2, Hong Tang1, Dihua Sun1

1Chongqing University, China, 2Shanghai Jiao Tong University, China

[ID:10] MASK GUIDED KNOWLEDGE DISTILLATION FOR SINGLE SHOT DETECTOR

Yousong Zhu1,2, Chaoyang Zhao1,2, Chenxia Han3, Jinqiao Wang1,2, Hanqing Lu1,2

1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Wuhan University, China

[ID:11] VIDEO TEXT DETECTION WITH FULLY CONVOLUTIONAL NETWORK AND TRACKING

Yang Wang, Lan Wang, Feng Su, Jiahao Shi

Nanjing University, China

[ID:12] CASCADE REGION PROPOSAL NETWORKS FOR OBJECT DETECTION IN THE WILD

DongMing Yang1,2, YueXian Zou1,2

1Peking University, China, 2Peng Cheng Laboratory, China

[ID:13] TRACKING ASSISTED FASTER VIDEO OBJECT DETECTION

Wenfei Yang, BinLiu, Weihai Li, Nenghai Yu

University of Science and Technology of China, China

[ID:14] REFINETEXT: REFINING MULTI-ORIENTED SCENE TEXT DETECTION WITH A FEATURE REFINEMENT MODULE

Pengyuan Xie, Jing Xiao, Yang Cao, Jia Zhu, Asad Khan

South China Normal University, China

[ID:15] MULTI-SCALE CAPSULE ATTENTION-BASED SALIENT OBJECT DETECTION WITH MULTI-CROSSED LAYER CONNECTIONS

Qi Qi1, Sanyuan Zhao1, Jianbing Shen1, Kin-Man Lam2

1Beijing Institute of Technology, China, 2The Hong Kong Polytechnic University, China

151 IEEE ICME2019

Thursday, July 11, 2019

P-22: Artificial Intelligence for Multimedia

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Kunal Swami Samsung, Korea

[ID:16] CONTINUOUS BIDIRECTIONAL OPTICAL FLOW FOR VIDEO FRAME SEQUENCE INTERPOLATION

Donghao Gu, Zhaojing Wen, Wenxue Cui, Rui Wang, Feng Jiang, Shaohui Liu

Harbin Institute of Technology, China

[ID:17] ROBUST DEEP TRACKING WITH TWO-STEP AUGMENTATION DISCRIMINATIVE CORRELATION FIL- TERS

Chunhui Zhang1,2, Shiming Ge1, Yingying Hua1,2, Dan Zeng3

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Shanghai University, China

[ID:18] EFFICIENT IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORKS WITH END TO END INTE- GER-ONLY DATAFLOW

Yiwu Yao, Bin Dong, Yuke Li, Weiqiang Yang, Haoqi Zhu

Yidun Lab, NetEase Inc, China

[ID:19] LEARNING MOTION-AWARE POLICIES FOR ROBUST VISUAL TRACKING

Qianqian Wang, Liansheng Zhuang, Ning Wang, Wengang Zhou, Houqiang Li

University of Science and Technology of China, China

[ID:20] KNOWLEDGE DISTILLATION WITH CATEGORY-AWARE ATTENTION AND DISCRIMINANT LOGIT LOSSES

Lei Jiang, Wengang Zhou, Houqiang Li

University of Science and Technology of China, China

[ID:21] UNSUPERVISED LEARNING OF DEPTH AND EGO-MOTION WITH SPATIAL-TEMPORAL GEOMETRIC CONSTRAINTS

Anjie Wang1, Yongbin Gao1, Zhijun Fang1, Xiaoyan Jiang1, Shanshe Wang2, Siwei Ma2, Jenq-Neng Hwang3

1Shanghai University of Engineering Science, China, 2Peking University, China, 3University of Washington, USA

152 [ID:22] LEARNING MINIMAL INTRA-GENRE MULTIMODAL EMBEDDING FROM TRAILER CONTENT AND RE- ACTOR EXPRESSIONS FOR BOX OFFICE PREDICTION

Ming-Ya Ko, Jeng-Lin Li, Chi-Chun Lee

National Tsing Hua University, Taiwan

[ID:23] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL

Yangwo Jian, Jing Xiao, Yang Cao, Asad Khan, Jia Zhu

South China Normal University, China

[ID:24] CORRELATION FILTER TRACKING WITH ADAPTIVE PROPOSAL SELECTION FOR ACCURATE SCALE ESTIMATION

Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang

Xiamen University, China

[ID:25] SUPERVISED CONSISTENT AND SPECIFIC HASHING

Haitao Wang, Min Meng, Hui Chen, JiGang Wu

Guangdong University of Technology, China

[ID:26] MOMENTUM BASED ON ADAPTIVE BOLD DRIVER

Shengdong Li1,2, Xueqiang Lv3

1Renmin University of China, China, 2Langfang Yanjing Vocational Technical College, China, 3Beijing Information Science and Technology University, China

[ID:27] A LIGHTWEIGHT NEURAL NETWORK BASED HUMAN DEPTH RECOVERY METHOD

Meiyu Huang1, Xueshuang Xiang1, Yao Xu1, Yiqiang Chen2

1Qian Xuesen Laboratory of Space Technology, China Academy of Space Technology, China, 2Institute of Computing Tech- nology, Chinese Academy of Sciences, China

153 IEEE ICME2019

Thursday, July 11, 2019

P-23: Multimedia Quality Assessment and Metrics

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Federica Battisti Roma Tre University, Italy

[ID:28] EVALUATION OF DEFOGGING: A REAL-WORLD BENCHMARK DATASET, A NEW CRITERION AND BASELINES

Shiyu Zhao1, Lin Zhang1, Shuaiyi Huang2, Ying Shen1, Shengjie Zhao1, Yukai Yang3

1Tongji University, China, 2ShanghaiTech University, China, 3Uppsala University, Sweden

[ID:29] RESA: A REAL-TIME EVALUATION SYSTEM FOR ABR

Yanan Wang, Haili Wang, Jiaoyang Shang, Hu Tuo iQIYI, Inc, China

[ID:30] BLIND IMAGE SHARPNESS ASSESSMENT AND ENHANCEMENT VIA DEEP AUXILIARY LEARNING

Qingbo Wu, Rui Ma, King N. Ngan, Hongliang Li, and Fanman Meng

University of Electronic Science and Technology of China, China

[ID:31] END-TO-END BLIND IMAGE QUALITY ASSESSMENT WITH CASCADED DEEP FEATURES

Jinjian Wu, Jupo Ma, Fuhu Liang, Weisheng Dong, Guangming Shi

Xidian University, China

[ID:32] ENCODING DISTORTIONS FOR MULTI-TASK FULL-REFERENCE IMAGE QUALITY ASSESSMENT

Chen Huang1,2, Tingting Jiang1, Ming Jiang1

1Peking University, China, 2Baidu Inc., China

[ID:33] CAUSAL ANALYSIS OF THE UNSATISFYING EXPERIENCE IN REALTIME MOBILE MULTIPLAYER GAMES IN THE WILD

Yuan Meng1,4, Shenglin Zhang2, Zijie Ye1, Benliang Wang2, Zhi Wang1, Yongqian Sun2, Qitong Liu3, Shuai Yang3, Dan Pei1,4

1Tsinghua University, China, 2Nankai University, China, 3Tencent, China, 4Beijing National Research Center for Information Science and Technology, China

154 Thursday, July 11, 2019

P-24: Oral-25 to Oral-28

Time: 15:30 - 17:00 PM

Room: 3rd Floor

Chair: Kuan-Hsien Liu National Taichung University of Science and Technology, Taiwan

[ID:34] DYNAMIC CASCADED REGRESSION NETWORK WITH REINFORCEMENT LEARNING FOR ROBUST FACE ALIGNMENT

Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li

University of Science and Technology of China, China

[ID:35] DEEP LEARNING FACE HALLUCINATION VIA ATTRIBUTES TRANSFER AND ENHANCEMENT

Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie and Jun Yu

University of Science and Technology of China, China

[ID:36] EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS USING MULTI-HYPERGRAPH NEURAL NETWORKS

Junjie Zhu1, Xibin Zhao1, Han Hu2, Yue Gao1

1Tsinghua University, China, 2Beijing Institute of Technology, China

[ID:37] GPS: GROUP PEOPLE SEGMENTATION WITH DETAILED PART INFERENCE

Yue Liao1, Tianrui Hui1, Chen Gao1, Si Liu2, Yao Sun3, Hefei Ling4, Bo Li2

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2Beihang University, China, 3iie, China, 4Huazhong University of Science and Technology, China

[ID:38] MULTI-LABEL IMAGE RECOGNITION WITH JOINT CLASS-AWARE MAP DISENTANGLING AND LABEL CORRELATION EMBEDDING

Zhao-Min Chen1,2 Xiu-Shen Wei2, Xin Jin, 2Yanwen Guo1,3

1Nanjing University, China, 2Megvii Technology, China, 3Science and Technology on Information Systems Engineering Labo- raty, China

[ID:39] REAL TIME COMPRESSED VIDEO OBJECT SEGMENTATION

Zhentao Tan, Bin Liu, Weihai Li, Nenghai Yu

University of Science and Technology of China, China

155 IEEE ICME2019

[ID:40] ACCURATE AND FAST FINE-GRAINED IMAGE CLASSIFICATION VIA DISCRIMINATIVE LEARNING

Zhihui Wang1, Shijie Wang1, Pengbo Zhang1, Haojie Li1, Bo Liu2

1Dalian University of Technology, China, 2Shanghai Jiao Tong University, China

[ID:41] POSE2BODY: POSE-GUIDED HUMAN PARTS SEGMENTATION

Zhong Li1, Xin Chen2, Wangyiteng Zhou2, Yingliang Zhang2, Jingyi Yu2

1University of Delaware, USA, 2ShanghaiTech University, China

[ID:42] RESIDUAL MAGNIFIER: A DENSE INFORMATION FLOW NETWORK FOR SUPER RESOLUTION

Zhan Shu1, Mengcheng Cheng1, Biao Yang1, Zhuo Su1, Xiangjian He2,3

1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia

[ID:43] EVERYONE IS A CARTOONIST: SELFIE CARTOONIZATION WITH ATTENTIVE ADVERSARIAL NET- WORKS

Xinyu Li, Wei Zhang, Tong Shen, Tao Mei

JD AI Research, China

[ID:44] SCALE-AWARE DEEP NETWORK WITH HOLE CONVOLUTION FOR BLIND MOTION DEBLURRING

Jichun Li, Ke Li, Bo Yan

Fudan University, China

[ID:45] REMOVING RAIN IN VIDEOS: A LARGE-SCALE DATABASE AND A TWO-STREAM CONVLSTM AP- PROACH

Tie Liu, Mai Xu and Zulin Wang

Beihang University, China

[ID:46] TOWARDS QOS-AWARE CLOUD LIVE TRANSCODING: A DEEP REINFORCEMENT LEARNING AP- PROACH

Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang

Tsinghua University, China

[ID:47] HIGH SPEED RECURRENT REGRESSION NETWORK FOR VISUAL TRACKING

Ding Ma, Xiangqian Wu

Harbin Institute of Technology, China

[ID:48] PAAE: A UNIFIED FRAMEWORK FOR PREDICTING ANCHOR LINKS WITH ADVERSARIAL EMBEDDING

156 Yanmin Shang1, Zhezhou Kang1, Yanan Cao1, Dongjie Zhang1, Yangxi Li2, Yang Li3, Yanbing Liu1

1Institute of Information Engineering, Chinese Academy of Sciences, China, 2National Computer Network Emergency Re- sponse technical Team, China, 3State Information Center, China

[ID:49] MANIFOLD ALIGNMENT AND DISTRIBUTION ADAPTATION FOR UNSUPERVISED DOMAIN ADAPTA- TION

Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying

Shanghai University, China

157 IEEE ICME2019

Workshops

Monday, July 8, 2019

W-01: Multimedia Services and Technologies for Smart-health(MUST-SH)

Time: 8:30 AM - 17:00 PM

Room: 5F

Organizers: Shamim Hossain King Saud University, Saudi Arabia

Stefan Goebel KOM, TU Darmstadt, Germany

Yin Zhang Zhongnan University of Economics and Law, China

8:30 - 8:35 Opening Remarks:

Yin Zhang Zhongnan University of Economics and Law, China

8:35 - 9:30 Keynote Talk:

Huimin Lu Kyushu Institute of Technology, Japan

9:30 - 10:00 Oral Session 1:

Session Chair: Shamim Hossain King Saud University, Saudi Arabia

FULLY CONVOLUTIONAL NETWORK FOR 3D HUMAN SKELETON ESTIMATION FROM A SIN- GLE VIEW FOR ACTION ANALYSIS

Wen-Nung Lie1, Guan-Han Lin1, Lung-Sheng Shih1, YuLing Hsu1, Thang Huu Nguyen2, Quynh Nguyen Quang Nhu2

1National Chung Cheng University, Taiwan, 2The University of Danang, University of Science and Technology, Vietnam

10:00 - 10:30 Coffee Break

10:30 - 12:00 Oral Session 2:

Session Chair: Stefan Goebel KOM, TU Darmstadt, Germany

10:30 - 11:00

ATTENTION BASED SEMI-SUPERVISED DICTIONARY LEARNING FOR DIAGNOSIS OF AU- TISM SPECTRUM DISORDERS

Meng Yang1,2, Qin Zhong1, Lin Chen3, Fanglin Huang4, Baiying Lei4

1Sun Yat-sen University, Guangzhou, China, 2Key Laboratory of Machine Intelligence and Advanced Comput- ing(SYSU), Ministry of Education, 3Sogou, China, 4Shenzhen University, China

11:00 - 11:30

RT-ADI: FAST REAL-TIME VIDEO REPRESENTATION FOR MULTI-VIEW HUMAN FALL DETEC- TION

158 Qianggang Ding, Fan Yang, Jiawei Li, Sifan Wu, Bowen Zhao, Zhi Wang, Shutao Xia

Tsinghua University, China

11:30 - 12:00

A NEW IMAGE WATERMARKING SCHEME FOR EFFICIENT TAMPER DETECTION, LOCALIZA- TION AND RECOVERY

Faranak Tohidi, Manoranjan Paul

Charles Sturt University, Australia

12:00 - 13:30 Lunch Break

13:30 - 15:00 Oral Session 3:

Session Chair: Yin Zhang Zhongnan University of Economics and Law, China

13:30 - 14:00

PREDICTING HUMAN GRASP LOCATIONS ON CUP HANDLES BY USING DEEP NEURAL NET- WORKS TO INFER HEAT SIGNATURES FROM DEPTH DATA

Yijun Jiang, Sean Banerjee, Natasha Kholgade Banerjee

Clarkson University, USA

14:00 - 14:30

HIERARCHICAL FUZZY INFERENCE SYSTEM FOR DIAGNOSING DENGUE DISEASE

Mubarak Alrashoud

King Saud University, Saudi Arabia

14:30 - 15:00

HUMAN-INTERACTION WEAKLY-SUPERVISED DEEP NETWORKS FOR SEMANTIC SEGMEN- TATION

Wenfeng Luo1, Meng Yang1,2

1Sun Yat-sen University, China, 2Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Educationl, China

15:00 - 15:30 Coffee Break 15:30 - 17:00 Oral Session 4: Session Chair: Shamim Hossain King Saud University, Saudi Arabia

15:30 - 16:15 THE PREDICTION MODEL OF BLOOD GLUCOSE CONCENTRATION FOR SMART HEALTH Han Yu, Jianmin Lu, Yue JIn, Binglei Yue, Xiao Ma Zhongnan University of Economics and Law, China

16:15 - 17:00 PREDICTING SPINE SURGERY COMPLICATIONS USING MACHINE LEARNING Mohamad Hoda1, Abdulmotaleb EI Saddik1, Eugene Wai2, Philippe Phan3

1University of Ottawa, Canada, 2The Ottawa Hospital, Canada, 3The Ottawa Hospital, Canada

159 IEEE ICME2019

Monday, July 8, 2019

W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)

Time: 8:30 AM - 12:00 PM

Room: 5H

Organizers: Wei-Ta Chu National Chung Cheng University, Taiwan

Norimichi Tsumura Graduate School of Engineering, Chiba University, Japan

Shoji Yamamoto Tokyo Metropolitan College of Industrial Technology, Japan

Toshihiko Yamasaki University of Tokyo, Japan

8:30 - 8:35 Opening Remarks:

Session Chair: Toshihiko Yamasaki

8:35 - 9:50 Oral Session 1: Multimedia Artworks Analysis

Session Chair: Norimichi Tsumura, Toshihiko Yamasaki

8:35 - 8:50

DEEPIR: A DEEP SEMANTICS DRIVEN FRAMEWORK FOR IMAGE RETARGETING

Jianxin Lin, Tiankuang Zhou, Zhibo Chen

University of Science and Technology of China, China

8:50 - 9:05

MULTI-DEPTH DILATED NETWORK FOR FASHION LANDMARK DETECTION

Zeng Kai, Jun Feng, Richard F E Sutcliffe, Wang Xiaoyu, Bu Qirong

NorthWest University, China

9:05 - 9:20

SALIENCY-GUIDED IMAGE STYLE TRANSFER

Xiuwen Liu, Zhi Liu, Xiaofei Zhou, Minyu Chen

Shanghai University, China

9:20 - 9:35

A MULTIMEDIA-BASED MOVIE STYLE MODEL

Priyankar Choudhary, Neeraj Goel, Mukesh Saini

Indian Institute of Technology Ropa, India

9:35 - 9:50

NEURAL STYLE TRANSFER WITH CONTENT DISCRIMINATION

160 Xiyu Yan, Yeli Xing, Zihao He, Tao Dai, Yong Jiang, Shutao Xia

Tsinghua University, China

10:00 - 10:30 Coffee Break

10:30 - 11:30 Keynote talk by Prof. Jia Jia

Session Chair: Toshihiko Yamasaki

11:30 - 12:00 Oral Session 2: Attractiveness Computing in Multimedia

Session Chair: Wei-Ta Chu

11:30 - 11:45

PREDICTING THE ATTRACTIVENESS OF REAL-ESTATE IMAGES BY PAIRWISE COMPARISON USING DEEP LEARNING

Xueting Wang, Yuki Takada, Youiti Kado, Toshihiko Yamasaki

The University of Tokyo, Japan

11:45 - 12:00

VIDEO-BASED STRESS LEVEL MEASUREMENT USING IMAGING PHOTOPLETHYSMOGRA- PHY

Ryota Mitsuhashi1, Kaito Iuchi1, Takashi Goto2, Akira Matsubara2, Takahiro Hirayama2, Hideki Hashizume2, Norimichi Tsumura1

1Chiba University, Japan, 2Daikin Industries LTD, Japan

161 IEEE ICME2019

Monday, July 8, 2019

W-03: Visual Emotion Analysis: Theories and Applications

Time: 13:30 - 17:30 PM

Room: 5H

Organizers: Lifang Wu Beijing University of Technology, China

Jufeng Yang Nankai University, China

Rongrong Ji Xiamen University, China

13:30 - 13:35 Opening Remarks

13:35 - 14:30 Keynote: Computation of Emotion (Jiebo Luo)

14:30 - 15:00 Invited Talk 1: Affective and aesthetic computing on social images (Jia Jia)

15:00 - 15:30 Coffee Break

15:30 - 16:00 Invited Talk 2: Visual sentiment analysis and beyond (Yanwei Fu)

16:00 - 16:30 Invited Talk 3: Weakly supervised coupled networks for visual sentiment analysis (Dongyu She)

16:30 -16:50

FEAFA: A WELL ANOATED DATABASE FOR FACIAL EXPRESSION ANALYSIS AND 3D FACIAL ANIMATION

Yanfu Yan1, Ke Lu1, Jian Xue1, Pengcheng Gao1, Jiayi Lyu2

1University of Chinese Academy of Sciences, China 2Capital Normal University, China

16:50 - 17:10

CROSS-DATABASE MICRO-EXPRESSION RECOGNITION: A STYLE AGGREGATED AND AT- TENTION TRANSFER APPROACH

Ling Zhou, Qirong Mao, Luoyang Xue

Jiangsu University, China

17:10 -17:30

THE FUSION KNOWLEDGE OF FACE, BODY AND CONTEXT FOR EMOTION RECOGNITION

Jingjing Wu, Yong Zhang, Li Ning

Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China

162 Monday, July 8, 2019

W-04: 1st International Workshop on Big Surveillance Data Analysis and Pro- cessing

Time: 8:30 AM - 12:00 PM

Room: 5I

Organizers: Weiyao Lin Shanghai Jiao Tong University, China

John See Multimedia University, Malaysia

Michael Ying Yang University of Twente, the Netherlands

8:30 - 10:00 Oral Session 1: Object Motion Analysis in Big Surveillance Videos

Session Chair: Weiyao Lin, Michael Ying Yang

8:30 - 8:45

DEFORMATION SAMPLE GENERATED NETWORK FOR ROBUST VISUAL TRACKING

Zizi Li, Yuan Zhou, Chunping Hou

Tianjin University, China

8:45 - 9:00

PRESERVING STRUCTURAL RELATIONSHIPS FOR PERSON RE-IDENTIFICATION

Liqiang Bao1, Bingpeng Ma1, Hong Chang2, Xilin Chen2

1University of Chinese Academy of Sciences, China 2Chinese Academy of Sciences, China

9:00 - 9:15

ADAPTIVE UPDATING SIAMESE NETWORK WITH LIKE-HOOD ESTIMATION FOR SURVEIL- LANCE VIDEO OBJECT TRACKING

Zhenxian Zheng, Yang Yi, Jinlong Shen, Jiahao Zhang

Sun Yat-sen University, China

9:15 - 9:30

A MULTIMODAL LOSSLESS CODING METHOD FOR SKELETONS IN VIDEOS

Xiaoyi He, Mingzhou Liu, Weiyao Lin, Xintong Han, Yanmin Zhu, Hongtao Lu, Hongkai Xiong

Shanghai Jiao Tong University, China

9:30 - 9:45

EFFICIENT SEMANTIC-BASED VEHICLE RETRIEVAL IN LONG-TERM CAR PARK VIDEOS

Clarence Weihan Cheong, Ryan Woei-Sheng Lim, John See, Lai-Kuan Wong, Ian Kim Teck Tan

Multimedia University, Malaysia

9:45 - 10:00

163 IEEE ICME2019

SINGLE IMAGE HAZE REMOVAL BY FEATURE MAPPING

Feiniu Yuan1, Yu Zhou2, Xue Xia2, Ya Li2

1Shanghai Normal University, China, 2Jiangxi University of Finance and Economics, China

10:00 - 10:30 Coffee Break

10:30 - 12:00 Oral Session 2: Human & Action Sensing for Big Surveillance Videos

Session Chair: Weiyao Lin, Michael Ying Yang

10:30 - 10:45

MOTION-LET CLUSTERING FOR SKELETON-BASED ACTION RECOGNITION

Jianyu Yang1, Chen Zhu1, Junsong Yuan2

1Soochow University, China, 2State University of New York at Buffalo, USA

10:45 - 11:00

DEEP KEY CLIPS-VIDEO FEATURE FUSION FRAMEWORK FOR ACTION RECOGNITION

Chao Li1, Yue Ming1, Yuan Shen2, Hui Yu3

1Beijing University of Posts and Telecommunications, China 2Tencent Technology (Beijing) Co., Ltd, China 3University of Portsmouth, UK

11:00 - 11:15

HUMAN IDENTIFICATION RECOGNITION IN SURVEILLANCE VIDEOS

Kai Jin, Xuemei Xie, Fangyu Wang, Xiao Han, Guangming Shi

Xidian University, China

11:15 - 11:30

AGE ESTIMATION FOR LOW-QUALITY FACIAL IMAGES: FROM SEPARATE DCNNS TO A DE- CISION FUSER

Kuan-Hsien Liu1, Pak Ki Chan2, Tsung-Jung Liu3, Hsiu-An Her1

1National Taichung University of Science and Technology, Taiwan, 2China Medical University Hospital,China 3National Chung Hsing University, Taiwan

11:30 - 11:45

SEMANTIC SEGMENTATION OF SATELLITE IMAGES USING A U-SHAPED FULLY CONNECT- ED NETWORK WITH DENSE RESIDUAL BLOCKS

Eric R Narciso Molina, Zenghui Zhang

Shanghai Jiao Tong University, China

11:45 - 12:00

MTCNN WITH WEIGHTED LOSS PENALTY AND ADAPTIVE THRESHOLD LEARNING FOR FA- CIAL ATTRIBUTE PREDICTION

Xingting He, Pingyu Wang, Zhicheng Zhao, Yanyun Zhao, Fei Su

Beijing University of Posts and Telecommunications, China

164 Monday, July 8, 2019

W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car

Time: 13:30 - 17:00 PM

Room: 5I

Organizers: Dong Zhao Beijing University of Posts and Telecommunications, China

Chenqiang Gao Chongqing University of Posts and Telecommunications, China

Jiayi Ma Wuhan University, China

Quan Zhou Nanjing University of Posts and Telecommunications, China

Ji Zhao TuSimple, China

Yu Zhou Beijing University of Posts and Telecommunications, China

13:30 - 13:35 Opening Remarks:

Yu Zhou Huazhong University of Science and Technology, China

13:35 - 14:10 Keynote Talk:

Yiqun Li Huazhong University of Science and Technology, China

14:10 - 14:45 Keynote Talk:

Chen Chen University of North Carolina at Charlotte, USA

14:45 - 15:05 Oral Session 1:

Session Chair: Dong Zhao

14:45 -15:05

MULTI-PATH FUSION NETWORK FOR HIGH-RESOLUTION HEIGHT ESTIMATION FROM A SINGLE ORTHOPHOTO

Yiteng Zhang, Xuejin Chen

University of Science and Technology of China, China

15:05 - 15:25 Coffee Break

15:25 - 16:00 Keynote Talk:

Lin Zhang Tongji University, China

16:00 - 17:00 Oral Session 2:

Session Chair:Jiayi Ma

16:00 - 16:20

FACE ANTI-SPOOFING BASED ON MULTI-LAYER DOMAIN ADAPTATION

Fengshun Zhou1,2, Chenqiang Gao1,2, Fang Chen1,2, Chaoyu Li1,2, Xindou L1,2, Feng Yang1,2, Yue Zhao1,2

1Chongqing University of Posts and Telecommunications, Chongqing, China, 2Chongqing Key Laboratory of Signal and Information Processing, Chongqing 400065, China

165 IEEE ICME2019

16:20 - 16:40

SELF-ATTENTION RELATION NETWORK FOR FEW-SHOT LEARNING

Binyuan Hui, Pengfei Zhu, Qinghua Hu, Qilong Wang

Tianjin University, China

16:40 - 17:00

BISE-RESNET: COMBINE SEGMENTATION AND CLASSIFICATION NETWORKS FOR ROAD FOLLOWING ON UNMANNED AERIAL VEHICLE

Dian Lyu, Peng Cheng, Ruizhou Liu, Liang Liu

Beijing University of Posts and Telecommunication, China

166 Monday, July 8, 2019

W-06: Information Theory and Multimedia Computing (ITMC)

Time: 8:30 AM - 16:30 PM

Room: 5J

Organizers: Ran He Chinese Academy of Sciences, China

Xiaotong Yuan Nanjing University, China

Jitao Sang Beijing Jiaotong University, China

8:50 - 9:00 Opening

9:00 - 10:00 Keynote Talk: Ran He

10:00 - 10:15 Coffee Break

10:15 - 11:45 Oral Session 1:

Session Chair: Ran He

10:15 – 10:30

HYBRID DEFENSE FOR DEEP NEURAL NETWORKS: AN INTEGRATION OF DETECTING AND CLEANING ADVERSARIAL PERTURBATIONS

Weiqi Fan, Guangling Sun, Yuying Su, Zhi Liu, Xiaofeng Lu

Shanghai University, China

10:30 – 10:45

SKETCH-BASED IMAGE RETRIEVAL VIA A SEMI-HETEROGENEOUS CROSS-DOMAIN NET- WORK

Chuo Li, Yuan Zhou, Jianxing Yang

Tianjin University, Tianjin, China

10:45 – 11:00

QUESTION SPLITTING AND UNBALANCED MULTI-MODAL POOLING FOR VQA

Mengfei Li, Huan Shao, Yi Ji, Yang Yang, ChunPing Liu

Soochow University Suzhou, , China

11:00 – 11:15

AI-GAN: SIGNAL DE-INTERFERENCE VIA ASYNCHRONOUS INTERACTIVE GENERATIVE AD- VERSARIAL NETWORK

Xin Jin, Zhibo Chen, Jianxin Lin, Wei Zhou, Jiale Chen, Chaowei Shan

University of Science and Technology of China, Hefei, China

11:15 – 11:30

Visual object tracking via Graph Convolutional Representation

167 IEEE ICME2019

Zhengzheng Tu, Ajian Zhou, Bo Jiang, Bin Luo

Anhui University, China

11:30 – 11:45

MOIRE PATTERN REMOVAL WITH MULTI-SCALE FEATURE ENHANCING NETWORK

Tian yu Gao1, Yanqing Guo1, Xin Zheng1, Qianyu Wang1, Xiangyang Luo2

1Dalian University of Technology, China 2The State Key Laboratory of Mathematical Engineering and Advanced Computing, China

12:00 - 13:30 Lunch Break

13:30 - 15:00 Oral Session 2:

Session Chair: Yi Li

13:30 – 13:45

DEEP COLOR IMAGE DEMOSAICKING WITH FEATURE PYRAMID CHANNEL ATTENTION.

Qi Kang, Ying Fu, Hua Huang

Beijing Institute of Technology, China

13:45 – 14:00

REAL-WORLD IMAGE DENOISING VIA WEIGHTED LOW RANK APPROXIMATION.

Yuenan Guo, Ying Fu, Hua Huang

Beijing Institute of Technology, China

14:00 – 14:15

TWO-STRE SPARSE NETWORK FOR ACCURATE IMAGE SUPER-RESOLUTION.

Ling Hu1,2, Shuhui Wang1, Liang Li1, Qingming Huang1,2

1Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, China, 2University of Chinese Academy of Sci- ences, Beijing, 100049, China

14:15 – 14:30

EMBEDDING NON-LOCAL MEAN IN SQUEEZE-AND-EXCITATION NETWORK FOR SINGLE IMAGE DERAINING.

Cong Wang, Hongyan Wang, Zhixun Su, Yan Yang

Dalian University of Technology, China

14:30 – 14:45

RELATIVE DEPTH ESTIMATION PRIOR FOR SINGLE IMAGE DEHAZING.

Jinbao Wang1, Ke Lu1, Jian Xue1, Yutong Kou2

1University of Chinese Academy of Sciences, China 2Huazhong University of Science & Technology, China

14:45 – 15:00

LOW-LIGHT IMAGE ENHANCEMENT WITH ATTENTION AND MULTI-LEVEL FEATURE FU- SION.

Lei Wang1, guangtao fu2, zhuqing jiang1, Guodong Ju3, aidong men1

168 1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China, 3GuangDong TUS-TuWei Technology Co, Ltd, China

15:00 - 15:30 Coffee Break

15:30 - 16:30 Oral Session 3:

Session Chair: Yi Li

15:30 – 15:45

BLIND MESH QUALITY ASSESSMENT METHOD BASED ON CONCAVE, CONVEX AND STRUC- TURAL FEATURES ANALYSES.

Yaoyao Lin, Mei Yu, Ken Chen, Gangyi Jiang, Zongju Peng, Fen Chen

Faculty of Information Science and Engineering, Ningbo University, Ningbo, China

15:45 – 16:00

K-COVERS FOR ACTIVE LEARNING IN IMAGE CLASSIFICATION.

Yeji Shen1, Yuhang Song1, Hanhan Li2, Shahab Kamali2, Bin Wang1, C.-C. Jay Kuo1

1University of Southern California, USA, 2Google Research, USA

16:00 – 16:15

DISTRIBUTION DISCREPANCY MAXIMIZATION FOR IMAGE PRIVACY PRESERVING.

Sen Liu, Jianxin Lin, Zhibo Chen

University of Science and Technology of China, China

16:15 – 16:30

A NOVEL DISTANCE LEARNING FOR ELASTIC CROSS MODAL AUDIO-VISUAL MATCHING.

Rui Wang1, Huaibo Huang2,3, Xufeng Zhang1, Jixin Ma4, Aihua Zheng1

1Anhui University, China, 2University of Chinese Academy of Sciences, China, 3CASIA, China, 4University of Greenwich, UK

169 IEEE ICME2019

Friday, July 12, 2019

W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)

Time: 8:30 AM - 12:00 PM

Room: 5F

Organizers: Tian Gan Shandong University, China

Wen-Huang Cheng National Chiao Tung University, Taiwan

Kai-Lung Hua National Taiwan University of Science and Technology, Taiwan

Klaus Schoeffmann Klagenfurt University, Austria

Vladan Velisavljevic University of Bedfordshire, UK

Christian von der Weth National University of Singapore, Singapore

8:30 - 9:00 Opening & Keynotes

9:00 - 10:00 Oral Session 1:

Session Chair: Wen-Huang Cheng

9:00 - 09:15

FINE DETECTION AND CLASSIFICATION OF MULTI-CLASS BARCODE IN COMPLEX ENVI- RONMENTS

Jiahe Zhang1, Jun Jia1, Zehao Zhu1, Xiongkuo Min1, Guangtao Zhai1, Xiao-Ping Zhang2

1Shanghai Jiao Tong University, China, 2Ryerson University, Canada

9:15 - 09:30

DEEP LEARNING BASED METHOD FOR PRUNING DEEP NEURAL NETWORKS

Lianqiang Li1, Jie Zhu1, Ming-Ting Sun2

Shanghai Jiao Tong University, China, 2University of Washington, USA

9:30 - 09:45

ALPS 1.0: Towards Automated Lecture Profiling System

Pratibha Kumari1, Prakhar Jain1, Swarna Sahay1, Gan Tian2, Mukesh Saini1

1Indian Institute of Technology Ropar, India, 2Shandong University, China

9:45 - 10:00

VAS360: QOE-DRIVEN VIEWPORT ADAPTIVE STREAMING FOR 360 VIDEO

Yuxiang Hu, Yu Liu, Yumei wang

Beijing University Posts and Telecommunications, China

10:00 - 10:30 Coffee Break

170 10:30 - 11:30 Oral Session 2:

Session Chair: Tian Gan

10:30 - 10:45

FUSING GEOGRAPHIC INFORMATION INTO LATENT FACTOR MODEL FOR PICK-UP REGION RECOMMENDATION

Zhuhua Liao, Jian Zhang, Yizhi Liu

Hunan University of Science & Technology, China

10:45 - 11:00

A FLEXIBLE VIEWPORT-ADAPTIVE PROCESSING MECHANISM FOR REAL-TIME VR VIDEO TRANSMISSION

Anyue Xu, Xinyu Chen, Yu Liu, Yumei Wang

Beijing University Posts and Telecommunications, China

11:00 - 11:15

OBJECTIVE QUALITY ASSESSMENT METHOD FOR STEREOSCOPIC IMAGE RETARGETING

Salah Addin Mohammed M Mohammed, Ya Zhou, Zhibo Chen, Houqiang Li

University of Science and Technology of China, China

11:15 - 11:30

OPTIMAL MULTI-CODEC ADAPTIVE BITRATE STREAMING

Yuriy Reznik, Xinagbo Li, Karl Lillevold, Abhijith Jagannath, Justin Greer

Brightcove Inc. USA

11:30 - 12:00

Best Paper Award Announcement

171 IEEE ICME2019

Friday, July 12, 2019

W-08: Time-sequenced Multimedia Computing

Time: 13:30 - 17:45 PM

Room: 5F

Organizers: Wei Li Fudan University, China

Mengyao Zhu Shanghai University, China

Bing-Kun Bao Nanjing University of Posts and Telecommunications, Nanjing, China

Min Xu University of Technology Sydney, Australia

Xi Shao Nanjing University of Posts and Telecommunications, Nanjing, China

13:30 - 13:55

AUDIO SCENE CLASSIFICATION WITH DISCRIMINATIVELY-TRAINED SEGMENT-LEVEL FEA- TURES

Haichuan Bai1,2, Hangting Chen1,2, Yonghong Yan1,2

1Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China

13:55 - 14:20

EFFICIENT IMPLICIT FOURIER COMPRESSION BASED CONVOLUTIONAL FEATURES FOR VISUAL TRACKING

Ridong Zhu, Xiaoyuan Yang, Jingkai Wang, Zhengze Li

Beihang University, China

14:20 - 14:45

AUDIO2FACE: GENERATING SPEECH/FACE ANIMATION FROM SINGLE AUDIO WITH ATTEN- TION-BASED BIDIRECTIONAL LSTM NETWORKS

Guanzhong Tian1, Yi Yuan2, Yong Liu1

1Zhejiang University, China, 2Fuxi AI Lab, Netease, China

14:45 - 15:10

DEEP VOCODER: LOW BIT RATE COMPRESSION OF SPEECH WITH DEEP AUTOENCODER

Gang Min1, Changqing Zhang 1, Xiongwei Zhang 2, Wei Tan1

1National University of Defense Technology, China 2Army Engineering University of PLA, China

15:10 - 15:30 Coffee Break

15:30 - 15:55

BLIND ESTIMATION OF REVERBERATION TIME USING BINAURAL COMPLEX IDEAL RATIO MASK

MingYang Chai1, TianTian Li1, MengYao Zhu1, Tao Wang1, Wen Zhang2

172 1Shanghai University, China, 2Northwestern Polytechnical University, China

15:55 - 16:20

OPV: BIAS CORRECTION BASED OPTIMAL PROBABILISTIC VIEWPORT-ADAPTIVE STREAMING FOR 360-DEGREE VIDEO

Weihong Lin, Xinggong Zhang, Zongming Guo, Wei Hu

Peking University, China

16:20 - 16:45

SVD-BASED CHANNEL PRUNING FOR CONVOLUTIONAL NEURAL NETWORK IN ACOUSTIC SCENE CLASSIFICATION MODEL

Jun Wang1, Shengchen Li1, Wenwu Wang2

1Beijing University of Posts and Telecommunications, China, 2University of Surrey, UK

16:45 - 17:10

MULTI-LEVEL ATTENTION MODEL WITH DEEP SCATTERING SPECTRUM FOR ACOUSTIC SCENE CLASSIFICATION

Zhitong Li1, Yuanbo Hou2, Xiang Xie1,3, Shengchen Li2, Liqiang Zhang1, Shixuan Du1, Wei Liu1

1Beijing Institute of Technology, China, 2Beijing University of Posts and Telecommunications, China, 3Beijing Institute of Technology, China

17:10 - 17:45

A MULTI-CRITERIA SUBJECTIVE EVALUATION METHOD FOR BINAURAL AUDIO RENDERING TECHNIQUES IN VIRTUAL REALITY APPLICATIONS

Zhaoyu Yan, Jing Wang, Zhuoran Li

Beijing Institute of Technology, China

173 IEEE ICME2019

Friday, July 12, 2019

W-09: Smart Camera Gigavision ( ) Time: 8:30 AM - 12:00 PM

Room: 5I

Organizers: Lu Fang Associate Professor, Tsinghua-Berkeley Shenzhen Institute, China

David J. Brady Duke University, USA

Shenghua Gao Assistant Professor, ShanghaiTech University, China

Yuchen Guo Tsinghua University, China

8:30 - 8:35 Opening Remarks:

Lu Fang Tsinghua University, China

8:35 - 9:15 Plenary Talk:

David J. Brady Duke University, USA

9:15 - 9:40 Keynote Talk:

Lu Fang Tsinghua University, China

9:40 - 10:05 Oral Session 1:

Session Chair: Lu Fang

SCALE-ADAPTIVE CNN BASED CROWD COUNTING AND DYNAMIC SUPERVISION

Zhengxin Li1, Jing Li1, Ling Xie1, Jianli Liu2

1ShanghaiTech University, Shanghai, China, 2Jiangnan University, Wuxi, China

SPATIAL-TEMPORAL CODEC ACCURACY CALIBRATION FOR MULTI-SCALE GIGA-PIXEL MACRO- SCOPE

Lei WANG, Jinli SUO, Jingtao FAN

Tsinghua University, China

10:05 - 10:20 Coffee Break

10:20 - 10:45 Keynote Talk:

Zhan Ma Nanjing University, China

10:45 - 11:10 Keynote Talk:

Shenghua Gao ShanghaiTech University, China

11:10 - 11:35 Keynote Talk:

Xing Lin Tsinghua University, China

11:35 - 12:00 Oral Session 2:

174 Session Chair: Lu Fang

SEGMENTATION OF BUILDING FOOTPRINTS WITH XCEPTION AND IOULOSS

Kepeng Xu1, Yunye Zhang1, Wenxin Yu1, Zhiqiang Zhang1, Jingwei Lu2, Yibo Fan3, Gang He4, Zhuo Yang5

1Southwest University of Science and Technology, China, 2Cadence Design Systems, Inc, 3Fudan University, China 4Xidian University, China 5Guangdong University of Technology, China

GIGAPIXEL-LEVEL IMAGE CROWD COUNTING USING CSRNET

Zhijie Cao1, Renyou Yan2, Yiyong Huang3, Zhiru Shi4

1Shanghai Jiao Tong University, China, 2ShanghaiTech University, China, 3Shanghai University, China, 4Yoke Intelligence, China

175 IEEE ICME2019

Friday, July 12, 2019

W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding

Time: 13:30 AM - 17:45 PM

Room: 5I

Organizers: Yang Yang University of Electronic Science and Technology of China, China.

Yang Wang Dalian University of Technology, China.

Xing Xu University of Electronic Science and Technology of China, China.

Zi Huang University of Queensland, Australia.

13:30 - 13:35 Opening Remarks

13:35 - 14:05 Keynote 1: Tentative

14:05 - 15:35 Oral Session 1: Knowledge Transfer Methods in Vision and Language

Session Chair: Yang Yang

14:05 - 14:20

MASK-GUIDED STYLE TRANSFER NETWORK FOR PURIFYING REAL IMAGES

Tongtong Zhao, Yuxiao Yan, Jinjia Peng, Huibing Wang, Xianping Fu

Dalian Maritime University, China

14:20 - 14:35

IMITATION LEARNING FOR SENTENCE GENERATION WITH DILATED CONVOLUTIONS USING ADVERSARIAL TRAINING

JianWei Peng1, MinChun Hu1, ChuanWang Chang2

1National Cheng Kung University, Taiwan, 2Kun Shan University, Taiwan

14:35 - 14:50

NON-RIGID 3D SHAPE RETRIEVAL BASED ON MULTI-VIEW METRIC LEARNING

Haohao Li, Shengfa Wang, Nannan Li, Zhixun Su, Ximin

Dalian University of Technology, China

14:50 - 15:05

WHAT TOPICS DO IMAGES SAY: A NEURAL IMAGE CAPTIONING MODEL WITH TOPIC REPRESEN- TATION

Feng Chen, Songxian Xie, Xinyi Li, Shasha Li, Jintao Tang, Ting Wang

National University of Defense Technology, China

15:05 - 15:30 Coffee Break

15:30 - 16:00 Keynote 2: Tentative

16:00 - 16:30 Oral Session 1: Knowledge Transfer Methods in Vision and Language

176 Session Chair: Yang Yang

16:00 - 16:15

CROSS DOMAIN KNOWLEDGE TRANSFER FOR UNSUPERVISED VEHICLE RE-IDENTIFICATION

Jinjia Peng, Huibing Wang, Tongtong Zhao and Xianping Fu

Dalian Maritime University, China

16:15 - 16:30

CYCLE-CONSISTENT DIVERSE IMAGE SYNTHESIS FROM NATURAL LANGUAGE

Zhi Chen, Yadan Luo

The University of Queensland, Australia

16:30 - 18:00 Session 2: Knowledge Transfer Related Application

Session chair: Yang Wang

16:30 - 16:45

SELF-WEIGHTED MULTIVIEW METRIC LEARNING BY MAXIMIZING THE CROSS CORRELATIONS

Huibing Wang, Jinjia Peng and Xianping Fu

Dalian Maritime University, China

16:45 - 17:00

CAUSATION-DRIVEN VISUALIZATIONS FOR INSURANCE RECOMMENDATION

Zhixiu Liu1, Chengxi Zang2, Kun Kuang1, Hao Zou1, Hu Zheng3, Peng Cui1

1Tsinghua University, China, 2Cornell University, USA, 3Datebao Insurance Ltd, China

17:00 - 17:15

CROSS-MODAL TRANSFER HASHING BASED ON COHERENT PROJECTION

En Yu1,2, Jiande Sun1, Li Wang1, Xiaojun Chang3, Huaxiang Zhang1, Alexander G. Hauptmann2

1Shandong Normal University, China, 2Carnegie Mellon University, USA, 3Monash University, Australia

17:15 - 17:30

RELATION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION

Bin Deng (Shenzhen University)*; Daming Shi (College of Computer Science and Software Engineering, Shen-

zhen University)

Tianjin University, China

17:30 - 17:45

ANNOTATING 3D MODELS AND THEIR PARTS VIA DEEP FEATURE EMBEDDING

Kouki Omata, Takahiko Furuya, Ryutarou Ohbuchi

University of Yamanashi, Japan

177 IEEE ICME2019

Friday, July 12, 2019

W-11: AI Technology for Visual Fashion Computing Time: 8:30 - 9:50 AM

Room: 5J

Organizers: Wei Zhang JD AI Research, China

Ting Yao JD AI Research, China

Wen-Huang Cheng National Chiao Tung University, Taiwan

8:30 - 8:35 Opening Remarks

Session Chairs: Wei Zhang JD AI Research, China

8:35 - 9:00

DISENTANGLED HUMAN ACTION VIDEO GENERATION VIA DECOUPLED LEARNING

Lingbo Yang1, Zhenghui Zhao1, Shiqi Wang2, Shanshe Wang1, Siwei Ma1, Wen Gao1

1Peking University, China, 2City University of Hong Kong, China

9:00 - 9:25

PERSONALIZED IMAGE RECOMMENDATION WITH PHOTO IMPORTANCE AND USER-ITEM IN- TERACTIVE ATTENTION

Wan Zhang, Zepeng Wang, Tao Chen

Hefei University of Technology, China

9:25 - 9:50

PARTIALLY OCCLUDED HEAD POSTURE ESTIMATION FOR 2D IMAGES USING PYRAMID HOG FEATURES

Jun Wu1, Z. Shang1, K. Wang1, J. Zhai1, Y. Wang1, F. Xia1, W. Li1, J. Zhang1, Fan Zhang2

1Northwestern Polytechnical University, China, 2Zhejiang University, China

178 Friday, July 12, 2019

W-12: 2nd IEEE International Workshop on Faces in Multimedia(FacesMM)

Time: 10:30 - 12:00 AM

Room: 5J

Organizers: Yun Fu Northeastern University, China

Joseph P Robinson Northeastern University, China

Ming Shao University of Massachusetts, Dartmouth

Siyu Xia Southeast University, China

10:30 - 10:35 Opening Remarks: Joseph P Robinson

10:35 - 11:15 Keynote Talk: Di Huang Beihang University, China

11:15 - 11:30

ADAPTIVE SALIENCE PRESERVING POOLING FOR DEEP CONVOLUTIONAL NEURAL NETWORKS

Yu Zhenyu1, Dai Shiyu1, Xing Yuxiang2

1Nuctech Company Limited, China, 2Tsinghua University, China

11:30 - 11:45

FULLY AUTOMATIC PHOTOREALISTIC FACIAL EXPRESSION AND EYE GAZE TRANSFER WITH A SINGLE IMAGE

Wanxin Xu, Sen-ching Cheung

University of Kentucky, USA

11:45 - 12:00

DEEP DOMAIN ADAPTATION FOR ASIAN FACE RECOGNITION VIA ADA-IBN

Chen Qian, Yi Jin, Yidong Li, Congyan Lang, Songhe Feng, Tao Wang

Beijing Jiaotong University, China

179 IEEE ICME2019

Friday, July 12, 2019

W-13: The Third Workshop on Human Identification in Multimedia (HIM)

Time: 13:30 - 17:30 PM

Room: 5J

Organizers: Liangliang Ren Department of Automation University of Tsinghua University, China

Guangyi Chen Dept. of Automation University of Tsinghua University, China

Dr. Jiwen Lu Department of Automation Tsinghua University, China

13:30 - 13:35 Introduction

13:35 - 14:25 Invited Talk: Person Re-identification

Weishi Zheng

14:25 - 14:55 Oral Session 1: Human Identification

Session chair: Liangliang Ren

14:25 - 14:40

SIMILARITY PRESERVED CAMERA-TO-CAMERA GAN FOR PERSON RE-IDENTIFICATION

Jianlei Liu1, Yun Zhou2, Lingchuan Sun1, Zhuqing Jiang1

1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China

14:40 - 14:55

UNSUPERVISED DOMAIN ADAPTATION FOR DISGUISED FACE RECOGNITION

Fangyu Wu1,2, Shiyang Yan3, Jeremy S. Smith2, Wenjin Lu1, Bailing Zhang4

1Xi’an Jiaotong-liverpool Universit, China, 2University of Liverpool, Liverpool, 3Queen’s University Belfast, UK, 4Zhejiang University, China

15:00 - 15:30 Coffee Break

15:30 - 16:45 Oral Session 2: Detection and Tracking

Session chair: Guangyi Chen

15:30 - 15:45

DUAL-CYCLE DEEP REINFORCEMENT LEARNING FOR STABILIZING FACE TRACKING

Congcong Zhu, Zhenhua Yu, Suping Wu, Hao Liu

Ningxia University, China

15:45 - 16:00

MULTI-TASK LEARNING FOR PEDESTRIAN BODY PARTS DETECTION AND MULTI-ATTRIBUTE

180 CLASSIFICATION

Miaomiao Lou1,2, Lin Chen1, Feng Guo2

1Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Science,China 2Chengdu Univer- sity of Information Technology,China

16:00 - 16:15

CONTEXT ATTENTION MODULE FOR HUMAN HAND DETECTION

Zhihuai Xie1, Shaojie Wang2, Wentian Zhao2, Zhenhua Guo1

1Department of Information Science and Technology, Graduate School at Shenzhen, Tsinghua University, China, 2Department of Computer Science, University of Rochester, USA

16:15 - 16:30

TOWARD ROBUST ONLINE ADAPTIVE VISUAL TRACKING VIA PYRAMIDAL FEATURES EX- TRACTION

Shuai Bai1, Yuan Dong1, Ting-Bing Xu2, Hongliang Bai3

1Beijing University of Posts and Telecommunications, China, 2Institute of Automation of Chinese Academy of Sciences, China, 3Beijing FaceAll Co., China

16:30 - 16:45

IMPROVING HUMAN POSE ESTIMATION WITH SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORKS

Zhongzheng Cao, Rui Wang, Xiangyang Wang, Zhi Liu, Xiaoqiang Zhu

Shanghai University, China

16:45 - 17:30 Oral Session 3: Multimedia Processing

Session chair: Liangliang Ren

16:45 - 17:00

COLLABORATIVE REPRESENTATION GUIDED GRAPH LEARNING FOR VISUAL CLASSIFICATION

Sheng Huang, Yongxin Ge, Feiyu Chen, Kewen He, Xiaohong Zhang

Chongqing University, China

17:00 - 17:15

SPORTS HIGHLIGHTS GENERATION USING DECOMPOSED AUDIO INFORMATION

Muhammad Rafiqul Islam, Manoranjan Paul, Michael Antolovich, Ashad Kabir

Charles Sturt University, Australia

17:15 - 17:30

NEW BENCHMARK DATASETS AND A CHARACTER IDENTIFICATION SYSTEM ON TV SERIES

Zhuo Lei1, Qian Zhang2, Guoping Qiu3,4

1The University of Nottingh Ningbo China, 2University of Nottingh Ningbo China, 3Shenzhen University, China, 4University of Nottingham, UK

181 IEEE ICME2019

Student Program

Wednesday, July 10, 2019

Student Career Lunch

Time: 12:30 - 14:00 PM

Room: 5I

Chair: Weiyao Lin Shanghai Jiao Tong University, China

Xiaoyan Sun Microsoft Research Asia, China

Shaoen Wu Ball State University, China

3MT Competition

Time: 14:00 - 15:30 PM

Room: 5I

Chair: Weiyao Lin Shanghai Jiao Tong University, China

Xiaoyan Sun Microsoft Research Asia, China

Shaoen Wu Ball State University, China

14:00 ENHANCING QUALITY FOR COMPRESSED VIDEO

Ren Yang

ETH Zurich, Switzerland

14:05 STUDENT PARTICIPATION IN ICME2019

Qiyang Zhang

Wuhan University of Technology, China

14:10 RESEARCH ON LONG-TERM STABLE VISUAL TRACKING

Yuqi Han

Beijing Institute of Technology, China

14:15 PORTRAIT INSTANCE SEGMENTATION FOR MOBILE DEVICES

Lingyu Zhu

182 Tampere University, Finland

14:20 MODELING BOTH CONTEXT AND SPEAKER-SENSITIVE DEPENDENCE FOR EMOTION DETECTION IN MULTI-SPEAKER CONVERSATIONS

Dong Zhang

Soochow University, China

14:25 FOCUSED PLENOPTIC CAMERA AND CALIBRATION

Xufu Sun

Tsinghua University, China

14:30 ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MODAL SIMILARITIES

Xin Wen

Tsinghua University, China

14:35 DATA AUGMENTATION FOR MONAURAL SINGING VIOCE SEPARATION BASED ON VARIATIONAL AUTOENCODER-GENERATIVE ADVERSARIAL NETWORK

Boxin He

Tianjin Polytechnic University, China

14:40 MACHINE LEARNING FOR ACOUSTIC SCENE CLASSIFICATION

TrucThi Kim Nguyen

Graz University of Technology, Austria

14:45 STUDENT CAREEN LUNCH

Tie Liu

Beihang University, China

14:50 AN ADAPTIVE PARAMETER MODEL FOR DCT-BASED WATERMARKING SCHEMES

Ying Huang

Taiyuan University of Technology, China

14:55 NEWS-ORIENTED STOCK MOVEMENT PREDICTION ON DENSE TEMPORAL SEQUENCE USING IMPLICIT NEWS

Tsun-Hsien Tang

National Taiwan University, Taiwan

15:00 Evaluation and Q&A

183 IEEE ICME2019

Social Events

ICME 2019 Reception

Time: 18:00 - 21:00, Monday, July 8th, 2019

Room: Pearl Hall (7F)

ICME 2019 Student Career Dinner

Time: 18:00 - 21:00, Wednesday, July 10th, 2018

Room: Grand Ballroom (7F)

ICME 2019 Banquet

Time: 18:00 - 21:00, Wednesday, July 10th, 2018

Room: Grand Ballroom (7F)

184 Side Meetings

Tuesday, July 9, 2019 Tuesday, July 9, 2019 Tuesday, July 9, 2019

Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM

Room: 5J Room: 5F Room: 5H

TC meeting 1 (TMM SC) TC meeting 2 (MMSP TC) TC meeting 3 (ICME SC)

Wednesday, July 10, 2019 Wednesday, July 10, 2019 Wednesday, July 10, 2019

Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM

Room: 5J Room: 5F Room: 5H

TC meeting 4 (TMM EB) TC meeting 5 (ComSoc MMTC) TC meeting 6 (TCMC)

Thursday, July 11, 2019 Thursday, July 11, 2019 Thursday, July 11, 2019

Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM Time: 12:30 - 14:00 PM

Room: 5J Room: 5F Room: 5H

TC meeting 7 TC meeting 8 (IEEE MM-MAG EB) TC meeting 9 (MSA TC) (ICME 2019/2020 OC)

185 IEEE ICME2019

Area Chairs

Pradeep K. Atrey Song Guo University at Albany, SUNY, USA The Hong Kong Polytechnic University, China

Cunjian Chen Jungong Han Michigan State University, USA Lancaster University, UK

Ngai Man Cheung Luis Herranz Singapore University of Technology and Design, Singapore Computer Vision Center, Spain

Peng Cui Richang Hong Tsinghua University, China Hefei University of Technology, China

Tasos Dagiuklas Wolfgang Hürst London South Bank University, UK Utrecht University, Netherlands

Guiguang Ding Wen Ji Tsinghua University, China Institute of Computing Technology, Chinese Academy of Scienc- es, China

Ming Dong Cheolkon Jung Wayne State University Xidian University, China

Lingyu Duan Andre Kaup Peking University, China Friedrich-Alexander-Universität, Germany

Frederic Dufaux Patrick Le Callet Centre national de la recherche scientifique, France Universite de Nantes, France

Lu Fang Ge Li Tsinghua University, China Peking University, China

Jianlong Fu Houqiang Li Microsoft Research, USA University of Science and Technology of China, China

Chuang Gan Wanqing Li MIT-Watson AI Lab, USA University of Wollongong, Australia

Yue Gao Xiaolin Li Tsinghua University, China University of Florida, USA

Jingming Guo Weiyao Lin National Taiwan University of Science and Technology,Taiwan Shanghai Jiaotong University, China

186 Jianquan Liu Ju Shen NEC Corporation, Japan Department of Computer Science, University of Dayton, USA

Wu Liu Hailin Shi AI Research of JD.com, China JD AI Research, China

Jiwen Lu Vladimir Stankovic Tsinghua University, China University of Strathclyde, UK

Yan Lu Jinhui Tang Microsoft Research Asia, China Nanjing University of Science and Technology, China

Bo Luo Jelena Tešić University of Kansas, USA Texas State University, USA

Sanjeev Mehrotra Yonghong Tian Microsoft, USA Peking University, China

Jingjing Meng Farzad Toutounchi State University of New York at Buffalo, USA Queen Mary University of London, UK

Yuxin Peng Vladan Velisavljevic Peking University, China University of Bedfordshire, UK

Marius Preda Ruiping Wang Université Paris-Sud,France ICT, CAS, China

GuoJun Qi Wei Wang Huawei Cloud, China Aginome Scientific, Singapore

GuoJun Qi Xinchao Wang Huawei Cloud, China Stevens Institute of Technology, USA

Rajiv Ratn Shah Xin Jing Wang Indraprastha Institute of Information Technology, India Wework, USA

Amy R. Reibman Zhangyang Wang Purdue University, India Texas A&M University,USA

Jian Ren Mathias Wien Michigan State University, USA RWTH Aachen University, Germany

Yong Man Ro Dalei Wu Korea Advanced Institute of Science and Technology, Korea University of Technology of Compiegne, France

Sebastian Schwarz Shaoen Wu Nokia, Finland Ball State University, USA

187 IEEE ICME2019

Feng Xue Chengcui Zhang Hefei University of Technology, China The University of Alabama at Birmingham, USA

Chenggang Yan Hanwang Zhang Hangzhou Dianzi University, China Nanyang Technological University, Singapore

Qing Yang Qianni Zhang University of North Texas, USA Queen Mary University of London, UK

Wenxian Yang Sicheng Zhao Aginome Scientific, Singapore University of California Berkeley, USA

Yi Yang Liang Zheng University of Technology Sydney, Australia Australian National University, Australia

Ting Yao Ce Zhu JD AI Research, China University of Electronic Science & Technology of China, China

Cha Zhang Microsoft Research, USA

188 Technical Program Committee Members

Milad Abdollahzadeh Saverio Blasi Guangyi Chen Jaeyoung Choi

Mona Abid Du Bo Haoming Chen Kyoung-Ho Choi

Velibor Adzic Erik Bochinski Homer Chen Taelim Choi

Mariana Afonso Maksim Bolonkin Jingjing Chen Yoojin Choi

Luciano Volcan Agostini Marc Bosch Jun-Cheng Chen Hang Chu

Mohammad Faizal Ahmad Fauzi Imed Bouazizi Shixing Chen Lingyang Chu

Syed Hassan Ahmed Catarina Brites Shu-Ching Chen Wei-Ta Chu

Ali Ak Matthew Broadbent Si-Bao Chen Yung-Yu Chuang

Anique Akhtar Michael S Brown Toly Chen Stelvio Cimato

Hasan Al Marzouqi Michele Buccoli Wei Chen Claudiu Cobarzan

Ghassan Alregib Yujun Cai Wei-Bang Chen Giulio Coluccia

Laurent Amsaleg Roberto Caldelli Wuyang Chen Pedro Comesana-Alfaro

Gerasimos Arvanitis Shaun Canavan Xi Stephen Chen Antoine Coutrot

Joao Ascenso K. Selçuk Candan Xin Chen Dubravko Culibrk

Pedro A. Assuncao Kai Cao Xing Chen Eduardo A. B. Da Silva

Christoph Bachhuber Stefania Cecchi Yuli Chen Luis A Da Silva Cruz

Tom Bäckström Zhenhua Chai Zebin Chen Qi Dai

Chongyang Bai Chee Seng Chan Zhibo Chen Xiyang Dai

Yan Bai Din-Yuen Chan Zhineng Chen Antitza Dantcheva

Anant Baijal Chuan-Wang Chang Zhixiang Chen Mohamed Daoudi

Werner Bailer Chun-Fa Chang Bowen Cheng Carl J Debono

Ivan Bajic Yakun Chang Juntong Cheng Alessio Degani

Yukihiro Bandoh Yao-Jen Chang Shyi-Chyi Cheng Carlos Roberto Del Blanco

Bingkun Bao Marc Chaumont Wen-Huang Cheng Weijian Deng

Qian Bao Berlin Chen Anoop Cherian Weijian Deng

Tom Bashford-Rogers Bo-Wei Chen Boon-Seng Chew Mohamed Deriche

Jordi Mongay Batalla Chen Chen Jui-Chiu Chiang Jian-Jiun Ding

Jenny Benois-Pineau Chun-Chi Chen Jen-Tzung Chien Yu Ding

Marco Bertini Chun-Fu Chen Chih-Yi Chiu Zewei Ding

Zhenpeng Bian Dongdong Chen Nam Ik Cho Jana Dittmann

Tiziano Bianchi Francine Chen Hyomin Choi Marek Domański

189 IEEE ICME2019

Haoye Dong Han Gao Raouf Hamzaoui Lei Huang

Tingting Dong Wei Gao Hong Han Likun Huang

Dejing Dou Yuan Gao Shizhong Han Meiyu Huang

Pengfei Dou Liuhao Ge Xian-Hua Han Rui Huang

Shaoyi Du Shiming Ge Yahong Han Tsung-Wei Huang

Jiali Duan Yongxin Ge Renlong Hang Xiaofeng Huang

Yueqi Duan Guoping Gong Zongbo Hao Xiaohua Huang

Abhimanyu Dubey Mingming Gong Choochart Haruechaiyasak Xiaoming Huang

Pinar Duygulu Xinyu Gong Carlo Harvey Xin Huang

Isao Echizen Jianping Gou Mahmoud Reza Hashemi Yue Huang

Peter Eisert Marco Grangetto Devamanyu Hazarika Tzu-Yi Hung

Hazim KEMAL Ekenel Dan Grois Li He Jenq-Neng Hwang

Engin Erzin William I. Grosky Liang He Ichiro Ide

Ralph Ewerth Guanghua Gu Xiangnan He Tomohiro Ikai

Baojie Fan Renshu Gu Yuwen He Bogdan Ionescu

Jianping Fan Jian Guan Shintami C. Hidayati Maria Silvia Ito

Liyue Fan Valia Guerra Ones Lyndon Hill I-Hong Jhuo

Yuchen Fan Hongxing Guo Tuan Hoang Nguyen Anh Rongrong Ji

Zhipeng Fan Jingcai Guo Xiaopeng Hong Wen Ji

Zhiwen Fan Jingcai Guo Mohammad Hosseini Wen Ji

Leyuan Fang Jingda Guo Mohammad Hosseini Chuanmin Jia

Sergio M Faria Shuaishuai Guo Guanqun Hou Wenjing Jia

Reuben Farrugia Song Guo Junhui Hou Meng Jian

Attilio Fiandrotti Xun Guo Li Hou Junjun Jiang

Karel Fliegel Yiluan Guo Cheng-Hsin Hsu Tingting Jiang

Jingjing Fu Yiwen Guo Han Hu Wenbin Jiang

Qingtao Fu Yuanfang Guo Haoji Hu Xi Jiang

Xianping Fu Yuchen Guo Junlin Hu Xiaoyan Jiang

Takahiko Furuya Zongyu Guo Min-Chun Hu Jiren Jin

Neeraj Gadgil Zongyu Guo Tao Hu Xin Jin

Ji Gan Chitralekha Gupta Wei Hu Rolf Jongebloed

Tian Gan Cathal Gurrin Chih-Wei Huang Brendan Jou

Ernest D Ganaa Jesús Gutiérrez Huaxi Huang Kashyap K.R. Kam- bhatla Guanyu Gao Paul Haimes Kan Huang Li-Wei Kang

190 Xiangui Kang Chi-Chun Lee Dalton Lin Miaomiao Lou

Akankshya Kar Hyowon Lee Hsueh-Yi Lin Chun-Shien Lu

Kasun Karunanayaka Sanghoon Lee Jianxin Lin Guoyu Lu

Mohamed Abosaief Kassab Chuankun Li Jingqiang Lin Shao-Ping Lu

Birendra Kathariya Gaoling Li Wei-Yang Lin Yao Lu

Angeliki Katsenou Hongyan Li Suiyi Ling Yue Lu

Marie Katsurai Hongzhi Li Bei Liu Lannan Luo

Mohammad Kazemi Jianwu Li Bo Liu Wenfeng Luo

Naimul Mefraz Khan Jing Li Ding Liu Yong Luo

Changick Kim Jingjing Li Guangchi Liu Yong Luo

Changick Kim Kristen Li Hantao Liu Ryan Lustig

Han-Ul Kim Leida Li Hao Liu Bingpeng Ma

Jongyoo Kim Li Li Haomiao Liu Chongyang Ma

Woojae Kim Liang Li Jiankun Liu Fei Ma

Yeong Jun Koh Lianqiang Li Jingen Liu He Ma

Stefanos Kollias Lin Li Kuan-Hsien Liu Kede Ma

Jan Koloda Lizhong Li Kun Liu Qiang Ma

Takahiro Komamizu Shaozi Li Lingbo Liu Shiheng Ma

Jari Korhonen Shuai Li Peng Liu Siwei Ma

Harald Kosch Shuangqun Li Ping Liu Zhan Ma

Lukas Krasula Shujun Li Qiegen Liu Zhanyu Ma

Gosala Kulupana Site Li Rui Liu Debanjan Mahata

Anurag Kumar Teng Li Ruixu Liu Guangcan Mai

Yaman Kumar Wanhua Li Sheng Liu Emanuele Maiorana

Minoru Kuribayashi Xu Li Tsung-Jung Liu Qirong Mao

Jui-Hsin Lai Yehao Li Weifeng Liu Manuel J. Marín-Jiménez

Yu-Kun Lai Yue Li Xinchen Liu Manuel Martinello

Shang-Hong Lai Yuxi Li Xueliang Liu Marc Masana

Aris Lalos Zhengguo Li Yi Liu Reji Mathew

Long Lan Zhi Li Yinglu Liu Seksan Mathulaprangsan

Xiangyuan Lan Zhuoran Li Yu-Shen Liu Puneet Mathur

Jochen Lang Chia-Kai Liang Chengjiang Long Shaohui Mei

Chaker Larabi Haoyi Liang Zhiling Long Hardik Meisheri

Bowon Lee Chunze Lin Yi Loo Rufael Mekuria

191 IEEE ICME2019

Hongying Meng Dilruk Perera Shin'Ichi Satoh A Subramanyam

Zibo Meng Cristian Perra Peter Schelkens Chang Sun

Olivier Meur Matthieu Perreira Da Sil- Klaus Schoeffmann Heming Sun va Vasileios Mezaris John See Jiande Sun Antonio Pinheiro Qiguang Miao Mustafa Sert Lifeng Sun William J.-P. Puech Zhenjiang Miao Jie Shao Xiaoxiao Sun Fei Qi Simone Milani Bo Shen Yangfan Sun Lei Qi Vahid Mirjalili Jianghao Shen Yibao Sun Na Qi Philipp Moll Jie Shen Wei lian Suo Xiaojun Qi Marie-Jose Montpetit Jun Shen Thomas Swearingen Buyue Qian Yuta Nakashima Liyue Shen Seishi Takamura Xueming Qian Aous T. Naman Roger Shen Mengxuan Tan Linbo Qing Manish Narwaria Rui Shen Robby T. Tan Jiayan Qiu Ambarish Natu Yeji Shen Chang Tang Kai Qiu Hung Nguyen Boxin Shi Chih-Wei Tang Zhaofan Qiu Nhu Q Nguyen Haichao Shi Sheng Tang Maria Paula Queluz Weizhi Nie Shu Shi Youbao Tang Georges Quénot Xiushan Nie Yuxuan Shi Zheng Tang Saimunur Rahman Nikos Nikolaidis Huang-Chia Shih Georg Thallinger Aakanksha Rana Naoko Nitta Masato Shirai Nikolaos Thomos Yongming Rao Paulo Nunes Carlos N Silla Lei Tian Yogesh Rawat Makoto Okabe Jae-Young Sim Christian Timmerer Bappaditya Ray Vincent Oria Ashutosh Singla Ngoc-Trung Tran Kui Ren Yingwei Pan Luis Soares Juan Ramón Troncoso Pastoriza Liangliang Ren Wai Man Raymond Pang Faranak Sobhani Ngoc Trung Nuno Rodrigues Shivam Parikh Houbing Song Chia-Ming Tsai Hoda Roodaki Shashikant Patil Li Song Sik-Ho Tsang Nina Rosa Vikram Patil Qing Song Pei-Kuei Tsung Sankarasrinivasan S Houwen Peng Sibo Song Stefano Tubaro Mukesh Saini Wen-Hsiao Peng Eckehard Steinbach Nkiruka Uzuegbunam Dimitrios Sakkos Xi Peng Haakon K Stensland Giuseppev Valenzise Enrique Sánchez-Lozano Xiulian Peng Guan-Ming Su Avinash Varna Maria Santamaria Yan-Tsung Peng Haonan Su Stefanos Vrochidis Nabil Sarhan Yuxin Peng Yong Su Ji Wan Andrej Satnik

192 Jun Wan Zhen Wang Hongteng Xu Yasin Yazici

Bin Wang Zheng Wang Jianfeng Xu Mao Ye

Gaoang Wang Zhenzhen Wang Xiaozhong Xu Minxiang Ye

Hongxing Wang Zhiyong Wang Yiling Xu Xinchen Ye

Hsin-Min Wang Zhongdao Wang Yuanlu Xu Yun Ye

Huogen Wang Zhongyuan Wang Yuhui Xu Xi Yin

Jianfeng Wang Ziwei Wang Zengmin Xu Yifang Yin

Jiangping Wang Shikui Wei Zongyi Xu Zhenqiang Ying

Jinglu Wang Wei Wei Feng Xue Jianming Yong

Lizhi Wang Xiu-Shen Wei Jing-Hao Xue Satoshi Yoshida

Mea Wang Yingcan Wei Takehiro Yamamoto Atsuo Yoshitaka

Nannan Wang Bihan Wen Toshihiko Yamasaki Ilsun You

Pichao Wang Wei Wen Bo Yan Biting Yu

Qifei Wang Xin Wen Haibin Yan Dongfei Yu

Qin Wang Chaoqun Weng Jun Yan Haichao Yu

Shangfei Wang Fangyu Wu Xiyu Yan Jiahui Yu

Shanshe Wang Gengshen Wu Keiji Yanai Junqing Yu

Shaojie Wang Hefeng Wu Cheng Yang Mali Yu

Shuhui Wang Jinjian Wu Fei Yang Shengtao Yu

Sicheng Wang Junru Wu Fuzheng Yang Tan Yu

Song Wang Wei Wu Jufeng Yang Xiyu Yu

Suyu Wang Xiao Wu Meng Yang Yi Yu

Tinghuai Wang Yuhang Wu Qize Yang Jianbo Yuan

Xiangyu Wang Yuwei Wu Shuai Yang Ye Yuan

Xiaobo Wang Sen Xiang Wenhan Yang Huanjing Yue

Xingzheng Wang Chunxia Xiao Wenmian Yang Inyong Yun

Yaxing Wang Jing Xiao Xiaopeng Yang Pietro Zanuttigh

Yizhou Wang Xiaohua Xie Yang Yang Huanqiang Zeng

Yong Wang Zhihuai Xie Yiding Yang Liaoyuan Zeng

Yong Wang Qi Xin Yi-Hsuan Yang Qiang Zeng

Yongtao Wang Junliang Xing Yujiu Yang Yi-Chong Zeng

Yu Wang Jinbo Xiong Zhengyuan Yang Zhiyuan Zha

Yuantian Wang Baohan Xu Hantao Yao Liming Zhai

Yuanyuan Wang Chang Xu Kim Yap Baochang Zhang

193 IEEE ICME2019

Cheng Zhang Xiangrong Zhang H. Vicky Zhao Chunluan Zhou

Dejun Zhang Xiangrong Zhang Pinghua Zhao Jun Zhou

Fan Zhang Xue Zhang Sicheng Zhao Lijuan Zhou

Fan Zhang Yaping Zhang Tiesong Zhao Wei Zhou

Guigang Zhang Yi Zhang Xibin Zhao Wengang Zhou

Guofeng Zhang Yingxue Zhang Ziping Zhao Xiaolong Zhou

Junjie Zhang Yongbing Zhang Cairong Zhao Xiuzhuang Zhou

Ke Zhang Yuan Zhang Heliang Zheng Yipeng Zhou

Lefei Zhang Zhaobin Zhang Huiru Zheng Zhi Zhou

Lei Zhang Zhao-Xiang Zhang Wei-Shi Zheng Guibo Zhu

Ning Zhang Zhendong Zhang Wenzhao Zheng Lingyu Zhu

Shiliang Zhang Zheng Zhang Xiaozhen Zheng Yanjun Zhu

Tianzhu Zhang Zhizheng Zhang Zhedong Zheng Yingying Zhu

Wei Zhang Zhongyan Zhang Chen Zhong Peixian Zhuang

Weiming Zhang Baoquan Zhao Sheng-Hua Zhong Jeffrey Zou

Weitong Zhang Bin Zhao Zhun Zhong Ivan Zupancic

194 Sponsors

195 IEEE ICME2019

Organizational Sponsors

196 Whova Event App User Tutorial Get Most out of Your Event

How to Download the Whova App

The Whova event app is for free for event attendees. To download the app, please follow the step below: IOS: open up the Apple Store on your mobile device and search for 'Whova'. Android: open up the Google Play and search for 'Whova' or scan the QR code.

(You can click the left gray button to download the Whova App)

(just for Android)

How to Join the Meeting

1.Enter the email address you used for event registration or use your social media account.

2.After logging in, you can search 'ICME' for your event.

3.Then click the join button and enter the event invitation code: iicie

How to Vote

On Wednesday 10, July, there is a Star Innovator Session and you can vote for the finalists from 11:00am to 5:45pm. You can find the voting information in 'Messages', the activity will be shown at the top.

Name: ICME2019 Password: icme2019

197