IEEE ICME2019

Table of Contents

Guide Map
Schedule at a Glance
  Monday, July 8, 2019
  Tuesday, July 9, 2019
  Wednesday, July 10, 2019
  Thursday, July 11, 2019
  Friday, July 12, 2019
Welcome Message from the General Chairs
Welcome Message from the Technical Program Committee Chairs
Organizing Committee
Keynote
  K-01: Neural Circuit Plasticity: From Brain Research to Machine Learning and Back
  K-02: AI Ethics: From Principles to Practices
  K-03: Multimedia Driven Precise Medicine
Academic Panel
  Towards an Excellent Academic Career
Industry Panel
  From Papers to Products: Bridging the Gap between Multimedia Research and Practical Applications
Multimedia Rising Star Panel
Multimedia Star Innovators
  Multimedia Star Innovator Keynote Highlights
  Multimedia Star Innovator Keynotes
Grand Challenges
  Grand Challenge Highlights
  G-01: Short Video Understanding Challenge -- Recommending All You Want to See
  G-02: Grand Challenges of 106-p Facial Landmark Localization
  G-03: Learning-Based Image Inpainting
  G-04: Saliency4ASD: Visual attention modeling for Autism Spectrum Disorder
Tutorials
  T-01: Big Data Intelligence: From Correlation Discovery to Causal Reasoning
  T-02: Human Behavior Understanding: From Human-Oriented Analysis to Action Recognition
  T-03: Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning
  T-04: Visual Search and Question Answering
  T-05: Object Detection Beyond Mask R-CNN and RetinaNet
  T-06: Computer Vision for Transportation
  T-07: Causally Regularized Machine Learning
  T-08: Architecture Design for Deep Neural Networks
  T-09: Intelligent Multimedia Recommendation
Oral Sessions
  Best Paper Session
  O-01: Content Recommendation and Cross-modal Hashing
  O-02: Development of Multimedia Standards and Related Research
  O-03: Classification and Low Shot Learning
  O-04: 3D Media Computing
  O-05: Special Session "Pedestrian Detection, Tracking and Re-identification in Videos"
  O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
  O-07: 3D and Low Level Vision
  O-08: Object Detection I
  O-09: Emerging Applications of Deep Learning
  O-10: Multimedia Quality Assessment and Enhancement
  O-11: Multimedia for Society and Health
  O-12: Immersive Media
  O-13: 3D and Stereo Computing
  O-14: Machine Learning Applications in Image and Video Coding I
  O-15: Vision, Language and Text Processing
  O-16: Media Classification and Segmentation II
  O-17: AI for Human Understanding
  O-18: Image Quality Metrics
  O-19: Multimedia Recommendations
  O-20: Search and Retrieval
  O-21: Media Understanding
  O-22: Super-resolution and Enhancement
  O-23: Pose and Action Recognition II
  O-24: Image and Video Enhancements I
  O-25: Face and Person Analysis
  O-26: Media Classification and Segmentation III
  O-27: Image and Video Enhancements II
  O-28: Multimedia Learning and Adaptation
  O-29: Person (Re-)Identification and People Detection
  O-30: Multimedia and Language II
  O-31: Multimedia Communications and Localization
  O-32: Multimedia Security, Privacy and Forensics II
  O-33: Multimedia Sensing and Signal Processing
  O-34: Detection and Recognition
  O-35: Multi-modal Media Computing and Human-machine Interaction
Industry Track
Poster Sessions
  Poster Session 1 & TMM Poster
    P-01: Emerging Multimedia Applications and Technologies
    P-02: Media Classification and Segmentation I
    P-03: Oral-05 to Oral-12
    TMM Poster
  Poster Session 2
    P-04: Multimedia Analysis, Search and Recommendation
    P-05: Pose and Action Recognition I
    P-06: Person and Emotion Understanding
    P-07: Best Papers and Oral-01 to Oral-04
  Poster Session 3 & Demo Session 1
    P-08: Multimedia Creation and Enhancement
    P-09: Multimedia and Vision I
    P-10: Oral-17 to Oral-24
    Demo Session 1
  Poster Session 4 & Demo Session 2
    P-11: Multimedia and Language I
    P-12: Advances in Artificial Intelligence
    P-13: Multimedia Security, Privacy and Forensics I
    P-14: Machine Learning Applications in Image and Video Coding II
    P-15: Multimedia and Vision II
    P-16: Oral-13 to Oral-16
    Demo Session 2
  Poster Session 5 & Grand Challenge
    P-17: Multimedia Understanding and Mixed Reality
    P-18: Media Classification and Segmentation IV
    P-19: Oral-29 to Oral-35
    Grand Challenge
  Poster Session 6
    P-20: Multimedia Communications, Networking and Mobility
    P-21: Object Detection II
    P-22: Artificial Intelligence for Multimedia
    P-23: Multimedia Quality Assessment and Metrics
    P-24: Oral-25 to Oral-28
Workshops
  W-01: Multimedia Services and Technologies for Smart-health (MUST-SH)
  W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)
  W-03: Visual Emotion Analysis: Theories and Applications
  W-04: 1st International Workshop on Big Surveillance Data Analysis and Processing
  W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car
  W-06: Information Theory and Multimedia Computing (ITMC)
  W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)
  W-08: Time-sequenced Multimedia Computing
  W-09: Smart Camera (Gigavision)
  W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding
  W-11: AI Technology for Visual Fashion Computing
  W-12: 2nd IEEE International Workshop on Faces in Multimedia (FacesMM)
  W-13: The Third Workshop on Human Identification in Multimedia (HIM)
Student Program
  Student Career Lunch
  3MT Competition
Social Events
  ICME 2019 Reception
  ICME 2019 Student Career Dinner
  ICME 2019 Banquet
Side Meetings
Area Chairs
Technical Program Committee Members
Sponsors
Organizational Sponsors
Whova Event App User Tutorial
Guide Map

Schedule at a Glance
T: Tutorial; W: Workshop; K: Keynote; O: Oral; P: Poster; G: Grand Challenges
Posters of papers presented in the oral sessions of the regular program will be presented on the same day in one of the poster sessions.
Monday, July 8, 2019
Rooms: 3E, 3G, 5A, 5F, 5H, 5I, 5J

8:30   Morning sessions
         3E: T-01: Big Data Intelligence - From Correlation Discovery to Causal Reasoning
         3G: W-01: Multimedia Services and Technologies for Smart-health (MUST-SH)
         5A: T-03: Intelligent Image Enhancement and Restoration - from Prior Driven Model to Advanced Deep Learning
         5F: T-05: Object Detection Beyond Mask R-CNN and RetinaNet
         5H: W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)
         5I: W-04: 1st International Workshop on Big Surveillance Data Analysis and Processing
         5J: W-06: Information Theory and Multimedia Computing (ITMC)
10:00  Coffee break - Meeting room Foyer (5F)
10:30  Morning sessions continue (same sessions and rooms as at 8:30)
12:00  Lunch Time
13:30  Afternoon sessions
         3E: T-02: Human Behavior Understanding: From Human-Oriented Analysis to Action Recognition
         3G: T-06: Computer Vision for Transportation
         5A: W-01: Multimedia Services and Technologies for Smart-health (MUST-SH)
         5F: W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car
         5H: T-04: Visual Search and Question Answering
         5I: W-03: Visual Emotion Analysis: Theories and Applications
         5J: W-06: Information Theory and Multimedia Computing (ITMC)
15:00  Coffee break - Meeting room Foyer (5F)
15:30  Afternoon sessions continue (same sessions and rooms as at 13:30)
17:00
18:00  Welcome Reception - Pearl Hall (7F)
21:00  End of day
Tuesday, July 9, 2019

Time slots: 8:15, 8:30, 9:30, 10:00, 11:00, 11:15, 12:30, 13:30, 14:00, 15:00, 15:30, 16:30, 16:45, 17:00, 17:45

Auditorium 3F
  Opening
  K-01: Neural Circuit Plasticity: From Brain Research to Machine Learning and Back
  Academic Panel: Towards an Excellent Academic Career
  Best Paper Session
5J: TC meeting 1 (TMM SC)
5F: TC meeting 2 (MMSP TC)
5H: TC meeting 3 (ICME SC)
Grand Ballroom (7F): Buffet Lunch
3CD
  O-01: Content Recommendation and Cross-modal Hashing
  O-05: Special Session "Pedestrian Detection, Tracking and Re-identification in Videos"
  O-09: Emerging Applications of Deep Learning
3HI
  O-02: Development of Multimedia Standards and Related Research
  O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
  O-10: Multimedia Quality Assessment and Enhancement
5BC
  O-03: Classification and Low Shot Learning
  O-07: 3D and Low Level Vision
  O-11: Multimedia for Society and Health
5DE
  O-04: 3D Media Computing
  O-08: Object Detection I
  O-12: Immersive Media
3rd Floor
  Poster Session 1 & TMM Poster: P-01: Emerging MM Applications and Technologies; P-02: Media Classification and Segmentation I; P-03: O-05 to O-12; TMM Poster
  Poster Session 2: P-04: MM Analysis, Search and Recommendation; P-05: Pose and Action Recognition I; P-06: Person and Emotion Understanding; P-07: Best Papers and O-01 to O-04
Coffee breaks: Meeting room Foyer (3F)
End of day
Wednesday, July 10, 2019

Time slots: 8:30, 9:30, 10:00, 11:00, 11:15, 12:30, 13:30, 14:00, 15:00, 15:30, 16:30, 16:45, 17:00, 17:30, 17:45, 18:00, 21:00

Auditorium 3F
  K-02: AI Ethics: From Principles To Practices
  Industry Panel: From Papers to Products: Bridging the Gap between Multimedia Research and Practical Applications
  Multimedia Star Innovator Keynote Highlights
5J: TC meeting 4 (TMM EB)
5F: TC meeting 5 (ComSoc MMTC)
5H: TC meeting 6 (TCMC)
5I
  Student Program: Student Career Lunch
  Student Program: 3MT Competition
Grand Ballroom (7F): Buffet Lunch
3B
  Multimedia Star Innovator Keynotes
  Industry Track
3CD
  O-13: 3D and Stereo Computing
  O-17: AI for Human Understanding
  O-21: Media Understanding
3HI
  O-14: Machine Learning Applications in Image and Video Coding I
  O-18: Image Quality Metrics
  O-22: Super-resolution and Enhancement
5BC
  O-15: Vision, Language and Text Processing
  O-19: Multimedia Recommendations
  O-23: Pose and Action Recognition II
5DE
  O-16: Media Classification and Segmentation II
  O-20: Search and Retrieval
  O-24: Image and Video Enhancements I
3rd Floor
  Poster Session 3 & Demo Session 1: P-08: MM Creation and Enhancement; P-09: MM and Vision I; P-10: O-17 to O-24; Demo Session 1
  Poster Session 4 & Demo Session 2: P-11: MM and Language I; P-12: Advances in AI; P-13: MM Security, Privacy and Forensics I; P-14: ML Applications in Image and Video Coding II; P-15: MM and Vision II; P-16: O-13 to O-16; Demo Session 2
Coffee breaks: Meeting room Foyer (3F)
Gala Banquet & Student Career Dinner - Grand Ballroom (7F)
End of day
Thursday, July 11, 2019

Time slots: 8:30, 9:30, 10:00, 11:00, 11:15, 12:30, 13:30, 14:00, 15:00, 15:30, 16:30, 16:45, 17:00, 17:45

Auditorium 3F
  K-03: Multimedia Driven Precise Medicine
  Multimedia Rising Star Panel
  Grand Challenge Highlights
5J: TC meeting 7 (ICME 2019/2020 OC)
5F: TC meeting 8 (IEEE MM-MAG EB)
5H: TC meeting 9 (MSA TC)
Grand Ballroom (7F): Buffet Lunch
3B
  G-01: Short Video Understanding Challenge -- Recommending All You Want to See
  G-02: Grand Challenges of 106-p Facial Landmark Localization
  G-04: Saliency4ASD: Visual Attention Modeling for Autism Spectrum Disorder
3CD
  O-25: Face and Person Analysis
  O-29: Person (Re-)Identification and People Detection
  G-03: Learning-Based Image Inpainting
3HI
  O-26: Media Classification and Segmentation III
  O-30: Multimedia and Language II
  O-33: Multimedia Sensing and Signal Processing
5BC
  O-27: Image and Video Enhancements II
  O-31: Multimedia Communications and Localization
  O-34: Detection and Recognition
5DE
  O-28: Multimedia Learning and Adaptation
  O-32: Multimedia Security, Privacy and Forensics II
  O-35: Multi-modal Media Computing and Human-machine Interaction
3rd Floor
  Poster Session 5 & Grand Challenge: P-17: MM Understanding and Mixed Reality; P-18: Media Classification and Segmentation IV; P-19: O-29 to O-35; Grand Challenge
  Poster Session 6: P-20: MM Communications, Networking and Mobility; P-21: Object Detection II; P-22: AI for MM; P-23: MM Quality Assessment and Metrics; P-24: O-25 to O-28
Coffee breaks: Meeting room Foyer (3F)
End of day
Friday, July 12, 2019

Time slots: 8:30, 10:00, 10:30, 12:00, 13:30, 15:00, 15:30, 18:00

5BC
  T-07: Causally Regularized Machine Learning
5DE
  T-08: Architecture Design for Deep Neural Networks
  T-09: Intelligent Multimedia Recommendation
5F
  W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)
  W-08: Time-sequenced Multimedia Computing
5I
  W-09: Smart camera (Gigavision)
  W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding
5J
  W-11: AI Technology for Visual Fashion Computing
  W-12: 2nd IEEE International Workshop on Faces in Multimedia (FacesMM)
  W-13: The Third Workshop on Human Identification in Multimedia (HIM)
Coffee breaks: Meeting room Foyer (5F)
Lunch
End of Conference
Welcome Message from the General Chairs
On behalf of the Organizing Committee, it is our great pleasure to welcome you to the 2019 IEEE International Conference on Multimedia and Expo (ICME 2019) and the beautiful city of Shanghai. Shanghai is the financial center of China and a popular tourist destination renowned for its historical landmarks such as The Bund, City God Temple and Yu Garden, as well as the extensive and growing Lujiazui skyline. It has been a real honor and privilege to serve as the General Chairs of this conference. Since 2000, ICME has been the flagship multimedia conference sponsored by four IEEE societies: Circuits and Systems, Communications, Computer, and Signal Processing. It serves as a premier forum to promote the exchange of the latest advances in multimedia technologies, systems, and applications from both the research and development perspectives of the four research communities.
ICME 2019 will enable you to enjoy an outstanding program, exchange your ideas with leading researchers in various disciplines of multimedia, and make new friends in the international science community. Some highlights include three Keynote talks on the latest exciting topics of multimedia, ranging from the fundamental topic of human brain analysis to the fast-growing artificial intelligence (AI) applications; a wide range of tutorials and workshops; the best paper session; academic and industry panel discussions; the newly established multimedia star innovator award with its highlight presentations and keynotes; four grand challenges with over 1,000 participants; industrial programs with very exciting demonstrations and talks; a student-focused program; and other exciting technical and social events. The Technical Program Chairs, Marta Mrak (BBC R&D, UK), Jun Wu (Tongji University, China), Zhu Li (University of Missouri, Kansas City, USA) representing the IEEE Signal Processing Society Multimedia Signal Processing Technical Committee (MMSP), Honggang Wang (University of Massachusetts Dartmouth, USA) representing the IEEE Communications Society Multimedia Communications Technical Committee (MMTC), Lei Zhang (Microsoft, USA) representing the IEEE Circuits and Systems Society Multimedia Systems & Applications Technical Committee (MSA), and Roger Zimmermann (National University of Singapore, Singapore) representing the IEEE Computer Society Technical Committee on Multimedia Computing (TCMC), put tremendous effort into creating an exciting program composed of about one third of the roughly 1,000 submitted papers.
Many individuals and organizations contributed to the success of this conference. We would like to acknowledge the efforts of the Panel Session Chairs, Chang Wen Chen (CUHK-SZ, China / SUNY-Buffalo, USA), Chia-Wen Lin (National Tsing Hua University, Taiwan) and Fernando Pereira (Instituto Superior Técnico, Portugal); the Workshop Chairs, Susanne Boll (University of Oldenburg, Germany), Jingdong Wang (Microsoft Research Asia, China) and Z. Jane Wang (University of British Columbia, Canada); the Tutorial Chairs, Jiebo Luo (University of Rochester, USA) and Zheng-Jun Zha (University of Science and Technology of China, China); the Special Session Chairs, Junwei Han (Northwestern Polytechnical University, China) and Enrico Magli (Politecnico di Torino, Italy); the Grand Challenge Chairs, Gene Cheung (York University, Canada) and Jiaying Liu (Peking University, China); the Award Chairs, Mei-Ling Shyu (University of Miami, USA) and Yonggang Wen (Nanyang Technological University, Singapore); the Industrial Program Chairs, Liang Lin (Sun Yat-Sen University, China), Chonggang Wang (InterDigital, USA) and Xiaoqing Zhu (Cisco, USA); the Student Program Chairs, Weiyao Lin (Shanghai Jiao Tong University, China), Xiaoyan Sun (Microsoft Research Asia, China) and Shaoen Wu (Ball State University, USA); the Demo Chairs, Yu-Gang Jiang (Fudan University, China), Cong Shen (University of Science and Technology of China, China) and Dong Tian (InterDigital, USA); the Web Chairs, Wu Liu (JD AI Research, China) and Dalei Wu (University of Tennessee, USA). Together with the Technical Program Committee, they worked diligently to select papers and speakers that met the criteria of high quality and relevance to various fields within the scope of IEEE ICME. It takes time and effort to review a paper carefully, and every member of the Technical Program Committee is to be commended for his or her contribution to the success of this conference.
The TPC chairs selected six papers as candidates for the Best Paper Award and these were submitted to the Award Committee and will be presented in the Best Paper Session. The winners will be announced during the banquet of ICME 2019 in Shanghai.
We would like to further extend our appreciation to the Local Chairs, Chong Luo (Microsoft Research Asia, China), Hanli Wang (Tongji University, China) and Dan Zeng (Shanghai University, China); the Sponsorship Chairs, Le Dong (University of Electronic Science and Technology of China, China), Nian Tong (University of Science and Technology of China, China), Junsong Yuan (State University of New York, Buffalo, USA) and Yongdong Zhang (University of Science and Technology of China, China); the Publication Chairs, Qi Tian (University of Texas at San Antonio, USA), Rui Wang (Tongji University, China) and Jian Zhang (University of Technology Sydney, Australia); the Publicity Chairs, Wen-Huang Cheng (National Chiao Tung University, Taiwan), Richang Hong (Hefei University of Technology, China), Shiwen Mao (Auburn University, USA) and Shui Yu (University of Technology Sydney, Australia); the Finance Chairs, Chengcui Zhang (University of Alabama at Birmingham, USA) and Dongdong Zhang (Tongji University, China); and the Registration Chairs, Dong Liu (University of Science and Technology of China, China), Haoqi Ren (Tongji University, China) and Liquan Shen (Shanghai University, China).
The conference would not have been possible without the dedication and the hard work of all members of the Organizing Committee. In addition to members of the Organizing Committee, many volunteers have contributed to the success of the conference. Volunteers helped with editing this conference booklet, local arrangements, on-site setup, and many other important tasks. While it is difficult to list all their names here, we would like to take this opportunity to sincerely thank them all.
Special thanks to our keynote speakers, Nozha Boujemaa (MEDIAN Technologies, France), Mu-Ming Poo (Institute of Neuroscience, Chinese Academy of Sciences, China), and Harry Shum (Microsoft, USA). We greatly value their participation and look forward to their insightful vision and thoughts. Our thanks also go to all invited speakers for tutorials, panels, workshops, rising star forum, grand challenges, and hands-on expos.
We are very grateful to the academic panelists, Frederic Dufaux (Paris-Sud University, France), Abed El Saddik (University of Ottawa, Canada), Lina Karam (Arizona State University, USA), Jay Kuo (University of Southern California, USA), Yong Lian (University of Singapore, Singapore), Dapeng Wu (University of Florida, USA), and the industry panelists, Xinxin Gao (The Jiangmen, China), Nozha Boujemaa (MEDIAN Technologies, France), Xian-Sheng Hua (DAMO Academy/Alibaba Cloud, China), Wenjun Zeng (Microsoft Research Asia, China), Marta Mrak (BBC, UK), Qibin Sun (University of Science and Technology and Founder of Xietong Info-Tech Pte. Ltd). Special thanks to Chang Wen Chen (CUHK-SZ, China / SUNY-Buffalo, USA), who set up the selection procedure of the newly established Multimedia Rising Star Awards, and the panelists, Yi-Hsuan Yang (Academia Sinica, Taiwan), Jiwen Lu (Tsinghua University, China), Weiyao Lin (Shanghai Jiao Tong University, China), Lu Fang (Tsinghua University, China).
We are very grateful to members of the Multimedia Star Innovator Award Board, Alex Acero (Senior Director, Siri at Apple, USA), Nikhil Balram (Sr. Director of Engineering, AR/VR at Google, USA), Hanno Basse (CTO, 20th Century Fox, USA), Achin Bhowmik (CTO & Executive VP Engineering, Starkey Hearing Tech, USA), Nozha Boujemaa (Chief Science & Innovation Officer, Median Technologies, France), Baining Guo (Assistant Managing Director, Microsoft Research Asia, China), Ramesh Jain (Professor, UC Irvine, USA), Kevin Jou (CTO, MediaTek, China), Chuen-Chien Lee (SVP, Sony Corporation of America), Shipeng Li (VP, iFlyTek, China), Pieter J. Mosterman (Chief Scientist and Director, The Mathworks, USA), Yong Rui (CTO, Lenovo, China), Anthony Vetro (VP, MERL, USA), Susie Wee (SVP, CISCO, USA), and Bowen Zhou (VP, JD.com, China). The Multimedia Star Innovator Award was initiated this year to recognize pioneers of transformative technologies and business models in areas within the technical scope of IEEE ICME. The Award showcases innovations that have had great impact on human experiences with technology or are anticipated to do so in the near future. We received fourteen nominations, out of which four finalists were selected through voting by the Award Board. The four finalists will compete on-site at ICME 2019 for the Multimedia Star Innovator Award. Conference attendees can vote for the winner, and all votes must be cast before the banquet. The top-voted finalist will be announced at the banquet.
We are grateful for the strong support of the ICME Steering Committee, the four sponsoring societies and their respective Technical Committees. ICME is unique because of the joint support of these four societies, and we are honored to serve as general co-chairs for such a unique interdisciplinary conference. We would also like to thank our industrial sponsors, including Kuaishou, JD.COM, MEGVII, iQIYI, Microsoft, DiDi, Alibaba, Horizon Robotics, Baidu, Lenovo Research, Unisound, SenseTime, and The Jiangmen. Last but not least, we would like to extend our most sincere congratulations to all authors and speakers for a job well done. We would also like to acknowledge the exhibitors for their support and contributions to the ICME 2019 program.
We look forward to welcoming you in person in Shanghai and we hope that you will enjoy ICME 2019 and the beautiful summer of Shanghai!
General Chairs
Lina J. Karam
Arizona State University, USA
Tao Mei
AI Research of JD.COM, China
Feng Wu
University of Science and Tech of China, China
Welcome Message from the Technical Program Committee Chairs
On behalf of the ICME 2019 Technical Program Committee (TPC), we are delighted to welcome you to Shanghai! During its 20 years of history, ICME has presented advances in the field of multimedia from various IEEE research communities, and this year it reached record popularity as measured by the number of submissions to the main conference track. This has been achieved through the strong engagement of its four IEEE sponsors: the IEEE Signal Processing Society, IEEE Communications Society, IEEE Circuits and Systems Society and IEEE Computer Society.
9-11 July 2019 are the core days of this year’s conference. During these three days, the program of each morning is organized into a single track starting with a keynote talk from the most distinguished experts in our communities. Keynote talks will be followed by a comprehensive programme which includes sessions for Best Papers, the Rising Star Program, and many more. Each afternoon will be busy with numerous parallel sessions composed of accepted submissions, including two Special Sessions.
One day before and one day after the core conference days, you will have the opportunity to attend workshops which are traditionally held in conjunction with ICME. This year we have the pleasure to bring to you 13 workshops, covering recent developments in visual fashion computing, visual emotion analysis and smart-health technologies, among other emerging topics.
This year is special for ICME. In addition to a notable 20th anniversary, we are pleased to report a record high number of submissions to the conference main track. From more than 1,000 papers submitted to the conference, approx. 30% were accepted (313 papers). The Technical Program Co-chairs recruited 72 Area Chairs who first of all assisted in the recruitment of reviewers to cover 32 distinctive subject areas. This year the selection of subject areas mainly reflected the growing use of machine learning and artificial intelligence in multimedia applications. Out of 32 subject areas, 15 covered various topics of deep learning and artificial intelligence for multimedia, which were then addressed by approx. 1/2 of all submissions. The next most popular topic was Multimedia and Vision, with approx. 1/8 of the submissions to this area.
With a target of at least three high quality reviews for each paper, 617 reviewers provided on average 3.85 reviews per paper, with 99% of papers receiving 3 or more reviews. Based on the obtained reviews, our Area Chairs then provided recommendations for each paper so that the difficult task of making acceptance decisions could be performed. Finally, the TPC recommended papers for 36 oral and 18 poster sessions. Amongst a number of highly rated manuscripts, 6 of the very best papers have been shortlisted for awards, where the final selection will be decided during the conference in association with the Awards Chairs.
Our gratitude goes to the Area Chairs and the reviewers, whose technical expertise and dedication were crucial to the thorough technical assessment and selection of papers, and who made the whole process all the more pleasurable. We will recognise those colleagues who made the most valuable contributions with special awards for Area Chairs and reviewers.
Lastly, we thank the General Chairs Tao Mei, Feng Wu and Lina Karam as well as ICME Steering Committee Chair Yap-Peng Tan for their patience and guidance. Many thanks also to all the members of the Organizing Committee for their full support in preparation of the conference. Finally, we would like to thank our authors from all over the world whose valuable and novel contributions are essential for both the continued success of ICME and the advancement of technology for humanity.
ICME 2019 Technical Program Committee Co-Chairs
Marta Mrak, BBC R&D, UK
Jun Wu, Tongji University, China
Honggang Wang, University of Massachusetts Dartmouth, USA
Roger Zimmermann, National University of Singapore, Singapore
Zhu Li, University of Missouri, Kansas City, USA
Lei Zhang, Microsoft, USA
Organizing Committee

General Chairs
Lina J. Karam, Arizona State University, USA
Tao Mei, JD AI Research, China
Feng Wu, University of Science and Technology of China, China

Program Chairs
Jun Wu, Tongji University, China
Marta Mrak, BBC R&D, UK
Zhu Li, University of Missouri, Kansas City, USA
Honggang Wang, University of Massachusetts Dartmouth, USA
Lei Zhang, Microsoft, USA
Roger Zimmermann, National University of Singapore, Singapore

Panel Chairs
Chang Wen Chen, CUHK-SZ, China / SUNY-Buffalo, USA
Chia-Wen Lin, National Tsing Hua University, Taiwan
Fernando Pereira, Instituto Superior Técnico, Portugal

Workshop Chairs
Susanne Boll, University of Oldenburg, Germany
Jingdong Wang, Microsoft Research Asia, China
Z. Jane Wang, University of British Columbia, Canada

Tutorial Chairs
Jiebo Luo, University of Rochester, USA
Zheng-Jun Zha, University of Science and Technology of China, China

Special Session Chairs
Junwei Han, Northwestern Polytechnical University, China
Enrico Magli, Politecnico di Torino, Italy

Grand Challenges Chairs
Gene Cheung, York University, Canada
Jiaying Liu, Peking University, China

Award Chairs
Mei-Ling Shyu, University of Miami, USA
Yonggang Wen, Nanyang Technological University, Singapore

Industrial Program Chairs
Liang Lin, Sun Yat-Sen University, China
Chonggang Wang, InterDigital, USA
Xiaoqing Zhu, Cisco, USA

Student Program Chairs
Weiyao Lin, Shanghai Jiao Tong University, China
Xiaoyan Sun, Microsoft Research Asia, China
Shaoen Wu, Ball State University, USA

Poster/Demo Chairs
Yu-Gang Jiang, Fudan University, China
Cong Shen, University of Science and Technology of China, China
Dong Tian, InterDigital, USA

Web Chairs
Wu Liu, JD AI Research, China
Dalei Wu, University of Tennessee, USA

Local/Event Chairs
Chong Luo, Microsoft Research Asia, China
Hanli Wang, Tongji University, China
Dan Zeng, Shanghai University, China

Sponsorship Chairs
Le Dong, University of Electronic Science and Technology of China, China
Nian Tong, University of Science and Technology of China, China
Junsong Yuan, State University of New York, Buffalo, USA
Yongdong Zhang, University of Science and Technology of China, China

Publication Chairs
Qi Tian, University of Texas at San Antonio, USA
Rui Wang, Tongji University, China
Jian Zhang, University of Technology Sydney, Australia

Publicity Chairs
Wen-Huang Cheng, National Chiao Tung University, Taiwan
Richang Hong, Hefei University of Technology, China
Shiwen Mao, Auburn University, USA
Shui Yu, University of Technology Sydney, Australia

Finance Chairs
Chengcui Zhang, University of Alabama at Birmingham, USA
Dongdong Zhang, Tongji University, China

Registration Chairs
Dong Liu, University of Science and Technology of China, China
Haoqi Ren, Tongji University, China
Liquan Shen, Shanghai University, China
Keynote
Tuesday, July 9, 2019
K-01: Neural Circuit Plasticity: From Brain Research to Machine Learning and Back
Time: 8:30 - 9:30 AM
Room: Auditorium 3F
Speaker: Mu-Ming Poo, Institute of Neuroscience, Chinese Academy of Sciences & CAS Center for Excellence in Brain Science and Intelligence Technology, China
Chair: Feng Wu, University of Science and Technology of China, China
Abstract
The most important feature of the brain is the plasticity of neural circuits, i.e., the structure and function of neural circuits can be modified by experience. This plasticity is the basis of most of our cognitive functions, such as sensory perception, multisensory integration, learning and memory, pattern recognition, attention and decision-making. In this lecture, I will summarize our current concept of neural circuit plasticity, which represents a major achievement of neuroscience in the past decades. First, the architecture of the neural circuits is shaped by experience through experience-induced sculpting (pruning) of connections, a process most prominent in early brain development that continues to a limited extent in the adult brain. Second, the efficiency of signal transmission in specific neural circuits at the junctions (synapses) between nerve cells (neurons) can be modified by experience-induced neural activities in a manner that depends on the pattern (frequency and timing) of electrical spikes in the pre- and postsynaptic neurons. Activity-induced long-term potentiation (LTP) and long-term depression (LTD) of existing synapses are the predominant mechanisms underlying learning and memory in the adult brain. Concepts of neural circuit architecture and synaptic plasticity have triggered the emergence of efficient machine learning algorithms in the past decades, and will continue to inspire the development of new machine learning methods in the future. Conversely, I will argue that findings in the field of machine learning and artificial neural networks could help to facilitate the elucidation of new features and functions of neural circuits in the brain. This is illustrated by our own discovery of back-propagating LTP and LTD in neural circuits, in a line of research inspired by the back-propagation algorithm used in supervised machine learning.
Finally, I will summarize our effort in exploring whether higher cognitive functions such as self-awareness may originate from experience-dependent neural circuit plasticity. We discovered that mirror self-recognition, a hallmark of self-awareness known to be limited to humans and great apes, could be acquired by rhesus monkeys following extensive training for visual-somatosensory or visual-proprioceptive association, thus providing a new experimental system for studying the neural circuit mechanisms and plasticity underlying self-awareness. Thus, crosstalk between researchers in neuroscience and machine learning will trigger new developments in both fields.
Bio: Mu-ming Poo is the founding and current director of the Institute of Neuroscience, Chinese Academy of Sciences, director of the CAS Center for Excellence in Brain Science and Intelligence Technology, and Paul Licht Distinguished Professor in Biology Emeritus at the University of California, Berkeley. He studied physics at Tsinghua University (Taiwan) and received his Ph.D. in biophysics from Johns Hopkins University. He has served on the faculty of the University of California at Irvine, Yale University, Columbia University, the University of California at San Diego, and the University of California, Berkeley. He has made seminal contributions in studying neuronal differentiation, axon guidance and synaptic plasticity. He is a member of Academia Sinica, the US National Academy of Sciences, the Chinese Academy of Sciences, and the Academy of Science of Hong Kong. He has received the Ameritec Prize, Docteur Honoris Causa from Ecole Normale Supérieure, Paris and from Hong Kong University of Science and Technology (2014), the P. R. China International Science & Technology Cooperation Award (2005), the Qiushi Distinguished Scientist Award (2011), and the Gruber Neuroscience Prize (2016). He is currently on the editorial board of more than 10 academic journals, including Neuron, and serves as the Executive Editor-in-Chief for National Science Review.
Wednesday, July 10, 2019
K-02: AI Ethics: From Principles to Practices
Time: 8:30 - 9:30 AM
Room: Auditorium 3F
Speaker: Harry Shum, Executive Vice President of Microsoft’s Artificial Intelligence and Research Group, USA
Chair: Lina J. Karam, Arizona State University, USA
Abstract
Recent achievements and advancements in AI have outpaced what anyone would have thought imaginable even five to ten years ago. For instance, we are fast approaching human parity across many areas of AI: speech, vision, language and knowledge. But many practitioners building AI technology and deploying AI products have not always thought through the societal implications such as fairness and transparency. In this talk, I will discuss how we can best address these societal challenges before the next AI innovation and development cycle. Up to this point, the answer has centered on principles: guidelines to help companies and countries navigate the complexities and implications of AI. But principles alone are no longer enough; industry, academia and government need to take action now to move from principles to practices. I will share what we have been practicing in Microsoft AI and Research, from doing research in explainable and interpretable AI, to leveraging useful tools like datasheets and checklists commonly used in other industries, to forming an internal AI ethics committee providing guidelines for shipping AI products, to sharing and learning best practices with other companies through the Partnership on AI.
Bio: Harry Shum is executive vice president of Microsoft’s Artificial Intelligence (AI) and Research group. He is responsible for driving the company’s overall AI strategy and forward-looking research and development efforts spanning infrastructure, services, apps and agents. He oversees AI-focused product groups including Bing and Cortana. He also leads Microsoft Research, one of the world’s premier computer science research organizations, and its integration with the engineering teams across the company. Previously, Dr. Shum served as the corporate vice president responsible for Bing search product development from 2007 to 2013. Prior to his engineering leadership role at Bing and online services, he oversaw the research activities at Microsoft Research Asia and the lab’s collaborations with universities in the Asia Pacific region, and was responsible for the Internet Services Research Center, an applied research organization dedicated to advanced technology investment in search and advertising at Microsoft. Dr. Shum joined Microsoft Research in 1996 as a researcher based in Redmond, Washington. In 1998 he moved to Beijing as one of the founding members of Microsoft Research China (later renamed Microsoft Research Asia). There he began a nine-year tenure as a researcher, subsequently moving on to become research manager, assistant managing director and managing director of Microsoft Research Asia and a Distinguished Engineer. Dr. Shum is an IEEE Fellow and an ACM Fellow for his contributions to computer vision and computer graphics. He received his Ph.D. in robotics from the School of Computer Science at Carnegie Mellon University. In 2017, he was elected to the National Academy of Engineering of the United States.
Thursday, July 11, 2019
K-03: Multimedia Driven Precise Medicine
Time: 8:30 - 9:30 AM
Room: Auditorium 3F
Speaker: Nozha Boujemaa, Chief Science and Innovation Officer, MEDIAN Technologies, France
Chair: Tao Mei, AI Research of JD.COM, China
Abstract:
Multimedia research has many applications that can empower the deployment of AI technologies in the health and precise medicine domain. For example, clinical decision-making in oncology today is often guided by the results of a biopsy, an invasive procedure that carries potential risk for the patient and does not take into account the tumor context as a whole, because tumors are in essence heterogeneous. One of the major challenges for personalized and predictive medicine is to identify whether or not a patient will respond to treatment. In immuno-oncology, 80% of patients receive treatments to which they do not respond. Using imaging to perform virtual biopsies, we fully analyze the tumor and its environment non-invasively, from medical imaging modalities such as scanner or MRI. This talk will show some examples of how multimodal information retrieval can identify certain cancer phenotypes by extracting image signatures and mapping tumor heterogeneity. Deep learning will help predict the patient’s prognosis and select and target patients for clinical trials. The great advantage of AI is ensuring patient well-being, reducing the time to market for drug innovations and informing clinical decisions. The robustness of AI technology is key to achieving trustworthy and responsible AI services in this context.
Bio: Nozha Boujemaa is a Key Opinion Leader in the field of Artificial Intelligence and data sciences. She is a Director of Research at Inria (the French National Institute for computer science and applied mathematics). She was the scientific Head of the IMEDIA/Inria Research Group (Large Scale Multimedia Content Search) for 10 years before becoming Director of the Inria Saclay Research Center from 2010 to 2015 and Advisor to the Chairman and CEO of Inria in Data Sciences. In 2017, Nozha Boujemaa founded the DATAIA Institute, an interdisciplinary Institute on Data Sciences, Artificial Intelligence & Society, which she will run until the end of 2018. As an expert in Interactive Visual Content Indexing and Retrieval and in unsupervised & semi-supervised learning, Nozha Boujemaa has contributed to the emergence of next-generation large-scale multimedia search engines. She managed several flagship collaborative projects with French and European industrial partners, is the co-author of over 150 publications in peer-reviewed journals and international conferences and has supervised over 25 PhD and master students.
Nozha Boujemaa is Knight of the National Order of Merit and Member of the Board of Directors of Big Data Value Association (BDVA), Vice-Chair of the Artificial Intelligence High Level Expert Group (AI HLEG) of the European Commission and member of the AI Group of Experts of the OECD (AIGO). Nozha is also International Advisor for Japan Science and Technology Agency Program “Advanced Core Technologies for Big Data Integration” and Senior Scientific Advisor for “The AI Initiative” (Harvard Kennedy School). She is President of the Scientific Council of the Institute of Technological Research “SystemX” until the end of 2018.
Academic Panel
Tuesday, July 9, 2019
Towards an Excellent Academic Career
Time: 11:15 AM - 12:30 PM
Room: Auditorium 3F
Synopsis:
The panelists are leading academic scholars who have achieved distinguished status in various areas of multimedia research. They come from well-known universities around the world and will share with ICME 2019 attendees their valuable experiences at different stages of an academic career, including their successful strategies for working towards an excellent academic career. Their experiences are applicable not only to those who are pursuing or intend to pursue an academic career, but are also closely relevant to researchers working in research labs and corporate R&D divisions. We all look forward to this exciting panel, which will benefit the growth of our next generation of young multimedia researchers.
Moderators:
Chang Wen Chen, CUHK-SZ, China / SUNY-Buffalo, USA
Fernando Pereira, Instituto Superior Técnico, Portugal
Panelists:
Frederic Dufaux, Universite Paris-Sud, France
Abed El Saddik, University of Ottawa, Canada
Lina Karam, Arizona State University, USA
Jay Kuo, University of Southern California, USA
Yong Lian, University of Singapore, Singapore
Dapeng Wu, University of Florida, USA
Industry Panel
Wednesday, July 10, 2019
From Papers to Products: Bridging the Gap between Multimedia Research and Practical Applications
Time: 11:15 AM - 12:30 PM
Room: Auditorium 3F
Synopsis:
The ultimate goal of innovations in any field of engineering is to make an impact in practice. The path from brilliant research ideas to their practical applications, however, is almost never straightforward. At times, the barriers to adoption of novel technologies may even originate from non-technical sources. At ICME 2019, we gather a panel of distinguished panelists to share their personal journeys and perspectives in navigating the landscape of applied research. Our panelists draw from their own distinguished careers to discuss how to best foster innovation in industry, for instance, how to select research topics with both practical relevance and intellectual merit. They will also discuss and debate potential pitfalls to avoid along the journey of innovation.
Moderator: Xinxin Gao, CEO, The Jiangmen, China
Bio: Gao Xinxin Vanessa is the co-founder and CEO of Jiangmen. Jiangmen is a venture capital firm that focuses on investment in early-stage tech companies, with an innovative model. Jiangmen runs a high-quality tech community of Chinese AI researchers, scientists and tech experts in China. Jiangmen also partners with more than 50 global Fortune 500 companies and leading Chinese companies in retail, logistics, finance, healthcare, automotive and more, to bring technology applications into business scenarios. Before founding Jiangmen, Gao Xinxin Vanessa was the CEO of Microsoft Ventures China. Before that she worked for Microsoft Research and Microsoft China. Gao Xinxin Vanessa received her master's degree from INSEAD, and from Tsinghua University.
Panelists:
Nozha Boujemaa, Chief Science and Innovation Officer, MEDIAN Technologies, France
Bio: Nozha Boujemaa is a Key Opinion Leader in the field of Artificial Intelligence and data sciences. She is a Director of Research at Inria (the French National Institute for computer science and applied mathematics). She was the scientific Head of the IMEDIA/Inria Research Group (Large Scale Multimedia Content Search) for 10 years before becoming Director of the Inria Saclay Research Center from 2010 to 2015 and Advisor to the Chairman and CEO of Inria in Data Sciences. In 2017, Nozha Boujemaa founded the DATAIA Institute, an interdisciplinary Institute on Data Sciences, Artificial Intelligence & Society, which she will run until the end of 2018. As an expert in Interactive Visual Content Indexing and Retrieval and in unsupervised & semi-supervised learning, Nozha Boujemaa has contributed to the emergence of next-generation large-scale multimedia search engines. She managed several flagship collaborative projects with French and European industrial partners, is the co-author of over 150 publications in peer-reviewed journals and international conferences and has supervised over 25 PhD and master students. Nozha Boujemaa is Knight of the National Order of Merit and Member of the Board of Directors of Big Data Value Association (BDVA), Vice-Chair of the Artificial Intelligence High Level Expert Group (AI HLEG) of the European Commission and member of the AI Group of Experts of the OECD (AIGO). Nozha is also International Advisor for Japan Science and Technology Agency Program “Advanced Core Technologies for Big Data Integration” and Senior Scientific Advisor for “The AI Initiative” (Harvard Kennedy School). She is President of the Scientific Council of the Institute of Technological Research “SystemX” until the end of 2018.
Xian-Sheng Hua, Head of AI Center/Distinguished Engineer, DAMO Academy/Alibaba Cloud, China
Bio: Xiansheng Hua is now a Distinguished Engineer/VP of Alibaba DAMO Academy. He received the B.S. and Ph.D. degrees in applied mathematics from Peking University in 1996 and 2001, respectively. He joined Microsoft Research Asia in 2001 as a Researcher. He was a Principal Research and Development Lead in multimedia search with the Microsoft Search Engine in the USA from 2011 to 2013. He was a Senior Researcher with Microsoft Research Redmond from 2013 to 2015. He became a Researcher and the Senior Director of the Alibaba Group in 2015, where he has led the Visual Computing Team in the Search Division, Alibaba Cloud, and then DAMO Academy. He is currently a Distinguished Engineer/Vice President of the Alibaba Group, where he is leading a team working on large-scale visual intelligence on the cloud. He has authored or co-authored more than 200 research papers and has filed more than 90 patents. His research interests include big multimedia data search, advertising, understanding, and mining, pattern recognition, and machine learning. He is an IEEE Fellow and an ACM Distinguished Scientist. He was one of the recipients of the 2008 MIT Technology Review TR35 Young Innovator Award for his outstanding contributions to video search. He was also a recipient of the Best Paper Award at ACM Multimedia 2007, and the Best Paper Award of the IEEE Transactions on Circuits and Systems for Video Technology in 2014. He served as a Program Co-Chair for IEEE ICME 2012, ACM Multimedia 2012, and IEEE ICME 2013. He will be serving as a General Co-Chair of ACM Multimedia in 2020.
Wenjun Zeng, Principal Research Manager, Microsoft Research Asia, China
Bio: Wenjun (Kevin) Zeng is a Principal Research Manager and a member of the Senior Leadership Team (SLT) at Microsoft Research Asia. He is a Fellow of the IEEE. He has been leading the video analytics research powering the Microsoft Cognitive Services, Azure Media Analytics Services, Microsoft Office, and Windows Machine Learning (ML) since 2014. He was with Univ. of Missouri from 2003 to 2016, most recently as a Full Professor. Prior to that, he had worked for PacketVideo Corp., Sharp Labs of America, Bell Labs, and Panasonic Technology. He received his B.E., M.S., and Ph.D. degrees from Tsinghua Univ., the Univ. of Notre Dame, and Princeton Univ., respectively. Dr. Zeng is on the Editorial Board of International Journal of Computer Vision. He was an Associate Editor-in-Chief of IEEE Multimedia Magazine, an Associate Editor of IEEE Trans. on Circuits & Systems for Video Technology (TCSVT), IEEE Trans. on Info. Forensics & Security, and IEEE Trans. on Multimedia (TMM), and was on the Steering Committee of IEEE Trans. on Mobile Computing and IEEE TMM. He was a Special Issue Guest Editor for the Proceedings of the IEEE, TMM, ACM TOMCCAP, TCSVT, and IEEE Communications Magazine. He served as the Steering Committee Chair of IEEE ICME in 2010 and 2011, and has served as the General Chair or TPC Chair of several IEEE conferences (e.g., ICME2018, ICIP2017). He was the recipient of several best paper awards.
Marta Mrak, Lead Research Engineer at BBC, UK
Bio: Marta Mrak is a Lead R&D Engineer at BBC R&D working on video compression, new content experiences and data analytics. She is also an Honorary Professor at Queen Mary University of London (QMUL), Multimedia and Vision Research Group. Within IEEE, Marta is active in numerous activities, e.g. she serves as General Co-Chair of IEEE ICME 2020, Lead TPC Co-Chair for IEEE ICME 2019 and Vice Chair of Technical Committee on Multimedia Signal Processing.
Qibin Sun, Professor at University of Science and Technology of China and Founder of Xietong Info-Tech Pte. Ltd., China
Bio: Dr. Qibin Sun is currently a Professor at University of Science and Technology of China and the Founder of Xietong Info-Tech Pte. Ltd. He was a Senior Manager at HP, and later a Distinguished Engineer at Cisco. Qibin has full-stack experience spanning research, productization, and commercialization.
Lei Zhang, Algorithm Scientist at Kuaishou Technology, China
Bio: Dr. Lei Zhang graduated from the Institute of Computing Technology of the Chinese Academy of Sciences in 2015 with a PhD degree. Lei’s research primarily focuses on large-scale multimedia retrieval. He has published 11 IEEE journal articles in the field of computer vision and multimedia research. He is the lead author of 7 of the 11 published papers, which have been widely cited and used. After graduation, Lei joined Huawei’s 2012 Hardware Engineering Institute and worked on developing mobile phone camera algorithms. He participated in the research of core camera algorithms for Huawei’s flagship mobile phones, including binocular fusion for the P9 (the first binocular flagship), hybrid zoom for the Mate 9, and night scene shooting for the P20. His research specialization includes mobile phone camera algorithms, image processing and deep learning. Lei joined Kuaishou in 2018, and is currently engaged in researching target retrieval algorithms. He is also responsible for building the Hangzhou AI team in the Multimedia Understanding Department of Kuaishou.
Multimedia Rising Star Panel
Thursday, July 11, 2019
Time: 10:00 - 11:00 AM
Room: Auditorium 3F
Synopsis:
This is a unique panel featuring young and accomplished researchers working in emerging areas of multimedia research. The selection process for these Multimedia Rising Stars has been very competitive. Each of the four ICME-sponsoring IEEE Societies was asked to nominate up to three candidates who received their PhD within the past 10 years. The final four stars were selected after a thorough collective assessment of each candidate by the Panel Chairs. Multiple factors were carefully considered, including their scholastic achievements after their PhD degree, their professional service records, and their proposed presentation topics. These emerging topics range from “Machine Learning for Creative AI Applications in Music” to “Deep Metric Learning for Multimedia Content Understanding”, and from “Large-scale Multimedia Semantic Information Extraction and Coding” to “Computational Light Field Imaging and Intelligent Reconstruction.” This Rising Star panel will provide a live feast for multimedia researchers at all levels.
Moderators:
Chang Wen Chen, CUHK-SZ, China / SUNY-Buffalo, USA
Chia-Wen Lin, National Tsing Hua University, Taiwan
Panelist 1: Yi-Hsuan Yang, Academia Sinica, Taiwan
Talk Title: Machine Learning for Creative AI Applications in Music
Panelist 2: Jiwen Lu, Tsinghua University, China
Talk Title: Deep Metric Learning for Multimedia Content Understanding
Panelist 3: Weiyao Lin, Shanghai Jiao Tong University, China
Talk Title: Large-scale Multimedia Semantic Information Extraction and Coding
Panelist 4: Lu Fang, Tsinghua University, China
Talk Title: Multiscale Camera Array for Future Vision Intelligence
Multimedia Star Innovators
Wednesday, July 10, 2019
Multimedia Star Innovator Keynote Highlights
Time: 10:00 - 11:00 AM
Room: Auditorium 3F
Chair: Lina J. Karam, Arizona State University, USA
* Note: Onsite voting will run from 11:00 AM to 5:45 PM through the conference mobile app.
Innovation Award Finalists
Achin Bhowmik, Chief Technology Officer & Executive VP Engineering, Starkey Hearing Technologies, USA
Bio: Dr. Achin Bhowmik is the chief technology officer and executive vice president of engineering at Starkey Hearing Technologies, a privately-held medical devices business with 6,000 employees and operations in over 100 countries worldwide. In this role, he is responsible for overseeing the company’s technology strategy, product development and engineering departments, and is leading the drive to redefine medical wearable devices with advanced sensors and artificial intelligence technologies.
Prior to joining Starkey, Dr. Bhowmik was vice president and general manager of the Perceptual Computing Group at Intel Corporation. There, he was responsible for the R&D, engineering, operations, and businesses in the areas of 3D sensing and interactive computing, computer vision and artificial intelligence, autonomous robots and drones, and immersive virtual and merged reality devices. Previously, he served as the chief of staff of the Personal Computing Group, Intel’s largest business unit with >$30B annual revenues in 2010.
As an adjunct professor and guest lecturer, Dr. Bhowmik advises graduate research and teaches courses on human-computer interactions and perceptual computing technologies at the University of California, Berkeley, Stanford University, Liquid Crystal Institute of the Kent State University, Kyung Hee University, Seoul, and the Indian Institute of Technology, Gandhinagar.
Dr. Bhowmik was elected a Fellow of the Society for Information Display (SID). He serves on the board of advisors for the Fung Institute for Engineering Leadership at UC Berkeley, the executive board for SID, and the board of directors for OpenCV. He also serves on the board of directors and advisors for several technology startup companies. He received the Industrial Distinguished Leader Award from the Asia-Pacific Signal and Information Processing Association. He has over 200 publications, including two books and 34 issued patents.
Touradj Ebrahimi, Professor, Swiss Federal Institute of Technology (EPFL), Switzerland
Bio: Touradj Ebrahimi received his M.Sc. and Ph.D., both in Electrical Engineering, from the Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, in 1989 and 1992 respectively. In 1993, he was a research engineer at the Corporate Research Laboratories of Sony Corporation in Tokyo, where he conducted research on advanced video compression techniques for storage applications. In 1994, he served as a research consultant at AT&T Bell Laboratories working on very low bitrate video coding. He is currently Professor at EPFL heading its Multimedia Signal Processing Group. He is also the Convenor of the JPEG standardization Committee. He was also an adjunct Professor with the Center of Quantifiable Quality of Service at the Norwegian University of Science and Technology (NTNU) between 2008 and 2012.
Prof. Ebrahimi has been the recipient of various distinctions and awards, such as the IEEE and Swiss national ASE award, the SNF-PROFILE grant for advanced researchers, four ISO-Certificates for key contributions to MPEG-4 and JPEG 2000, and the best paper award of IEEE Trans. on Consumer Electronics. He became a Fellow of the international society for optical engineering (SPIE) in 2003. Prof. Ebrahimi has initiated more than two dozen National, European and International cooperation projects with leading companies and research institutes around the world. He is a co-founder of Genista SA, a high-tech start-up company in the field of multimedia quality metrics. In 2002, he founded Emitall SA, a start-up active in the area of media security and surveillance. In 2005, he founded EMITALL Surveillance SA, a start-up active in the field of privacy and protection. He is or has been an associate Editor with various IEEE, SPIE, and EURASIP journals, such as IEEE Signal Processing Magazine, IEEE Trans. on Image Processing, IEEE Trans. on Multimedia, EURASIP Image Communication Journal, EURASIP Journal of Applied Signal Processing, and SPIE Optical Engineering Magazine. Prof. Ebrahimi is a member of the Scientific Advisory Board of various start-up and established companies in the general field of Information Technology. He has served as Scientific Expert and Evaluator for Research Funding Agencies such as those of the European Commission, The Greek Ministry of Development, The Austrian National Foundation for Scientific Research, and The Portuguese Science Foundation, as well as a number of Venture Capital Companies active in the field of Information Technologies and Communication Systems. His research interests include still, moving, and 3D image processing and coding, visual information security (rights protection, watermarking, authentication, data integrity, steganography), new media, and human computer interfaces (smart vision, brain computer interface).
He is the author or the co-author of more than 200 research publications, and holds 14 patents. Prof. Ebrahimi is a member of IEEE, SPIE, ACM and IS&T.
Henrique "Rico" S. Malvar, Chief Scientist, Microsoft Research, USA
Bio: Henrique (Rico) Malvar was born in Brazil. He received a PhD in signal processing from MIT in 1986. He joined Microsoft Research in 1997, where he is currently the Chief Scientist. Rico and his teams developed new technologies for multimedia compression used in Windows, Office, Xbox, Skype, and Azure. He contributed to standard formats such as G.722.1 and H.264, and was a main architect for the WMA and JPEG XR formats. He also worked on audio signal enhancement and beamforming technologies, used in Windows, Skype, Kinect, and HoloLens. He has authored or co-authored over 160 technical publications and over 120 issued US patents. He is a Fellow of the IEEE, and received the Technical Achievement Award from the IEEE Signal Processing Society in 2002. He was elected to the US National Academy of Engineering in 2012.
In 2015, Rico founded the Microsoft Research NExT Enable group, which leverages new multimedia interfaces to develop systems and applications to improve the lives of people with disabilities. The group co-developed, with the Microsoft Windows engineering team, new eye tracking software interfaces and the new Eye Control user interfaces, which allow a person to have full control of a device running Windows 10 using only their eye movements. The group also developed and shipped the Soundscape application, which uses 3D audio to help people with low or no vision to build a mental map of points of interest, as they walk about in the world, without the need to look at screens. The Enable group works closely with NGOs dedicated to accessibility, such as Team Gleason, the ALS Association, Guide Dogs for the Blind, and the Lighthouse for the Blind.
Rajan Patel, Senior Director, Google, USA
Bio: Rajan Patel is a Senior Director leading Augmented Reality engineering teams. Prior to working on AR, he worked on Google’s Search algorithm, leading teams working to improve topicality and freshness of search results.
He received a Ph.D. from the Biostatistics department at Emory University, where he developed statistical methods to analyze the functional connectivity of the brain using functional magnetic resonance imaging (fMRI) data.
Aparna Chennapragada, Vice President, Google, USA
Bio: Aparna Chennapragada is the Vice President of Augmented Reality and Google Lens. She also serves on the board of Capital One.
Aparna previously worked as a Senior Director and Technical Assistant to the CEO of Google, helping drive company-wide product efforts. She also led Google Now, a proactive digital assistant effort, and worked on many areas in Google Search and YouTube over the years. With over 20 years of experience in the tech industry as a computer scientist and product leader, she is excited about the potential of AI and algorithms to build products that improve everyday life.
Wednesday, July 10, 2019
Multimedia Star Innovator Keynotes
Time: 15:30 - 17:30
Room: 3B
Chair: Lina J. Karam Arizona State University, USA
Grand Challenges
Thursday, July 11, 2019
Grand Challenge Highlights
Time: 11:15 AM - 12:30 PM
Room: Auditorium 3F
Chair: Gene Cheung York University, Canada
Jiaying Liu Peking University, China
Schedule:
11:15 - 11:20 Opening Remarks
11:20 - 11:35 Grand Challenge: 106-p Facial Landmark Localization
Overview Hailin Shi AI Platform and Research, JD.com, China
Winner Talk
11:35 - 11:50 Grand Challenge: Learning-Based Image Inpainting
Overview Dong Liu University of Science and Technology of China, China
Winner Talk
11:50 - 12:05 Grand Challenge: Short Video Understanding Challenge -- Recommending All You Want to See
Overview Changhu Wang Bytedance AI Lab, China
Winner Talk
12:05 - 12:30 Grand Challenge: Saliency4ASD: Visual attention modeling for Autism Spectrum Disorder
Overview Patrick Le Callet University of Nantes, France
Winner Talk for Track 1
Winner Talk for Track 2
Thursday, July 11, 2019
G-01: Short Video Understanding Challenge -- Recommending All You Want to See
Time: 14:00 - 15:00
Room: 3B
Description:
This challenge provides multi-modal video features, including visual, text, and audio features, as well as user interaction behavior data, such as click, like, and follow. Each participant needs to model the user’s interest from a dataset of videos and user interaction behavior, and then predict the user’s click behavior on another video dataset. Participants are ranked on the models and predictions they submit, using a predefined score specified in the evaluation criteria. Website: http://ai-lab-challenge.bytedance.com/tce/vc/
Organizers:
Changhu Wang
Bytedance AI Lab, China
Yi Ma
University of California, Berkeley, USA
Wei-Ying Ma
Bytedance AI Lab, China
14:00 - 14:06 OPENING REMARKS
Dr. Changhu Wang
Bytedance AI Lab, China
14:06 - 14:14 ORAL SESSION:
Enhanced Short Video Understanding by Integrating User Behavior and Multimedia
Content Information
Lin Zhu
Ctrip Travel Network Technology Co., Limited, China
14:14 - 14:22 PREDICTING USER BEHAVIOR USING ITEM2VEC WITH FREQUENC
Chun Tao1, Haocheng Xu2, Kang Yang3, Feng Lu4, Xue Du5
1Nanjing Tech University, China, 2University of Electronic Science and Technology of China, China, 3Sichuan University, China, 4Hisense, China, 5Chongqing University of Posts and Telecommunications, China
14:22 - 14:30 TRUNCATED SVD-BASED FEATURE ENGINEERING FOR SHORT VIDEO
UNDERSTANDING AND RECOMMENDATION
Tsun-Hsien Tang1,2, Kuan-Ta Chen2, Hsin-Hsi Chen1,3
1National Taiwan University, Taiwan, 2Academia Sinica, Taiwan, 3MOST Joint Research Center for AI Technology and All Vista Healthcare, Taiwan
14:30 - 14:38 SHORT VIDEO CONTENT UNDERSTANDING AND RECOMMENDATION
BASED ON GRADIENT BOOSTING TREE AND DEEP NETWORK
Yaxi Wu1, Majing Lou2, Zhibin Lian3
1JD, China, 2Mininglamp Technology, China, 3South China Normal University, China
14:38 - 14:46 ENGINEERING IMPLEMENTATION IN ICME GRAND CHALLENGE
Guanhao Cheng, Huiqin Xiao, Jianwei Li, Dongwei Zhao, Xiaosheng Wu
Netease, China
14:46 - 14:54 BUILDING EFFECTIVE SHORT VIDEO RECOMMENDATION
Yang Liu1, Cheng Lyu1, Zhiyuan Liu1, and Dacheng Tao2
1Southeast University, China, 2The University of Sydney, Australia
14:54 - 15:00 AWARDING SESSION
Thursday, July 11, 2019
G-02: Grand Challenges of 106-p Facial Landmark Localization
Time: 15:30 - 16:30
Room: 3B
Description:
As deep learning methods for facial landmark localization have matured, the requirements of practical applications have grown quickly. However, localization accuracy still needs to be improved for large poses and occlusion. JD AI Research and NLPR, CASIA sincerely invite researchers and developers from academia and industry to participate in this competition and encourage further discussion on technical and application issues.
Website: https://facial-landmarks-localization-challenge.github.io
Organizers:
Hailin Shi
JD AI Platform and Research, China
Xiaobo Wang
JD AI Platform and Research, China
Xiangyu Zhu
Chinese Academy of Sciences, China
Yinglu Liu
JD AI Platform and Research, China
Hao Shen
JD AI Platform and Research, China
15:30 - 15:40 OPENING REMARKS
Dr. Yinglu Liu
JD AI
15:40 - 15:55 FACIAL LANDMARK LOCALIZATION BASED ON AUTO-STACKED HOURGLASS NETWORK AND EXPECTATION CONSENSUS
Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi
Department of Computer Vision Technology (VIS), Baidu Inc, China
15:55 - 16:10 MULTI-SCALE DENSELY U-NETS REFINE NETWORK FOR FACE ALIGNMENT.
Jun Yu1, Haonian Xie1, Guochen Xie1, Mengyan Li1, Zengfu Wang2
1University of Science and Technology of China, China, 2Institute of Intelligent Machines, Chinese Academy of Sciences, China
16:10 - 16:25 IMPROVED HOURGLASS STRUCTURE FOR HIGH PERFORMANCE FACIAL LANDMARK DETECTION
Shenqi Lai, Zhenhua Chai, and Xiaoming Wei
Vision and Image Center of Meituan, China
16:25 - 16:30 AWARDS
Thursday, July 11, 2019
G-03: Learning-Based Image Inpainting
Time: 16:45 - 17:45
Room: 3CD
Description:
Image inpainting, also known as image completion, is the process of filling in the missing areas of an incomplete image so that the completed image is visually plausible. While this task is indispensable in many applications, such as disocclusion, object removal, and error concealment, it is still regarded as very difficult. Traditionally, several different approaches have been proposed for image inpainting, including partial differential equation-based inpainting, constrained texture synthesis, structure propagation, and database-assisted inpainting.
In recent years, deep learning has revolutionized research on image inpainting, and a number of deep models have been designed. Nonetheless, the lack of a public, widely acknowledged dataset has been a significant obstacle to developing advanced, learning-based inpainting solutions.
This challenge is meant to consolidate research efforts on learning-based image inpainting, especially deep learning approaches. We will prepare two tracks: error concealment (EC) and object removal (OR). In the EC track, we simulate the case of transmission error that incurs missing areas (usually square blocks) in a decoded image. In the OR track, we carefully select some objects in an image to be removed, and produce missing areas with irregular shapes. In both tracks we challenge the researchers to inpaint the incomplete image. The major difference between the two tracks is that, in the first track, we want to recover the missing areas so that the completed image is similar to the original (although this can be very difficult!), and in the second track, we are satisfied as long as the completed image is visually plausible and pleasing.
We are aware of a previous competition in conjunction with ECCV 2018, which also addresses the problem of image (and video) inpainting. Unlike that competition, our challenge evaluates the quality of completed images by both objective metrics (PSNR, SSIM) and subjective evaluation (MOS).
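As an illustration of the objective side of this evaluation, the sketch below computes PSNR between an original image and a completed one. It is only a minimal example under stated assumptions: images are 8-bit grayscale and given as nested lists, and the function name `psnr` and its parameters are hypothetical. The challenge's actual scoring scripts may differ.

```python
import math


def psnr(reference, completed, max_value=255.0):
    """Peak signal-to-noise ratio (dB) between two equally sized grayscale images.

    Both images are lists of rows of pixel intensities. This layout and the
    8-bit peak value of 255 are assumptions made for this illustration.
    """
    if len(reference) != len(completed) or any(
        len(ref_row) != len(out_row) for ref_row, out_row in zip(reference, completed)
    ):
        raise ValueError("images must have the same dimensions")

    squared_error = 0.0
    pixel_count = 0
    for ref_row, out_row in zip(reference, completed):
        for ref_pixel, out_pixel in zip(ref_row, out_row):
            squared_error += (ref_pixel - out_pixel) ** 2
            pixel_count += 1
    if pixel_count == 0:
        raise ValueError("images must not be empty")

    mse = squared_error / pixel_count  # mean squared error over all pixels
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)
```

A higher PSNR means the completed image is numerically closer to the original. This suits the EC track, where similarity to the original is the goal. The OR track asks only for visual plausibility, so a subjective measure such as MOS is the more natural fit there.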
Website: https://icme19inpainting.github.io/
Organizers:
Dong Liu
University of Science and Technology of China, China
Ming-Hsuan Yang
University of California at Merced, USA
16:45 - 16:55 OPENING REMARKS
Dr. Dong Liu
University of Science and Technology of China, China
16:55 - 17:10 INTERLEAVED ZOOMING NETWORK FOR IMAGE INPAINTING
Sen Liu, Zongyu Guo, Jiale Chen, Tao Yu, Zhibo Chen
University of Science and Technology of China, China
17:10 - 17:25 MSMC-NET: IMAGE INPAINTING USING DEEP MULTI-SCALE AND MULTI-CONNECTION NETWORKS
Miaohui Wang, Xiaoming Chen, Weiqian Chen, Yuan Yuan
Shenzhen University, China
17:25 - 17:40 IMAGE INPAINTING UNDER CHESSBOARD-LIKE MASKING
Shiqi Lin, Jilong Liu, Zhiyuan Zhou, Haoran Zhang, Xueliang Liu
Hefei University of Technology, China
17:40 - 17:45 AWARD CEREMONY
Thursday, July 11, 2019
G-04: Saliency4ASD: Visual attention modeling for Autism Spectrum Disorder
Time: 16:45 - 17:45
Room: 3B
Description:
The purpose of the Grand Challenge Saliency4ASD is to direct the efforts of the visual attention modeling community towards a societal healthcare challenge. Gaze features related to saccades and fixations have demonstrated their usefulness in the identification of mental states, cognitive processes and neuropathologies (Tseng et al., 2013; Itti, 2015), notably for people with ASD (Autism Spectrum Disorder).
Website: https://saliency4asd.ls2n.fr
Organizers:
Guangtao Zhai
Shanghai Jiao Tong University, China
Zhaohui Che
Shanghai Jiao Tong University, China
Jesús Gutiérrez
University of Nantes, France
Patrick Le Callet
University of Nantes, France
16:45 - 16:55 OPENING REMARKS
16:55 - 17:10 SALIENCY PREDICTION VIA MULTI-LEVEL FEATURES AND DEEP SUPERVISION FOR CHILDREN WITH AUTISM SPECTRUM DISORDER
Weijie Wei1, Zhi Liu1, Lijin Huang1, Alexis Nebout2, Olivier Le Meur2
1Shanghai University, China, 2University of Rennes 1, France
VISUAL ATTENTION MODELING FOR AUTISM SPECTRUM DISORDER BY U-NET
Yuming Fang, Hanqin Huang, Boyang Wan, and Yifan Zuo
Jiangxi University of Finance and Economics, China
PREDICTING SALIENCY MAPS FOR ASD PEOPLE
Alexis Nebout1, Weijie Wei2, Zhi Liu2, Lijin Huang2, Olivier Le Meur1
1University of Rennes 1, France, 2Shanghai University, China
CLASSIFYING AUTISM SPECTRUM DISORDER BASED ON SCANPATHS AND SALIENCY
Mikhail Startsev, Michael Dorr
Technical University of Munich, Germany
EXPLOITING VISUAL BEHAVIOUR FOR AUTISM SPECTRUM DISORDER IDENTIFICATION
Giuliano Arru, Pramit Mazumdar, Federica Battisti
Roma Tre University, Italy
SP-ASDNET: CNN-LSTM BASED ASD CLASSIFICATION MODEL USING OBSERVER SCANPATHS
Yudong Tao, Mei-Ling Shyu
University of Miami, USA
PREDICTING AUTISM DIAGNOSIS USING IMAGE WITH FIXATIONS AND SYNTHETIC SACCADE PATTERNS
Chongruo Wu1, Sidrah Liaqat2, Sen-ching Cheung2, Chen-Nee Chuah1, Sally Ozonoff1
1University of California, Davis, USA, 2University of Kentucky, USA
17:10 - 17:15 ANNOUNCEMENT OF THE RESULTS
17:15 - 17:30 ORAL PRESENTATION WINNER TRACK 1
17:30 - 17:45 ORAL PRESENTATION WINNER TRACK 2
Tutorials
Monday, July 8, 2019
T-01: Big Data Intelligence: From Correlation Discovery to Causal Reasoning
Time: 8:30 AM - 12:00 PM
Room: 3E
Speaker: Fei Wu Zhejiang University, Hangzhou, China
Abstract
Discovering correlations in large-scale data sets is a topic of great current interest. Artificial intelligence is now heading towards integrating data-driven learning and knowledge-guided inference to achieve better reasoning and decision making, rather than correlation learning via metric matching. This talk will discuss potential ways to fuse symbolic AI, data-driven learning and reinforcement learning to support causal reasoning.
Speaker
Fei Wu Zhejiang University, Hangzhou, China
Bio: Fei Wu received his B.Sc., M.Sc. and Ph.D. degrees in computer science from Lanzhou University, University of Macau and Zhejiang University in 1996, 1999 and 2002, respectively. From October 2009 to August 2010, Fei Wu was a visiting scholar in Prof. Bin Yu’s group, University of California, Berkeley. Currently, he is a Qiushi distinguished professor at the College of Computer Science, Zhejiang University. He is the vice-dean of the College of Computer Science, and the director of the Institute of Artificial Intelligence of Zhejiang University. He has been the chairman of the IEEE CAS Hangzhou Chapter since October 2018. He is currently an Associate Editor of Multimedia Systems and an editorial board member of Frontiers of Information Technology & Electronic Engineering. He has won various honors such as the Award of National Science Fund for Distinguished Young Scholars of China (2016). His research interests mainly include Artificial Intelligence, Multimedia Analysis and Retrieval, and Machine Learning.
Monday, July 8, 2019
T-02: Human Behavior Understanding: From Human-Oriented Analysis to Action Recognition
Time: 13:30 - 17:00
Room: 3E
Speakers: Ting Yao JD AI Research, Beijing, China Wu Liu JD AI Research, Beijing, China
Abstract
Analyzing human behaviour in videos is one of the fundamental problems of computer vision and multimedia understanding. The task is very challenging as video is an information-intensive media with large variations and complexities in content. With the development of deep learning techniques, researchers have strived to push the limits of human behaviour understanding in a wide variety of applications from action recognition to event detection. This tutorial will present recent advances under the umbrella of human behavior understanding, which range from the fundamental problem of how to learn “good” video representations, to the challenges of categorizing video content into human action classes, finally to multimedia event detection and surveillance event detection in complex scenarios.
Speakers
Ting Yao JD AI Research, Beijing, China
Bio: Ting Yao is currently a Principal Researcher in Vision and Multimedia Lab at JD AI Research, Beijing, China. His research interests include video understanding, large-scale multimedia search and deep learning. Prior to joining JD AI Research, he was a Researcher with Microsoft Research Asia in Beijing, China. Ting is an active participant of several benchmark evaluations. He is the principal designer of several top-performing multimedia analytic systems in worldwide competitions such as COCO Image Captioning, Visual Domain Adaptation Challenge 2017, ActivityNet Large Scale Activity Recognition Challenge 2018, 2017 and 2016, THUMOS Action Recognition Challenge 2015, and MSR-Bing Image Retrieval Challenge 2014 and 2013. He is one of the organizers of the MSR Video to Language Challenge 2017 and 2016. For his contributions to Multimedia Search by Self, External and Crowdsourcing Knowledge, he was awarded the 2015 SIGMM Outstanding Ph.D. Thesis Award.
Wu Liu JD AI Research, Beijing, China
Bio: Wu Liu is a Senior Researcher at JD AI Research, China. He received his Ph.D. degree from the Institute of Computing Technology, Chinese Academy of Sciences in 2015. His current research interests include video analytics, human behavior analysis, and intelligent video surveillance. He has published more than 30 papers in prestigious conferences and journals in computer vision and multimedia, including CVPR, ACM MM, IJCAI, AAAI, UBICOMP, IEEE T-MM, and IEEE T-CYB. He received the Chinese Academy of Sciences Outstanding Ph.D. Thesis Award in 2016, Best Student Paper Awards at ICME in 2016, and the Dean's Special Award of the Chinese Academy of Sciences in 2015. He is also a founding member of ACM FCA, a guest editor of MTAP and MVA, and the Web Chair of ICME 2019.
Monday, July 8, 2019
T-03: Intelligent Image Enhancement and Restoration - From Prior Driven Model to Advanced Deep Learning
Time: 8:30 AM - 12:00 PM
Room: 3G
Speakers: Jiaying Liu Peking University, Beijing, China Wenhan Yang National University of Singapore, Singapore Chen Change Loy Nanyang Technological University, Singapore
Abstract
Intelligent image/video editing is a fundamental topic in image processing which has witnessed rapid progress in the last two decades. Due to various degradations in image and video capturing, transmission and storage, images and videos include many undesirable effects, such as low resolution, low light conditions, and rain streak and raindrop occlusions. Recovering from these degradations is ill-posed. With the wealth of statistics-based and learning-based methods, this problem can be unified into cross-domain transfer, which covers more tasks, such as image stylization.
In our tutorial, we will discuss recent progress in image stylization, rain streak/drop removal, image/video super-resolution, and low light image enhancement. This tutorial covers both traditional statistics-based and deep-learning-based methods, and contains both biologically driven models, i.e., the Retinex model, and data-driven models. An image processing viewpoint that considers the popular deep networks as a traditional Maximum-a-Posteriori (MAP) estimation is provided. The side priors, designed by researchers and learned by multi-task learning, and automatically learned priors, captured by adversarial learning, are two important kinds of priors in this framework. Three works under this framework, covering single image super-resolution, low light image enhancement, and single image raindrop removal, are presented.
Single image super-resolution is a classical problem in computer vision. It aims at recovering a high-resolution image from a single low-resolution image. This is an underdetermined inverse problem whose solution is not unique. In this tutorial, we will discuss how we can solve the problem by deep convolutional networks in a data-driven manner. We will review different model variants and important techniques such as adversarial learning for image super-resolution.
We will then discuss recent work on hallucinating faces of unconstrained poses and with very low resolution. Finally, the tutorial will discuss challenges of implementing image super-resolution in real-world scenarios.
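For readers unfamiliar with the MAP viewpoint mentioned in this abstract, the textbook form of MAP estimation for image restoration is sketched below. The symbols are assumptions introduced here for illustration (y for the degraded observation, x for the clean image, \hat{x} for the estimate); the tutorial's own notation may differ.

```latex
\hat{x} = \arg\max_{x} \, p(x \mid y)
        = \arg\max_{x} \, p(y \mid x)\, p(x)
        = \arg\min_{x} \, \bigl[ -\log p(y \mid x) - \log p(x) \bigr]
```

The first term measures fidelity to the observed data. The second is the prior, which is where a designed side prior or an adversarially learned prior would enter in such a framework.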
Speakers
Jiaying Liu Peking University, Beijing, China
Bio: Jiaying Liu is currently an Associate Professor with the Institute of Computer Science and Technology, Peking University. She received the Ph.D. degree (Hons.) in computer science from Peking University, Beijing, China, in 2010. She has authored over 100 technical articles in refereed journals and proceedings, and holds 34 granted patents. Her current research interests include multimedia signal processing, compression, and computer vision. Dr. Liu is a Senior Member of IEEE and CCF. She was a Visiting Scholar with the University of Southern California, Los Angeles, from 2007 to 2008. She was a Visiting Researcher with Microsoft Research Asia in 2015, supported by the Star Track Young Faculties Award. She has served as a member of the Multimedia Systems & Applications Technical Committee (MSA-TC), the Visual Signal Processing and Communications Technical Committee (VSPC) and the Education and Outreach Technical Committee (EO-TC) in the IEEE Circuits and Systems Society, and as a member of the Image, Video, and Multimedia (IVM) Technical Committee in APSIPA. She has also served as the Technical Program Chair of IEEE VCIP-2019/ACM ICMR-2021, the Publicity Chair of IEEE ICIP-2019/VCIP-2018, the Grand Challenge Chair of IEEE ICME-2019, and the Area Chair of ICCV-2019. She was the APSIPA Distinguished Lecturer (2016-2017). In addition, Dr. Liu also devotes herself to teaching. She has run MOOC Programming Courses via Coursera/edX/ChineseMOOCs, which have enrolled more than 60 thousand students. She is also the organizer of the first Chinese MOOC Specialization in Computer Science. She is the youngest recipient of the Peking University Outstanding Teaching Award.
Wenhan Yang National University of Singapore, Singapore
Bio: Wenhan Yang is a postdoctoral research fellow with the Department of Computer Science, City University of Hong Kong. He received the B.S. degree and Ph.D. degree (Hons.) in computer science from Peking University, Beijing, China, in 2012 and 2018, respectively. Dr. Yang was a Visiting Scholar with the National University of Singapore from 2015 to 2016. He has authored over 30 technical articles in refereed journals and proceedings. His current research interests include deep-learning-based image processing, bad weather restoration, and related applications and theories.
Chen Change Loy Nanyang Technological University, Singapore
Bio: Chen Change Loy is a Nanyang Associate Professor with the School of Computer Science and Engineering, Nanyang Technological University, Singapore. He is also an Adjunct Associate Professor at the Chinese University of Hong Kong. He received his PhD (2010) in Computer Science from the Queen Mary University of London. Prior to joining NTU, he served as a Research Assistant Professor at the MMLab of the Chinese University of Hong Kong, from 2013 to 2018. He is the recipient of the 2019 Nanyang Associate Professorship (Early Career Award) from Nanyang Technological University. His research interests include computer vision and deep learning, with a focus on face analysis, image processing, and visual surveillance. He has published more than 100 papers in top journals and conferences of computer vision and machine learning. He and his team proposed a number of important methods for image super-resolution including SRCNN, SFTGAN and ESRGAN. His co-authored journal paper on SRCNN was selected as the 'Most Popular Article' by IEEE Transactions on Pattern Analysis and Machine Intelligence in 2016. It remains one of the top 10 articles to date. ESRGAN has been widely used to remaster various classic games such as Half-Life, Resident Evil 2, Morrowind, and Final Fantasy 7. He serves as an Associate Editor of the International Journal of Computer Vision (IJCV) and IET Computer Vision Journal. He also serves/served as the Area Chair of CVPR 2019, BMVC 2019, ECCV 2018, and BMVC 2018. He is a senior member of IEEE.
Monday, July 8, 2019
T-04: Visual Search and Question Answering
Time: 13:30 - 17:00
Room: 3G
Speakers: Lu Jiang Google Cloud AI, Sunnyvale, CA, USA Liangliang Cao University of Massachusetts, Amherst, MA, USA Yannis Kalantidis Facebook AI, California
Abstract
Personal photo and video data are being accumulated at an unprecedented speed. For example, 14 petabytes of personal photos and videos were uploaded to Google Photos by 200 million users in 2015, while a tremendous amount of personal photos and videos are also being uploaded to Flickr every day. How to efficiently search and organize such data presents a huge challenge to both academic research and industrial applications.
To attack this challenge, this tutorial will review the research efforts in related subjects and showcase successful industrial systems. We will discuss traditional visual search methods and the improvement of visual presentations brought by deep neural networks. The instructors will also share their experience of building large-scale fashion search and Flickr similarity search systems and bring insights on the challenges of extending academic research to industrial applications. This tutorial will discuss the queries and logs of search engines, and analyze how to address the characteristics of personal media search.
By applying search techniques to visual question answering, this tutorial will introduce a new task named MemexQA: given a collection of photos or videos from the user, can we automatically answer questions that help users recover their memory about events captured in the collection? New datasets and algorithms of MemexQA will be reviewed. We hope MemexQA will shed light on the next-generation computer interface for the exploding amount of personal photos and videos.
Speakers
Lu Jiang Google Cloud AI, Sunnyvale, CA, USA
Bio: Lu Jiang is a research scientist at Google Cloud AI, advised by Dr. Jia Li and Dr. Fei-Fei Li. He received his Ph.D. in Artificial Intelligence (Language Technology) from Carnegie Mellon University in 2017, advised by Dr. Alexander Hauptmann and Dr. Teruko Mitamura. Dr. Tat-Seng Chua and Dr. Louis-Philippe Morency are his thesis advisors. He was an intern scientist at Yahoo Research during the summer of 2016, working with Dr. Liangliang Cao, Dr. Yannis Kalantidis and Sachin Farfade on personal photo and video search on Flickr. Prior to that, he was an intern at Google Research working with Dr. Paul Natsev and Dr. Balakrishnan Varadarajan on large-scale deep learning on the noisy YouTube-8M dataset. He interned at Microsoft Research Asia in 2010, working with Dr. Qiang Wang and Dr. Dongmei Zhang on data mining. Before that, he was a research assistant at Xi’an Jiaotong University, supervised by Dr. Jun Liu on text mining and information retrieval. Lu’s primary interests lie in the interdisciplinary field of Multimedia, Machine Learning, Computer Vision, and Information Retrieval, specifically including video understanding and search, weakly supervised learning, deep learning, and cloud machine learning. He regularly serves on the programme committee of premier conferences such as ACM Multimedia, AAAI, and IJCAI. Lu is a recipient of the Yahoo Fellowship and an Erasmus Mundus Scholar. He received the best poster award at IEEE Spoken Language Technology and a best paper nomination at the ACM International Conference on Multimedia Retrieval.
Liangliang Cao University of Massachusetts, Amherst, MA, USA
Bio: Liangliang Cao is a Staff Research Scientist at Google. He is also affiliated with UMass CICS as a research associate professor. His research interests include AI and large-scale data learning, spanning computer vision, language, and speech. Before joining Google, he was a co-founder of HelloVera, and earlier a senior scientist at Yahoo Labs and a research staff member at IBM Watson Research Center. He is an associate editor of the Visual Computer and JVIS. He won 1st place in the ImageNet LSVRC Challenge in 2010. He is a recipient of the ACM SIGMM Rising Star Award.
Yannis Kalantidis Facebook AI, California
Bio: Yannis Kalantidis is a research scientist at Facebook AI in Menlo Park, California. He grew up in Athens, Greece and lived there until 2015, with brief breaks in Sweden, Spain and the United States. He received his PhD on large-scale search and clustering from the National Technical University of Athens in 2014. He was a postdoc and research scientist at Yahoo Research in San Francisco for two years, leading the visual similarity search project at Flickr and participating in the Visual Genome dataset efforts with Stanford. He is currently conducting research on video understanding, representation learning and modeling of vision and language.
Monday, July 8, 2019
T-05: Object Detection Beyond Mask R-CNN and RetinaNet
Time: 8:30 AM - 12:00 PM
Room: 5A
Speakers: Gang Yu Face++, Beijing, China Yichen Wei Face++, Beijing, China Xiangyu Zhang Face++, Beijing, China
Abstract
Object detection is a fundamental problem in the computer vision community with numerous applications. Recently, with the development of Mask R-CNN and RetinaNet, the object detection pipeline seems to have matured. However, the performance of current state-of-the-art object detection is still far from the requirements of visual applications. In this tutorial, we will delve into the details of object detection and present improvements from five aspects: backbone, head, scale, batch size, and post-processing.
For the backbone, we will discuss a novel network called DetNet, which is specifically designed for the detection task. DetNet preserves the spatial information of the network structure compared with a traditional ImageNet-pretrained backbone. For the head design, we will introduce Light-Head R-CNN for fast inference speed; moreover, a novel Localization Sensitive Head (LSH) will be discussed, which decouples the classification and regression tasks into two branches. For the scale issue, we present a novel algorithm called SFace, which addresses the large scale variation problem in face detection. Also, a large-batch-size detector will be discussed to significantly reduce model training time. Besides, a new dataset called CrowdHuman will be discussed to address the NMS issue during the post-processing stage.
Speakers
Gang Yu Face++, Beijing, China
Bio: Gang Yu is the team leader for detection at MEGVII. He graduated from Nanyang Technological University, Singapore, in 2014. He then joined MEGVII, and his research interests focus on computer vision and machine learning, including object detection, segmentation, skeleton, and human action analysis. Gang Yu won the COCO 2017 and COCO 2018 detection and keypoint challenges.
Yichen Wei Face++, Beijing, China
Bio: Dr. Wei joined Megvii in July 2018. Before that, he spent 12 years in the Visual Computing group at Microsoft Research Asia. He received his Ph.D. degree from Hong Kong University of Science and Technology in 2006, and his B.S. degree from Peking University in 2001.
Dr. Wei’s research interests include 3D vision, object recognition, detection, tracking and pose estimation. His work has about 5,700 Google Scholar citations, with an h-index of 31. His work has been transferred to Kinect Identity in Xbox, Windows Hello, Microsoft Cognitive Service, Bing, Office, and Microsoft XiaoIce, etc.
Xiangyu Zhang Face++, Beijing, China
Bio: Xiangyu Zhang is currently the team leader of the base model group at MEGVII Research. He received his doctoral degree from Xi’an Jiaotong University in 2017 and then joined MEGVII Technology. His research interest mainly focuses on deep learning models for computer vision, including CNN architecture design, network pruning and acceleration, neural architecture search and object detection/segmentation. He received the CVPR 2016 Best Paper Award and won a series of vision competitions such as ILSVRC 2015 and COCO 2015/2017/2018. The total number of his Google Scholar citations is over 30,000.
Friday, July 12, 2019
T-06: Computer Vision for Transportation
Time: 13:30 - 17:00
Room: 5A
Speakers: Haifeng Shen AI Labs, Didi Chuxing, China Zhengping Che AI Labs, Didi Chuxing, China Guangyu Li AI Labs, Didi Chuxing, China Yuhong Guo AI Labs, Didi Chuxing & Carleton University, China Jieping Ye Didi Chuxing & University of Michigan, Ann Arbor
Abstract
Computer vision in transportation has recently received increasing attention from both industry and academia due to the popularity of modern mobile transportation platforms and the rapid development of autonomous driving. In this tutorial, we systematically introduce recent progress in computer vision techniques and their applications in transportation. Specifically, we will provide a general overview of the key problems, common formulations, existing methodologies and future directions. This tutorial will inspire the audience and facilitate research in computer vision for transportation. The tutorial mainly consists of three parts:
Lecture 1: Challenges using object recognition, optical character recognition and face recognition in transportation.
• Recent progress in object recognition, optical character recognition and face recognition technologies.
• The difficulties and problems when applying these technologies in transportation.
• The solutions and applications.
Lecture 2: Towards Driving Scenario Understanding.
• Object detection, tracking, and segmentation in driving scenarios.
• Vision-based 3D reconstruction of driving scenarios.
• Driving behavior modeling and safety risk analysis.
Lecture 3: Applying transfer learning in CV.
• An introduction to recent transfer learning technologies.
• Applications of transfer learning technologies in CV.
Speakers
Haifeng Shen AI Labs, Didi Chuxing, China
Bio: Haifeng Shen is a senior expert algorithm engineer at Didi Chuxing and leads the computer vision group in the AI Labs. He received his Ph.D. degree in signal and information processing from Beijing University of Posts and Telecommunications in 2006. He has worked at Panasonic, Baidu, and Microsoft. He built the first speech recognition interface for the XiaoIce chatbot at Microsoft, and his current work focuses on computer vision in transportation. His research interests include computer vision, speech recognition and natural language processing.
53 IEEE ICME2019
Zhengping Che AI Labs, Didi Chuxing, China
Bio: Zhengping Che is a senior research scientist at DiDi AI Labs. He received his Ph.D. in Computer Science from the University of Southern California. Before that, he received his B.E. in Computer Science from Pilot CS Class (Yao Class), Tsinghua University. His current research interests lie in machine learning, deep learning and data mining with applications to temporal data and vision data. He has published several papers in ICML, KDD, ICDM, AMIA and other venues and interned at DiDi AI Labs, Mayo Clinic, IBM Research, Google and Hulu.
Guangyu Li AI Labs, Didi Chuxing, China
Bio: Guangyu Li is a senior research scientist at DiDi AI Labs. In this role, he works on intelligent vehicles, including autonomous driving, intelligent cockpits, and IoT systems. Before that, he developed perception algorithms for self-driving trucks at TuSimple, an autonomous truck unicorn. Besides his industrial experience, he is also a PhD candidate at the University of Southern California. His research interests lie in computer vision, large-scale sensor systems, and virtual/augmented/mixed reality with a focus on their applications in modern intelligent transportation.
Yuhong Guo AI Labs, Didi Chuxing & Carleton University, China
Bio: Yuhong Guo is a principal research scientist at Didi Chuxing. She is also an associate professor at Carleton University, a faculty affiliate of the Vector Institute, and a Canada Research Chair in Machine Learning. She received her PhD from the University of Alberta, and has previously worked at the Australian National University and Temple University. Her research interests include machine learning, artificial intelligence, computer vision, and natural language processing. She has won paper awards from both IJCAI and AAAI. She has served on the Senior Program Committees of AAAI, IJCAI and ACML, and is currently serving as an Associate Editor for TPAMI.
Jieping Ye Didi Chuxing, China & University of Michigan, Ann Arbor, USA
Bio: Jieping Ye is Head of DiDi AI Labs, a VP of Didi Chuxing and a DiDi Fellow. He is also a Professor at the University of Michigan, Ann Arbor. His research interests include data mining and machine learning with applications in transportation and biomedicine. He has served as a Senior Program Committee/Area Chair/Program Committee Vice Chair of many conferences including NIPS, ICML, KDD, IJCAI, AAAI, ICDM, and SDM. He serves as an Associate Editor of Data Mining and Knowledge Discovery and IEEE Transactions on Knowledge and Data Engineering. He won the NSF CAREER Award in 2010. His papers have been selected for the outstanding student paper at ICML in 2004, the KDD best research paper runner up in 2013, and the KDD best student paper award in 2014.
Friday, July 12, 2019
T-07: Causally Regularized Machine Learning
Time: 8:30 AM - 12:00 PM
Room: 5BC
Speakers: Peng Cui Tsinghua University, Beijing, China
Kun Kuang Tsinghua University, Beijing, China
Bo Li Tsinghua University, Beijing, China
Abstract
Owing to the popularity of Big Data, abundant multimedia data have accumulated in various domains. At the same time, many machine learning methods have been proposed to exploit these data for prediction. These methods have proved successful in prediction-oriented applications. However, the lack of interpretability of most predictive algorithms makes them less attractive in many settings, especially those requiring decision making. How to improve the interpretability of learning algorithms is therefore of paramount importance for both academic research and real applications. Causal inference, which refers to the process of drawing a conclusion about a causal connection based on the conditions of the occurrence of an effect, is a powerful statistical modeling tool for explanatory analysis. In this tutorial, we focus on causally regularized machine learning, aiming to explore causal knowledge from observational data to improve the explainability and stability of machine learning algorithms. First, we will give some examples of how machine learning algorithms today focus on correlation analysis and prediction, and why those methods are insufficient for decision making. Then, we will give an introduction to causal inference and introduce some recent data-driven approaches to explore causal knowledge from observational data, especially in high-dimensional settings. Aiming to bridge the gap between causal inference and machine learning, we will introduce some recent causally regularized machine learning algorithms for improving the stability and interpretability of prediction on multimedia data. Finally, we will discuss future directions and open research challenges in machine learning with causal inference.
Speakers
Peng Cui Tsinghua University, Beijing, China
Bio: Peng Cui is an Associate Professor at Tsinghua University. He received his PhD degree from Tsinghua University in 2010. His research interests include network representation learning, social dynamics modeling and human behavior modeling. He has published more than 60 papers in prestigious conferences and journals in data mining and multimedia. His recent research won the ICDM 2015 Best Student Paper Award, SIGKDD 2014 Best Paper Finalist, IEEE ICME 2014 Best Paper Award, ACM MM12 Grand Challenge Multimodal Award, and MMM13 Best Paper Award. He has served as Area Chair of ICDM 2016, ACM MM 2014-2015, IEEE ICME 2014-2015 and ICASSP 2013, and is an Associate Editor of IEEE TKDE, ACM TOMM and the Elsevier Journal on Neurocomputing. He was the recipient of the ACM China Rising Star Award in 2015.
Kun Kuang Tsinghua University, Beijing, China
Bio: Kun Kuang received the B.E. degree from the Department of Computer Science and Technology of Beijing Institute of Technology in 2014. He is a fifth-year Ph.D. candidate in the Department of Computer Science and Technology of Tsinghua University. His main research interests include data mining, high-dimensional inference and data-driven causal models. He has published several papers on data-driven causal inference and high-dimensional inference in top data mining and machine learning conferences and journals such as SIGKDD, AAAI, and ICDM.
Bo Li Tsinghua University, Beijing, China
Bio: Bo Li received a Ph.D. degree in Statistics from the University of California, Berkeley, and a bachelor’s degree in Mathematics from Peking University. He is an Associate Professor at the School of Economics and Management, Tsinghua University. His research interests are statistical methods for high-dimensional data, statistical causal inference and data-driven decision making. He has published widely in academic journals across a range of fields including statistics, management science and economics.
Friday, July 12, 2019
T-08: Architecture Design for Deep Neural Networks
Time: 8:30 AM - 12:00 PM
Room: 5DE
Speakers: Gao Huang Tsinghua University, Beijing, China
Jingdong Wang MSRA, Beijing, China
Lingxi Xie Huawei Inc., Beijing, China
Abstract
Recent years have witnessed great success in the deployment of deep learning for various tasks. Neural architecture innovation plays an important role in advancing this research direction. From AlexNet and VGG to ResNet and DenseNet, better architecture design has pushed the depth limit of deep models from 7 layers to over one thousand layers. The unprecedented depth endows neural networks with strong representation power. This tutorial will review classical convolutional network architectures, discuss their underlying design principles, and analyze their strengths and weaknesses. In particular, we will address the recent trend of developing highly efficient lightweight deep models for practical applications with limited computational resources, e.g., mobile phones and wearable devices. Besides hand-designed structures that incorporate human intuition, neural architectures obtained via automatic search have gained great popularity in the past two years. This newly emerged research direction, usually referred to as AutoML, will also be covered in this tutorial.
Speakers
Gao Huang Tsinghua University, Beijing, China
Bio: Dr. Gao Huang is an Assistant Professor with the Department of Automation at Tsinghua University. Previously, he was a postdoc with the Department of Computer Science at Cornell University. His research interests lie in machine learning and computer vision, with a special focus on deep learning. He has authored more than 30 papers, which have collected more than 6000 citations. He is a recipient of the CVPR Best Paper Award (DenseNet), CAA Doctoral Dissertation Award and the Super AI Leader - Pioneer Award.
Jingdong Wang MSRA, Beijing, China
Bio: Jingdong Wang is a Senior Researcher with the Visual Computing Group, Microsoft Research, Beijing, China. His areas of current interest include CNN architecture design, human pose estimation, semantic segmentation, person re-identification, large-scale indexing, and salient object detection. He has authored one book and 100+ papers in top conferences and prestigious international journals in computer vision, multimedia, and machine learning. He authored a comprehensive survey on learning to hash in TPAMI. His paper was selected as a Best Paper Finalist at ACM MM 2015. Dr. Wang is an Associate Editor of IEEE TPAMI, IEEE TCSVT and IEEE TMM. He was an Area Chair or a Senior Program Committee Member of top conferences, such as CVPR, ICCV, ECCV, AAAI, IJCAI, and ACM Multimedia. He is an ACM Distinguished Member and a Fellow of the IAPR. His homepage is https://jingdongwang2017.github.io/.
His representative works include deep high-resolution representation learning (HRNet), interleaved group convolutions, supervised saliency detection (discriminative regional feature integration, DRFI), neighborhood graph search (NGS) for large-scale similarity search, composite quantization for compact coding, the Market-1501 dataset for person re-identification, and so on. He has shipped a dozen technologies to Microsoft products, including Bing search, Bing Ads, Cognitive service, and XiaoIce Chatbot. His NGS algorithm is a foundational element of many products. He developed the Bing image search color filter using his efficient salient object algorithm. He also developed the first commercial color-sketch image search system.
Lingxi Xie Huawei Inc., Beijing, China
Bio: Lingxi Xie is currently a senior researcher at Noah’s Ark Lab, Huawei Inc. He obtained B.E. and Ph.D. in engineering, both from Tsinghua University, in 2010 and 2015, respectively. He also served as a post-doctoral researcher at the CCVL lab from 2015 to 2019, having moved from the University of California, Los Angeles to the Johns Hopkins University. His homepage is http://lingxixie.com/. Lingxi’s research interests lie in computer vision, in particular the application of deep learning models. His research covers image classification, object detection, semantic segmentation and other vision tasks. He is also interested in medical image analysis, especially object segmentation in CT or MRI scans. Lingxi has published over 40 papers in top-tier international conferences and journals. In 2015, he received the outstanding Ph.D. thesis award from Tsinghua University. He is also the winner of the best paper award at ICMR 2015.
Friday, July 12, 2019
T-09: Intelligent Multimedia Recommendation
Time: 1:30 PM - 6:00 PM
Room: 5DE
Speakers: Jialie Shen Queen’s University Belfast, Belfast, United Kingdom
Jian Zhang University of Technology Sydney, Sydney, Australia
Abstract
Due to the rapid growth of multimedia big data and related novel applications, intelligent recommendation systems have become more and more important in our daily life. During the last decades, various multimedia technologies have been developed by different research communities (e.g., multimedia systems, information retrieval, and machine learning). Meanwhile, recommendation techniques have been successfully leveraged by commercial systems (e.g., Amazon, YouTube and Spotify) to help general users deal with information overload and to provide them with high-quality content, interactions and services. While several tutorials and courses were dedicated to multimedia recommendation in the last few years, to the best of our knowledge, this tutorial is an advanced and comprehensive one focusing on intelligent content analytics and its core applications in recommending various types of media content. We plan to summarize the research along this direction and provide a good balance between theoretical methodologies and real system development (including several industrial approaches). Core contributions to the literature largely include:
• Introducing why advanced recommendation systems are important for Web-scale multimedia retrieval, understanding and sharing.
• Examining current commercial systems and research prototypes, focusing on comparing the advantages and the disadvantages of the various strategies and schemes for different types of media documents (e.g., image, video, audio and text) and their composition.
• Reviewing key challenges and technical issues in building and evaluating modern recommendation systems under different contexts.
• Discussing and reviewing various limitations of the current generation of systems.
• Making predictions about the road that lies ahead for scholarly exploration and industrial practice in multimedia and other related communities.
We also plan to have an open discussion in this tutorial on several promising research directions with significant technical importance and to explore potential solutions. Thus, we hope that this tutorial provides an impetus for further research in this important direction.
Speakers
Jialie Shen Queen’s University Belfast, Belfast, UK
Bio: Dr. Jialie Shen is a Reader in Computer Science, School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast (QUB), Belfast, United Kingdom. He received his PhD in Computer Science from the University of New South Wales (UNSW), Australia in the area of large-scale media retrieval and database access methods. Dr. Shen worked as a faculty member in Hong Kong, Singapore, Australia and England, and as a researcher in the information retrieval research group (led by Professor Keith van Rijsbergen) at the University of Glasgow, Scotland, before moving to QUB. Dr. Shen’s main research interests include information retrieval, machine learning, multimedia systems and audio/video analytics. His research has been published or is forthcoming in leading journals and international conferences, including ACM SIGIR, ACM Multimedia, IJCAI, AAAI, IEEE Transactions and ACM Transactions.
Jian Zhang University of Technology Sydney, Sydney, Australia
Bio: Dr. Jian Zhang is an Associate Professor in the School of Electrical & Data Engineering, University of Technology Sydney, Australia. He received a PhD in electrical engineering from the University of New South Wales (UNSW), Sydney, Australia in the area of image processing and video communication. From 1997 to 2003, he was with the Visual Information Processing Laboratory, Motorola Labs, Sydney, as a Principal Research Engineer and Research Manager of Visual Communications. From 2004 to July 2011, he was a Principal Researcher and a Project Leader with Data61 (formerly NICTA) Australia and a Conjoint Associate Professor with the School of Computer Science and Engineering, UNSW. He is currently an Associate Professor with the Global Big Data Technologies Centre, School of Electrical and Data Engineering, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney. He is the author or co-author of more than 150 paper publications, book chapters, and six issued US and China patents. His current research interests include social multimedia signal processing, large-scale image and video content analytics, retrieval and mining, 3D-based computer vision and intelligent video surveillance systems. Dr. Zhang was the General Co-Chair of the International Conference on Multimedia and Expo in 2012 and Technical Program Co-Chair of IEEE Visual Communications and Image Processing 2014. Currently, he is an Associate Editor for the IEEE TRANSACTIONS ON MULTIMEDIA and the EURASIP Journal on Image and Video Processing (2016 – now). He was an Associate Editor for the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2006 – 2015).
Oral Sessions
Tuesday, July 9, 2019
Best Paper Session
Time: 10:00 - 11:00 AM
Room: 3CD
Chair: Yonggang Wen Nanyang Technological University, Singapore
Marta Mrak BBC, UK
10:00 AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWLEDGE DISTILLATION
Yu Hao1, Yanwei Fu1, Yu-Gang Jiang1,2, Qi Tian3
1Fudan University, China, 2Jilian Technology Group (Video++), China, 3Huawei Noah’s Ark Lab, China
10:15 REAL-TIME INDOOR SCENE RECONSTRUCTION WITH RGBD AND INERTIAL INPUT
Zunjie Zhu1, Feng Xu2, Chenggang Yan1, Xinhong Hao3, Xiangyang Ji2, Yongdong Zhang4, Qionghai Dai2
1Hangzhou Dianzi University, China, 2Tsinghua University, China, 3Beijing Institute of Technology, China, 4University of Science and Technology of China, China
10:30 DOUBLY SEMI-SUPERVISED MULTIMODAL ADVERSARIAL LEARNING FOR CLASSIFICATION, GENERATION AND RETRIEVAL
Changde Du1, Changying Du2, Huiguang He1
1Institute of Automation Chinese Academy of Sciences, China, 2Qihoo 360 Search Lab, China
10:45 TOWARDS DIGITAL RETINA IN SMART CITIES: A MODEL GENERATION, UTILIZATION AND COMMUNICATION PARADIGM
Yihang Lou1,4, Ling-Yu Duan1,4, Yong Luo1,4, Ziqian Chen1,4, Tongliang Liu2, Shiqi Wang3, Wen Gao1,4
1Peking University, China, 2University of Sydney, Australia, 3City University of Hong Kong, China, 4The Peng Cheng Laboratory, Shenzhen, China
Tuesday, July 9, 2019
O-01: Content Recommendation and Cross-modal Hashing
Time: 14:00 - 15:00 PM
Room: 3CD
Chair: Chengcui Zhang University of Alabama at Birmingham, USA
14:00 SDP: AN IMPROVED BASELINE ESTIMATION MODEL BASED ON STANDARD DEVIATION PROPORTION
Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang
Northeastern University, China
14:15 CITATION RECOMMENDATION BASED ON WEIGHTED HETEROGENEOUS INFORMATION NETWORK CONTAINING SEMANTIC LINKING
Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang
Anhui University, China
14:30 FUSION-SUPERVISED DEEP CROSS-MODAL HASHING
Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang
Shandong Normal University, China
14:45 DOMAIN UNCERTAINTY BASED ON INFORMATION THEORY FOR CROSS-MODAL HASH RETRIEVAL
Wei Chen1, Nan Pu1, Yu Liu2, Erwin M. Bakker1, Michael S. Lew1
1Leiden University, Holland, 2ESAT-PSI, KU Leuven, Belgium
Tuesday, July 9, 2019
O-02: Development of Multimedia Standards and Related Research
Time: 14:00 - 15:00 PM
Room: 3HI
Chair: Cheolkon Jung Xidian University, China
14:00 ADAPTIVE PLANE PROJECTION FOR VIDEO-BASED POINT CLOUD CODING
Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira
Instituto Superior Técnico, Universidade de Lisboa - Instituto de Telecomunicações, Lisboa, Portugal
14:15 FAST CU PARTITIONING ALGORITHM FOR H.266/VVC INTRA-FRAME CODING
Ting Fu1, Hao Zhang1, Fan Mu1, Huanbang Chen2
1Central South University, China, 2Huawei Base, China
14:30 TWO-STAGE FAST MULTIPLE TRANSFORM SELECTION ALGORITHM FOR VVC INTRA CODING
Ting Fu1, Hao Zhang1, Fan Mu1, Huanbang Chen2
1Central South University, China, 2Huawei Base, China
14:45 HISTORY-BASED MOTION VECTOR PREDICTION FOR FUTURE VIDEO CODING
Junru Li1, Meng Wang2, Li Zhang3, Kai Zhang3, Hongbin Liu3, Shiqi Wang2, Siwei Ma1, Wen Gao1
1Peking University, China, 2City University of Hong Kong, China, 3Bytedance Inc., San Diego, CA, USA
Tuesday, July 9, 2019
O-03: Classification and Low Shot Learning
Time: 14:00 - 15:00 PM
Room: 5BC
Chair: Yu-Gang Jiang Fudan University, China
14:00 AMS-SFE: TOWARDS AN ALIGNMENT OF MANIFOLD STRUCTURES VIA SEMANTIC FEATURE EXPANSION FOR ZERO-SHOT LEARNING
Jingcai Guo, Song Guo
The Hong Kong Polytechnic University, China
14:15 LOW-SHOT PALMPRINT RECOGNITION BASED ON META-SIAMESE NETWORK
Xuefeng Du1, Dexing Zhong1,2, Pengna Li1
1Xi’an Jiaotong University, China, 2Research Institute of Xi’an Jiaotong University, China
14:30 SR-GAN: SEMANTIC RECTIFYING GENERATIVE ADVERSARIAL NETWORK FOR ZERO-SHOT LEARNING
Zihan Ye1,5, Fan Lyu1,2, Linyan Li3, Qiming Fu1,6, Jinchang Ren4, Fuyuan Hu1,7
1Suzhou University, China, 2Tianjin University, China, 3Suzhou Institute of Trade & Commerce, China, 4University of Strathclyde, UK, 5Virtual Reality Key Laboratory of Intelligent Interaction and Application Technology of Suzhou, China, 6Key Laboratory of Intelligent Building Energy Efficiency, China, 7Suzhou Key Laboratory for Big Data and Information Service, China
14:45 COMPARE MORE NUANCED: PAIRWISE ALIGNMENT BILINEAR NETWORK FOR FEW-SHOT FINE-GRAINED LEARNING
Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu
University of Technology Sydney, Australia
Tuesday, July 9, 2019
O-04: 3D Media Computing
Time: 14:00 - 15:00 PM
Room: 5DE
Chair: Dan Zeng Shanghai University, China
14:00 FEATURE-AWARE AND CONTENT-WISE DENOISING OF 3D STATIC AND DYNAMIC MESHES USING DEEP AUTOENCODERS
Gerasimos Arvanitis1, Aris S. Lalos2, and Konstantinos Moustakas1
1University of Patras, Greece, 2“ATHENA” Research Center, Greece
14:15 REAL-TIME MONOCULAR VISUAL SLAM BY COMBINING POINTS AND LINES
Xinyu Wei, Jun Huang, Xiaoyuan Ma
Shanghai Advanced Research Institute, China
14:30 F-NUMBER ADAPTATION FOR MAXIMIZING THE SENSOR USAGE OF LIGHT FIELD CAMERAS
Chuanpu Li, Xin Jin, Junke Li and Qionghai Dai
Shenzhen Key Lab of Broadband Network and Multimedia, China
14:45 BLIND CALIBRATION FOR FOCUSED PLENOPTIC CAMERAS
Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen and Qionghai Dai
Shenzhen Key Lab of Broadband Network and Multimedia, China
Tuesday, July 9, 2019
O-05: Special Session "Pedestrian Detection, Tracking and Re-identification in Videos"
Time: 15:30 - 16:30 PM
Room: 3CD
Chair: Guiguang Ding Tsinghua University, China
Sicheng Zhao University of California, Berkeley, USA
Jungong Han Lancaster University, UK
15:30 PARTICLE SWARM LOSS FOR LIGHTWEIGHT OBJECT DETECTION
Peizhen Zhang1,2,4, Feng Zheng3, Junlong Du2, Jun Zhang2, Xiaowei Guo2, Wei-Shi Zheng1,4
1Sun Yat-sen University, China, 2Youtu Lab, Tencent, China, 3Southern University of Science and Technology, China, 4Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
15:45 INCORPORATING CATEGORY TAXONOMY IN DEEP REINFORCEMENT LEARNING BASED IMAGE HASHING
Qiang Fu1, Linsen Dong2, Ziyuan Liu2, Yong Luo2, Yonggang Wen2, Ying Li1, Ling-Yu Duan3
1Peking University, China, 2Nanyang Technological University, Singapore, 3Peking University, China
16:00 TRUNCATED GRADIENT CONFIDENCE-WEIGHTED BASED ONLINE LEARNING FOR IMBALANCE STREAMING DATA
Ji Hu1, Chenggang Yan1, Xing Liu1, Jiyong Zhang1, Dongliang Peng1, Yi Yang2
1HangZhou DianZi University, China, 2UTS, Australia
16:15 UAV TARGET TRACKING BY DETECTION VIA DEEP NEURAL NETWORKS
Mohamed A. Kassab1, Ali Maher2, Fathy Elkazzaz3, Zhang Baochang1,4
1Beihang University, China, 2Military Technical college, Egypt, 3Benha University, Egypt, 4Shenzhen Academy of Aerospace Technology, China
Tuesday, July 9, 2019
O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
Time: 15:30 - 16:30 PM
Room: 3HI
Chair: Wu Liu JD AI Research, China
Liang Zheng Australian National University, Australia
Yi Yang University of Technology Sydney, Australia
Lexing Xie Australian National University, Australia
15:30 QUARTER-POINT CODEWORD EXPANSION FOR PRODUCT QUANTIZATION
Shan An1,2, Zhibiao Huang1, Guangfu Che1, Xianglong Liu2, Xin Ma3, Yu Chen1
1Department of Data Intelligence, JD.com, China, 2Beihang University, China, 3Shandong University, China
15:45 CONTEXT-AWARE AFFECTIVE GRAPH REASONING FOR EMOTION RECOGNITION
Minghui Zhang, Yumeng Liang, Huadong Ma
Beijing University of Posts and Telecommunications, China
16:00 SPL: EXPLOITING UNLABELED DATA FOR MULTI-LABEL IMAGE CLASSIFICATION
Weibo Zhang1,2, Fuqing Zhu1, Jiao Dai1, Songlin Hu1, Jizhong Han1, Tao Guo1
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2School of Cyber Security, University of Chinese Academy of Sciences, China
16:15 MLTS: A MULTI-LANGUAGE SCENE TEXT SPOTTER
Yu Zhou1, Shancheng Fang2, Hongtao Xie1, Zheng-Jun Zha1, Yongdong Zhang1
1University of Science and Technology of China, China, 2Institute of Information Engineering, Chinese Academy of Sciences, China
Tuesday, July 9, 2019
O-07: 3D and Low Level Vision
Time: 15:30 - 16:30 PM
Room: 5BC
Chair: Wei Hu Peking University, China
15:30 UNSUPERVISED MONOCULAR DEPTH ESTIMATION BASED ON DUAL ATTENTION MECHANISM AND DEPTH-AWARE LOSS
Xinchen Ye1, Mingliang Zhang1,2, Rui Xu1, Wei Zhong1, Xin Fan1, Zhu Liu1, Jiaao Zhang1
1Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China, 2Dalian University of Technology of Liaoning Province, China
15:45 TOWARDS HIGH-QUALITY INTRINSIC IMAGES IN THE WILD
Gang Fu1, Qing Zhang2, Chunxia Xiao1
1Wuhan University, China, 2Sun Yat-sen University, China
16:00 UNSUPERVISED LEARNING FOR OPTICAL FLOW ESTIMATION USING PYRAMID CONVOLUTION LSTM
Shuosen Guan1,3, Haoxin Li2,3, Wei-Shi Zheng1,3
1School of Data and Computer Science, Sun Yat-sen University, China, 2School of Electronics and Information Technology, Sun Yat-sen University, China, 3Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
16:15 MAST: MASK-ACCELERATED SHEARLET TRANSFORM FOR DENSELY-SAMPLED LIGHT FIELD RECONSTRUCTION
Yuan Gao1, Robert Bregovic2, Atanas Gotchev2, Reinhard Koch1
1Kiel University, Germany, 2Tampere University, Finland
Tuesday, July 9, 2019
O-08: Object Detection I
Time: 15:30 - 16:30 PM
Room: 5DE
Chair: Wengang Zhou University of Science and Technology of China, China
15:30 CODA: COUNTING OBJECTS VIA SCALE-AWARE ADVERSARIAL DENSITY ADAPTION
Li Wang1, Yongbo Li2, Xiangyang Xue1
1Fudan University, China, 2Megvii Inc (Face++), China
15:45 PDNET: PRIOR-MODEL GUIDED DEPTH-ENHANCED NETWORK FOR SALIENT OBJECT
Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H Li, Ge Li
SECE, Shenzhen Graduate School, Peking University, China
16:00 CONTINUOUS SCALE ADAPTION FOR EFFICIENT BOX-BASED SCENE TEXT
Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong
Dalian University of Technology, China
16:15 MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK
Xiaobao Guo1, Jinxing Li2, Bingzhi Chen1, Guangming Lu1
1Harbin Institute of Technology (Shenzhen), China, 2The Chinese University of Hong Kong (Shenzhen), China
Tuesday, July 9, 2019
O-09: Emerging Applications of Deep Learning
Time: 16:45 - 17:45 PM
Room: 3CD
Chair: Aris Lalos Industrial System Institute, Greece
16:45 DMPR-PS: A NOVEL APPROACH FOR PARKING-SLOT DETECTION USING DIRECTIONAL MARKING-POINT REGRESSION
Junhao Huang1, Lin Zhang1, Ying Shen1, Huijuan Zhang1, Shengjie Zhao1, Yukai Yang2
1Tongji University, China, 2Uppsala University, Sweden
17:00 ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK-AWARE GATED DISCRIMINATOR
Yong-Xiang Lin1, Daniel Stanley Tan1, Wen-Huang Cheng2, Kai-Lung Hua1
1National Taiwan University of Science and Technology, Taiwan, 2National Chiao Tung University, Taiwan
17:15 STOCHASTIC VIDEO GENERATION WITH DISENTANGLED REPRESENTATIONS
Maomao Li1, Chun Yuan1, Zhihui Lin1,2, Zhuobin Zheng1,2, Yangyang Cheng1,2
1Graduate School at Shenzhen, Tsinghua University, China, 2Tsinghua University, China
17:30 Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION
Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, and Philip S. Yu
Tsinghua University, China
Tuesday, July 9, 2019
O-10: Multimedia Quality Assessment and Enhancement
Time: 16:45 - 17:45 PM
Room: 3HI
Chair: Chun-Shien Lu Academia Sinica, China
16:45 ENERGY-BASED RECURRENT MODEL FOR STOCHASTIC MODELING OF MUSIC
Yingru Liu1, Dongliang Xie2, Xin Wang1
1Stony Brook University, USA, 2Peking University, China
17:00 RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS
Huaixuan Zhang1, Yuhai Lan3, Tao Dai1,2, Ruizhi Qiao4, Ying Xu1, Yao Yao1, Shu-Tao Xia1,2
1Graduate School at Shenzhen, Tsinghua University, China, 2PCL Research Center of Networks and Communications, Peng Cheng Laboratory, China, 3Harbin Institute of Technology, China, 4Tencent Youtu Lab, China
17:15 RESIDUAL DILATED NETWORK WITH ATTENTION FOR IMAGE BLIND DENOISING
Guanqun Hou1, Yujiu Yang1, Jing-Hao Xue2
1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK
17:30 COLLABORATIVE DEEP REINFORCEMENT LEARNING FOR IMAGE CROPPING
Zhuopeng Li, Xiaoyan Zhang
Shenzhen University, China
Tuesday, July 9, 2019
O-11: Multimedia for Society and Health
Time: 16:45 - 17:45 PM
Room: 5BC
Chair: Wolfgang Hürst Utrecht University, Holland
16:45 SIMILARITY-AWARE DEEP ADVERSARIAL LEARNING FOR FACIAL AGE ESTIMATION
Penghui Sun, Hao Liu, Xing Wang, Zhenhua Yu, Suping Wu
Ningxia University, Yinchuan, China
17:00 LEARNING TRANSMISSION FILTERING NETWORK FOR IMAGE-BASED PM2.5 ESTIMATION
Yinghong Liao1, Bin Qiu1, Zhuo Su1, Ruomei Wang1, Xiangjian He2,3
1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia
17:15 VIDEO-BASED EARLY ASD DETECTION VIA TEMPORAL PYRAMID NETWORKS
Yuan Tian, Xiongkou Min, Guangtao Zhai, Zhiyong Gao
Shanghai Jiao Tong University, China
17:30 AUTOMATIC USER CATEGORIZATION THROUGH LARGE TRANSACTION DATA
Ying Zhang, YinJia Zhang, Qinpei Zhao, Weixiong Rao
Tongji University, China
Tuesday, July 9, 2019
O-12: Immersive Media
Time: 16:45 - 17:45 PM
Room: 5DE
Chair: Fernando Pereira Instituto Superior Técnico, Portugal
16:45 FEATURE PRESERVING AND UNIFORMITY-CONTROLLABLE POINT CLOUD SIMPLIFICATION ON GRAPH
Junkun Qi, Wei Hu, Zongming Guo
Peking University, China
17:00 360SRL: A SEQUENTIAL REINFORCEMENT LEARNING APPROACH FOR ABR TILE-BASED 360 VIDEO STREAMING
Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen
University of Science and Technology of China, China
17:15 CONTENT-AWARE PERSPECTIVE PROJECTION OPTIMIZATION FOR VIEWPORT RENDERING OF 360° IMAGES
Falah Jabar, Joao Ascenso, Maria Paula Queluz
Universidade de Lisboa, Portugal
17:30 AN AR BENCHMARK SYSTEM FOR INDOOR PLANAR OBJECT TRACKING
Ziming Wu1, Jiabin Guo2, Shuangli Zhang2, Chen Zhao2, Xiaojuan Ma1
1Hong Kong University of Science and Technology, China, 2Netease AR, China
Wednesday, July 10, 2019
O-13: 3D and Stereo Computing
Time: 14:00 - 15:00 PM
Room: 3CD
Chair: Joao Ascenso Instituto Superior Técnico, Portugal
14:00 GLOBAL AS-CONFORMAL-AS-POSSIBLE NON-RIGID REGISTRATION OF MULTI-VIEW SCANS
Zhenchao Wu1, Kun Li1, Yu-Kun Lai2, Jingyu Yang1
1Tianjin University, China, 2Cardiff University, UK
14:15 A LIGHT-WEIGHTED NETWORK FOR FACIAL LANDMARK DETECTION VIA COMBINED HEATMAP AND COORDINATE REGRESSION
Zhengning Wang1, Longfei Feng1, Fanwei Zeng1, Guang Hu1, Xiang Zhang1, Xia Lv1, Fengjun Zhang2
1University of Electronic Science and Technology of China, China, 2No.30 Institute of CETC, China
14:30 LIGHT WEIGHT STEREO MATCHING VIA DEEP EXTRACTION AND INTEGRATION OF LOW AND HIGH LEVEL INFORMATION
Xianzhe Xu1, Yonghong Hou1, Pichao Wang2, Zhongyu Jiang1, Wanqing Li3
1Tianjin University, China, 2Alibaba Group (U.S.) Inc., USA, 3University of Wollongong, Australia
14:45 JUSTLOOKUP: ONE MILLISECOND DEEP FEATURE EXTRACTION FOR POINT CLOUDS BY LOOKUP TABLES
Hongxin Lin1,2, Zelin Xiao1,2, Yang Tan1,2, Hongyang Chao1, Shengyong Ding1
1Sun Yat-sen University, China, 2Pixtalks Tech, China
Wednesday, July 10, 2019
O-14: Machine Learning Applications in Image and Video Coding I
Time: 14:00 - 15:00 PM
Room: 3HI
Chair: Frederic Dufaux Universite Paris-Saclay, France
14:00 MULTIPLE GRAPH CONVOLUTIONAL NETWORKS FOR CO-SALIENCY DETECTION
Bo Jiang1, Xingyue Jiang1, Jin Tang1, Bin Luo1, Shilei Huang2
1Anhui University, China, 2PKU-HKUST Shenzhen Hong Kong Institution, China
14:15 QUANNET: JOINT IMAGE COMPRESSION AND CLASSIFICATION OVER CHANNELS WITH LIMITED BANDWIDTH
Lahiru Dulanjana Chamain Hewa Gamage1, Sen-ching S Cheung2, Zhi Ding1
1University of California, Davis, USA, 2University of Kentucky, USA
14:30 HIGH EFFICIENCY LIGHT FIELD COMPRESSION VIA VIRTUAL REFERENCE AND HIERARCHICAL MV-HEVC
Jiawen Gu, Bichuang Guo, Jiangtao Wen
Tsinghua University, China
14:45 SELF-PACED SUBSPACE CLUSTERING
Youfa Liu, Bo Du, Lefei Zhang
Wuhan University, China
Wednesday, July 10, 2019
O-15: Vision, Language and Text Processing
Time: 14:00 - 15:00 PM
Room: 5BC
Chair: Jiande Sun Shandong University, China
14:00 COLLOQUIAL IMAGE CAPTIONING
Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji
Xiamen University, China
14:15 IMPROVING CAPTIONING FOR LOW-RESOURCE LANGUAGES BY CYCLE CONSISTENCY
Yike Wu1, Shiwan Zhao2, Jia Chen3, Ying Zhang1, Xiaojie Yuan1, Zhong Su2
1Nankai University, China, 2IBM Research, China, 3Carnegie Mellon University, USA
14:30 FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION
Zhuo Lei1,2, Chao Zhang1,2, Qian Zhang2, Guoping Qiu3,4
1International Doctoral Innovation Center, UK, 2The University of Nottingham Ningbo China, China, 3Shenzhen University, China, 4University of Nottingham, UK
14:45 CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENCED STYLE EMBEDDING
Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong
Wuhan University of Technology, China
Wednesday, July 10, 2019
O-16: Media Classification and Segmentation II
Time: 14:00 - 15:00 PM
Room: 5DE
Chair: Zhu Li University of Missouri, USA
14:00 QUERY-FREE EMBEDDING ATTACK AGAINST DEEP LEARNING
Yujia Liu, Weiming Zhang, Nenghai Yu
University of Science and Technology of China, China
14:15 GRAPH ATTENTION NEURAL NETWORKS FOR POINT CLOUD RECOGNITION
Zongmin Li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li
China University of Petroleum (East China), China
14:30 MAXIMAL CORRELATION EMBEDDING NETWORK FOR MULTILABEL LEARNING WITH MISSING LABELS
Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang
Tsinghua University, China
14:45 SELF-ADAPTION MULTI-CLASSIFIER FUSION NETWORKS FOR IMAGE RECOGNITION
Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang
Dalian University of Technology, China
Wednesday, July 10, 2019
O-17: AI for Human Understanding
Time: 15:30 - 16:30 PM
Room: 3CD
Chair: Bin Liu University of Science and Technology of China, China
15:30 VIDEO EMOTION RECOGNITION WITH CONCEPT SELECTION
Baohan Xu1, Yingbin Zheng2, Hao Ye2, Caili Wu3, Heng Wang1, Gufei Sun1
1Zhongan Technology, China, 2Videt Tech, USA, 3East China Normal University, China
15:45 GRAPH CONVOLUTIONAL LSTM MODEL FOR SKELETON-BASED ACTION RECOGNITION
Han Zhang, Yonghong Song, Yuanlin Zhang
Xi’an Jiaotong University, China
16:00 LEARNING RECURRENT STRUCTURE-GUIDED ATTENTION NETWORK FOR MULTI-PERSON POSE ESTIMATION
Zhongwei Qiu1, Kai Qiu2, Jianlong Fu2, Dongmei Fu1
1University of Science and Technology Beijing, China, 2Microsoft Research, China
16:15 PCPCAD: PROPOSAL COMPLEMENTARY ACTION DETECTOR
Zhenying Fang1, Suguo Zhu1, Jun Yu1, Qi Tian2,3
1Hangzhou Dianzi University, China, 2Huawei Noah’s Ark Lab, China, 3The University of Texas at San Antonio, USA
Wednesday, July 10, 2019
O-18: Image Quality Metrics
Time: 15:30 - 16:30 PM
Room: 3HI
Chair: Patrick Le Callet Universite de Nantes, France
15:30 PERSONALITY DRIVEN MULTI-TASK LEARNING FOR IMAGE AESTHETIC ASSESSMENT
Leida Li1,2, Hancheng Zhu2, Sicheng Zhao3, Guiguang Ding4, Hongyan Jiang2, Allen Tan5
1Xidian University, China, 2China University of Mining and Technology, China, 3University of California Berkeley, USA, 4Tsinghua University, China, 5Tencent, China
15:45 VIDEO QUALITY TEMPORAL POOLING USING A VISIBILITY MEASURE
Chen Bai, Amy R. Reibman
Purdue University, USA
16:00 IMAGE QUALITY ASSESSMENT OF MULTI-EXPOSURE IMAGE FUSION FOR BOTH STATIC AND DYNAMIC SCENES
Yuming Fang1, Yan Zeng1, Hanwei Zhu1, Guangtao Zhai2
1Jiangxi University of Finance and Economics, China, 2Shanghai Jiao Tong University, China
16:15 NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON LOCAL TO GLOBAL FEATURE REGRESSION
Sumei Li, Jianwei Xue, Yongtian Han
Tianjin University, China
Wednesday, July 10, 2019
O-19: Multimedia Recommendations
Time: 15:30 - 16:30 PM
Room: 5BC
Chair: Rui Wang Tongji University, China
15:30 HERDING EFFECT BASED ATTENTION FOR PERSONALIZED TIME-SYNC VIDEO RECOMMENDATION
Wenmian Yang1,2, Wenyuan Gao1, Xiaojie Zhou1, Weijia Jia1,2, Shaohua Zhang1,2, Yutao Luo1
1Shanghai Jiao Tong University, China, 2University of Macau, China
15:45 SEQUENTIAL BEHAVIOR MODELING FOR NEXT MICRO-VIDEO RECOMMENDATION WITH COLLABORATIVE TRANSFORMER
Shang Liu, Zhenzhong Chen
Wuhan University, China
16:00 BUTTONTIPS: DESIGNING WEB BUTTONS WITH SUGGESTIONS
Dawei Liu, Ying Cao, Rynson W.H. Lau, Antoni B. Chan
City University of Hong Kong, China
16:15 KNOWING USER BETTER: MICRO-VIDEO RECOMMENDER SYSTEM BY JOINTLY OPTIMIZING TO CLICK-THROUGH AND PLAYTIME
Shengjie Ma, Zhengjun Zha, Feng Wu
University of Science and Technology of China, China
Wednesday, July 10, 2019
O-20: Search and Retrieval
Time: 15:30 - 16:30 PM
Room: 5DE
Chair: Jianquan Liu NEC Corporation, Japan
15:30 ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MODAL SIMILARITIES
Xin Wen1, Zhizhong Han1,2, Xinyu Yin1, Yu-Shen Liu1
1Tsinghua University, China, 2University of Maryland, USA
15:45 SEMI-SUPERVISED COMPATIBILITY LEARNING ACROSS CATEGORIES FOR CLOTHING MATCHING
Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang
Chinese Academy of Sciences, China
16:00 ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH
Kevin Lin1, Fan Yang2, Qiaosong Wang2, Robinson Piramuthu2
1University of Washington, USA, 2eBay Inc, USA
16:15 A MASK BASED DEEP RANKING NEURAL NETWORK FOR PERSON RETRIEVAL
Lei Qi1, Jing Huo1, Lei Wang2, Yinghuan Shi1, Yang Gao1
1Nanjing University, China, 2University of Wollongong, Australia
Wednesday, July 10, 2019
O-21: Media Understanding
Time: 16:45 - 17:45 PM
Room: 3CD
Chair: Lingyu Duan Peking University, China
16:45 DISCO: DEPTH INFERENCE FROM STEREO USING CONTEXT
Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluri, Rituparna Sarkar, Pankaj Bajpai
Samsung Research Institute Bangalore, India
17:00 PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION
Yunian Chen1, Yanjie Wang3, Yang Zhang1, Yanwen Guo1,2
1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, China, 3Zhejiang University, China
17:15 UNTARGETED ADVERSARIAL ATTACK VIA EXPANDING THE SEMANTIC GAP
Aming Wu1, Yahong Han1, Quanxin Zhang2, Xiaohui Kuang3
1Tianjin University, China, 2Beijing Institute of Technology, China, 3National Key Laboratory of Science and Technology on Information System Security, China
17:30 LEARNING GOAL-ORIENTED VISUAL DIALOG AGENTS: IMITATING AND SURPASSING ANALYTIC EXPERTS
Yen-Wei Chang, Wen-Hsiao Peng
National Chiao Tung University, Taiwan
Wednesday, July 10, 2019
O-22: Super-resolution and Enhancement
Time: 16:45 - 17:45 PM
Room: 3HI
Chair: Ge Li Peking University, China
16:45 GAN-BASED MULTI-LEVEL MAPPING NETWORK FOR SATELLITE IMAGERY SUPER-RESOLUTION
Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu
Wuhan University, China
17:00 QUALITY-GATED CONVOLUTIONAL LSTM FOR ENHANCING COMPRESSED VIDEO
Ren Yang3, Xiaoyan Sun1, Mai Xu2 and Wenjun Zeng1
1Microsoft Research, USA, 2Beihang University, China, 3ETH Zürich, Switzerland
17:15 COMPOUNDED LAYER-PRIOR UNROLLING: A UNIFIED TRANSMISSION-BASED IMAGE ENHANCEMENT FRAMEWORK
Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo
Dalian University of Technology, China
17:30 DEEP PYRAMID VARIATION LEARNING FOR IMAGE INTERPOLATION
Fu Qiang, Wenhan Yang, Ying Li, and Jiaying Liu
Peking University, China
Wednesday, July 10, 2019
O-23: Pose and Action Recognition II
Time: 16:45 - 17:45 PM
Room: 5BC
Chair: Sheng Tang Institute of Computing Technology, Chinese Academy of Sciences, China
16:45 CLOTHES KEYPOINTS LOCALIZATION AND ATTRIBUTE RECOGNITION VIA PRIOR KNOWLEDGE
Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang
Shanghai Jiao Tong University, China
17:00 SPATIO-TEMPORAL MULTI-FACTOR DISCRIMINANT ANALYSIS FOR INDIVIDUAL IDENTIFICATION
Yong Su, Zhiyong Feng
Tianjin University, China
17:15 CHANNEL-WISE TEMPORAL ATTENTION NETWORK FOR VIDEO ACTION RECOGNITION
Jianjun Lei1, Yalong Jia1, Bo Peng1, Qingming Huang2
1Tianjin University, China, 2University of Chinese Academy of Sciences, China
17:30 LOCALIZATION GUIDED FIGHT ACTION DETECTION IN SURVEILLANCE VIDEOS
Qichao Xu1, John See2, Weiyao Lin1
1Shanghai Jiao Tong University, China, 2Multimedia University, Malaysia
Wednesday, July 10, 2019
O-24: Image and Video Enhancements I
Time: 16:45 - 17:45 PM
Room: 5DE
Chair: Miaohui Wang Shenzhen University, China
16:45 RECURSIVE MULTI-STAGE UPSCALING NETWORK WITH DISCRIMINATIVE FUSION FOR SUPER-RESOLUTION
Yue Lu1, Zhuqing Jiang1, Guodong Ju2, Liangheng Shen2, Aidong Men1
1Beijing University of Posts and Telecommunications, China, 2GuangDong TUS-TuWei Technology Co., Ltd, China
17:00 IMPROVING IMAGE SUPER-RESOLUTION VIA FEATURE RE-BALANCING FUSION
Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu
Xidian University, China
17:15 DIFFICULTY-AWARE IMAGE SUPER RESOLUTION VIA DEEP ADAPTIVE DUAL-NETWORK
Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen
Sun Yat-sen University, China
17:30 DENSE-CONNECTED RESIDUAL NETWORK FOR VIDEO SUPER-RESOLUTION
Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang and Dou Jin
Tianjin University, China
Thursday, July 11, 2019
O-25: Face and Person Analysis
Time: 14:00 - 15:00 PM
Room: 3CD
Chair: Hailin Shi JD AI Research, China
14:00 DYNAMIC CASCADED REGRESSION NETWORK WITH REINFORCEMENT LEARNING FOR ROBUST FACE ALIGNMENT
Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li
University of Science and Technology of China, China
14:15 DEEP LEARNING FACE HALLUCINATION VIA ATTRIBUTES TRANSFER AND ENHANCEMENT
Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie and Jun Yu
University of Science and Technology of China, China
14:30 EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS USING MULTI-HYPERGRAPH NEURAL NETWORKS
Junjie Zhu1, Xibin Zhao1, Han Hu2, Yue Gao1
1Tsinghua University, China, 2Beijing Institute of Technology, China
14:45 GPS: GROUP PEOPLE SEGMENTATION WITH DETAILED PART INFERENCE
Yue Liao1, Tianrui Hui1, Chen Gao1, Si Liu2, Yao Sun3, Hefei Ling4, Bo Li2
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2Beihang University, China, 3iie, China, 4Huazhong University of Science and Technology, China
Thursday, July 11, 2019
O-26: Media Classification and Segmentation III
Time: 14:00 - 15:00 PM
Room: 3HI
Chair: Chenggang Yan Hangzhou Dianzi University, China
14:00 MULTI-LABEL IMAGE RECOGNITION WITH JOINT CLASS-AWARE MAP DISENTANGLING AND LABEL CORRELATION EMBEDDING
Zhao-Min Chen1,2, Xiu-Shen Wei2, Xin Jin2, Yanwen Guo1,3
1Nanjing University, China, 2Megvii Technology, China, 3Science and Technology on Information Systems Engineering Laboratory, China
14:15 REAL TIME COMPRESSED VIDEO OBJECT SEGMENTATION
Zhentao Tan, Bin Liu, Weihai Li, Nenghai Yu
University of Science and Technology of China, China
14:30 ACCURATE AND FAST FINE-GRAINED IMAGE CLASSIFICATION VIA DISCRIMINATIVE LEARNING
Zhihui Wang1, Shijie Wang1, Pengbo Zhang1, Haojie Li1, Bo Liu2
1Dalian University of Technology, China, 2Shanghai Jiao Tong University, China
14:45 POSE2BODY: POSE-GUIDED HUMAN PARTS SEGMENTATION
Zhong Li1, Xin Chen2, Wangyiteng Zhou2, Yingliang Zhang2, Jingyi Yu2
1University of Delaware, USA, 2ShanghaiTech University, China
Thursday, July 11, 2019
O-27: Image and Video Enhancements II
Time: 14:00 - 15:00 PM
Room: 5BC
Chair: Ce Zhu University of Electronic Science & Technology of China, China
14:00 RESIDUAL MAGNIFIER: A DENSE INFORMATION FLOW NETWORK FOR SUPER RESOLUTION
Zhan Shu1, Mengcheng Cheng1, Biao Yang1, Zhuo Su1, Xiangjian He2,3
1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia
14:15 EVERYONE IS A CARTOONIST: SELFIE CARTOONIZATION WITH ATTENTIVE ADVERSARIAL NETWORKS
Xinyu Li, Wei Zhang, Tong Shen, Tao Mei
JD AI Research, China
14:30 SCALE-AWARE DEEP NETWORK WITH HOLE CONVOLUTION FOR BLIND MOTION DEBLURRING
Jichun Li, Ke Li, Bo Yan
Fudan University, China
14:45 REMOVING RAIN IN VIDEOS: A LARGE-SCALE DATABASE AND A TWO-STREAM CONVLSTM APPROACH
Tie Liu, Mai Xu and Zulin Wang
Beihang University, China
Thursday, July 11, 2019
O-28: Multimedia Learning and Adaptation
Time: 14:00 - 15:00 PM
Room: 5DE
Chair: Song Li Shanghai Jiao Tong University, China
14:00 TOWARDS QOS-AWARE CLOUD LIVE TRANSCODING: A DEEP REINFORCEMENT LEARNING APPROACH
Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang
Tsinghua University, China
14:15 HIGH SPEED RECURRENT REGRESSION NETWORK FOR VISUAL TRACKING
Ding Ma, Xiangqian Wu
Harbin Institute of Technology, China
14:30 PAAE: A UNIFIED FRAMEWORK FOR PREDICTING ANCHOR LINKS WITH ADVERSARIAL EMBEDDING
Yanmin Shang1, Zhezhou Kang1, Yanan Cao1, Dongjie Zhang1, Yangxi Li2, Yang Li3, Yanbing Liu1
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2National Computer Network Emergency Response Technical Team, China, 3State Information Center, China
14:45 MANIFOLD ALIGNMENT AND DISTRIBUTION ADAPTATION FOR UNSUPERVISED DOMAIN ADAPTATION
Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying
Shanghai University, China
Thursday, July 11, 2019
O-29: Person (Re-)Identification and People Detection
Time: 15:30 - 16:30 PM
Room: 3CD
Chair: Bingpeng Ma University of Science and Technology of China, China
15:30 PEDESTRIAN RE-IDENTIFICATION BASED ON TREE BRANCH NETWORK WITH LOCAL AND GLOBAL LEARNING
Hui Li1, Meng Yang2,4, Zhihui Lai1, Weishi Zheng2, Zitong Yu3
1Shenzhen University, China, 2Sun Yat-sen University, China, 3University of Oulu, Finland, 4Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Education, China
15:45 ADVERSARIAL BINARY CODING FOR EFFICIENT PERSON RE-IDENTIFICATION
Zheng Liu1, Jie Qin2, Annan Li1, Yunhong Wang1, and Luc Van Gool3
1Beihang University, China, 2Inception Institute of Artificial Intelligence, UAE, 3Computer Vision Laboratory, ETH Zurich, Switzerland
16:00 PERSON RE-IDENTIFICATION WITH GRADUAL BACKGROUND SUPPRESSION
Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao
Xidian University, China
16:15 MULTI-BRANCH CONTEXT-AWARE NETWORK FOR PERSON RE-IDENTIFICATION
Yingxin Zhu1, Xiaoqiang Guo2, Jianlei Liu1, Zhuqing Jiang1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, Beijing, China
Thursday, July 11, 2019
O-30: Multimedia and Language II
Time: 15:30 - 16:30 PM
Room: 3HI
Chair: Annan Li Beijing University of Aeronautics and Astronautics, China
15:30 POST-PROCESSING OF WORD REPRESENTATIONS VIA VARIANCE NORMALIZATION AND DYNAMIC EMBEDDING
Bin Wang1, Fenxiao Chen1, Angela Wang2 and C.-C. Jay Kuo1
1University of Southern California, USA, 2University of California, Berkeley, USA
15:45 MULTI-MODAL LANGUAGE ANALYSIS WITH HIERARCHICAL INTERACTION-LEVEL AND SELECTION-LEVEL ATTENTION
Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou
Soochow University, China
16:00 MODELING THE CLAUSE-LEVEL STRUCTURE TO MULTIMODAL SENTIMENT ANALYSIS VIA REINFORCEMENT LEARNING
Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou
Soochow University, China
16:15 TWICE OPPORTUNITY KNOCKS SYNTACTIC AMBIGUITY: A VISUAL QUESTION ANSWERING MODEL WITH YES/NO FEEDBACK
Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin
Tianjin Polytechnic University, China
Thursday, July 11, 2019
O-31: Multimedia Communications and Localization
Time: 15:30 - 16:30 PM
Room: 5BC
Chair: Sanjeev Mehrotra Microsoft, USA
15:30 GEOCAPSNET: GROUND TO AERIAL VIEW IMAGE GEO-LOCALIZATION USING CAPSULE NETWORK
Bin Sun1, Chen Chen2, Yingying Zhu1, Jianmin Jiang1
1Shenzhen University, China, 2University of North Carolina at Charlotte, USA
15:45 IMPROVING ROBUSTNESS OF DASH AGAINST NETWORK UNCERTAINTY
Bo Wang1,2, Fengyuan Ren1,2
1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China
16:00 HYBRID CONTROL-BASED ABR: TOWARDS LOW-DELAY LIVE STREAMING
Bo Wang1,2, Fengyuan Ren1,2, Chao Zhou3
1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China, 3Beijing Kuaishou Technology Co., Ltd, China
16:15 TAXI ORIGIN-DESTINATION DEMAND PREDICTION WITH CONTEXTUALIZED SPATIAL-TEMPORAL NETWORK
Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin
Sun Yat-sen University, China
Thursday, July 11, 2019
O-32: Multimedia Security, Privacy and Forensics II
Time: 15:30 - 16:30 PM
Room: 5DE
Chair: Wen Ji Institute of Computing Technology, Chinese Academy of Sciences, China
15:30 FAST IMAGE CLUSTERING BASED ON CAMERA FINGERPRINT ORDERING
Sahib Khan, Tiziano Bianchi
Politecnico di Torino, Italy
15:45 ENFORCING ACCESS CONTROL IN DISTRIBUTED VERSION CONTROL SYSTEMS
Xin Xu1,2, Quanwei Cai1,2, Jingqiang Lin1,2, Shiran Pan1,2, Liangqin Ren1,2
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
16:00 ATTRIBUTE-BASED ACCOUNTABLE ACCESS CONTROL FOR MULTIMEDIA CONTENT WITH IN-NETWORK CACHING
Peixuan He1, Kaiping Xue1, Jie Xu1, Qiudong Xia1, Jianqing Liu2, Hao Yue3
1University of Science and Technology of China, China, 2University of Alabama in Huntsville, USA, 3San Francisco State University, USA
16:15 PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY
Liyue Fan
University at Albany, State University of New York, USA
Thursday, July 11, 2019
O-33: Multimedia Sensing and Signal Processing
Time: 16:45 - 17:45 PM
Room: 3HI
Chair: Zhi Jin Sun Yat-sen University, China
16:45 JOINTLY SOLVING DEBLURRING AND SUPER-RESOLUTION PROBLEMS WITH DUAL SUPERVISED NETWORK
Zhenwen Liang, Dongyang Zhang, Jie Shao
University of Electronic Science and Technology of China, China
17:00 TWO-STAGED ACOUSTIC MODELING ADAPTION FOR ROBUST SPEECH RECOGNITION BY THE EXAMPLE OF GERMAN ORAL HISTORY INTERVIEWS
Michael Gref1,2, Christoph Schmidt1, Sven Behnke1,3, Joachim Köhler1
1Fraunhofer Institute for Intelligent Analysis and Information Systems, Germany, 2Niederrhein University of Applied Sciences, Germany, 3University of Bonn, Germany
17:15 AN ADAPTIVE AFFINITY GRAPH WITH SUBSPACE PURSUIT FOR NATURAL IMAGE SEGMENTATION
Yang Zhang1, Huiming Zhang1, Yanwen Guo1, Kai Lin2, Jingwu He1
1Nanjing University, China, 2Hubei University of Technology, China
17:30 PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY
Li He, Yi Zhou, Hongqing Liu
Chongqing University of Posts and Telecommunications, China
Thursday, July 11, 2019
O-34: Detection and Recognition
Time: 16:45 - 17:45 PM
Room: 5BC
Chair: Lifang Wu Beijing University of Technology, China
16:45 LOCALITY-CONSTRAINED SPATIAL TRANSFORMER NETWORK FOR VIDEO CROWD COUNTING
Yanyan Fang1, Biyun Zhan1, Wandi Cai1, Shenghua Gao2, Bo Hu1
1Fudan University, China, 2ShanghaiTech University, China
17:00 SPATIAL-AWARE NON-LOCAL ATTENTION FOR FASHION LANDMARK DETECTION
Yixin Li1, Shengqin Tang2, Yun Ye3, Jinwen Ma1
1Peking University, China, 2Xi’an Jiaotong University, China, 3JD AI Research, China
17:15 RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
Wu Zheng1,2, Lin Li1,2, Zhaoxiang Zhang1,2, Yan Huang1,2, Liang Wang1,2
1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
17:30 MULTI-VIEW LEARNING FOR VEHICLE RE-IDENTIFICATION
Weipeng Lin1, Yidong Li1, Xiaoliang Yang1, Peixi Peng2, Junliang Xing2
1Beijing Jiaotong University, China, 2Institute of Automation, Chinese Academy of Sciences, China
Thursday, July 11, 2019
O-35: Multi-modal Media Computing and Human-machine Interaction
Time: 16:45 - 17:45 PM
Room: 5DE
Chair: Sanghoon Lee Yonsei University, Korea
16:45 MANY COULD BE BETTER THAN ALL: A NOVEL INSTANCE-ORIENTED ALGORITHM FOR MULTI-MODAL MULTI-LABEL PROBLEM
Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang
Nanjing University, China
17:00 AFFECTIVE VIDEO CONTENT ANALYSES BY USING CROSS-MODAL EMBEDDING LEARNING FEATURES
Benchao Li1,3, Zhenzhong Chen2, Shan Li3, WeiShi Zheng1,4
1Sun Yat-sen University, China, 2Wuhan University, China, 3Tencent, America, 4Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
17:15 LEARNING A 3D GAZE ESTIMATOR WITH IMPROVED ITRACKER COMBINED WITH BIDIRECTIONAL LSTM
Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen
Zhejiang University of Technology, China
17:30 DETECTION OF OCCLUDED ROAD SIGNS ON AUTONOMOUS DRIVING VEHICLES
Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang
University of North Texas, USA
Industry Track
Wednesday, July 10, 2019
Time: 14:00 - 15:00 PM
Room: 3B
Chair: Guanbin Li Sun Yat-sen University, China
LOCALIZING ADVERTS IN OUTDOOR SCENES
Soumyabrata Dev
The ADAPT SFI Research Centre, Ireland
HIERARCHICAL RECURSIVE NETWORK FOR SINGLE IMAGE SUPER RESOLUTION
Minglan Su1, Shenqi Lai2, Zhenhua Chai2, Xiaoming Wei2, Yong Liu1
1University of Posts and Telecommunications, China, 2Meituandianping Group, China
PARALLEL VOLUME RENDERING METHOD FOR OUT-OF-CORE NON-UNIFORMLY PARTITIONED DATASETS
Jian Xue1, Xiaoye Zhu1, Ke Lu1, Yutong Kou2
1University of Chinese Academy of Sciences, China, 2Huazhong University of Science & Technology, China
VEHICLE RE-IDENTIFICATION WITH REFINED PART MODEL
Xingan Ma1, Kuan Zhu1, Haiyun Guo2, Jinqiao Wang1, Min Huang1, Qinghai Miao1
1University of Chinese Academy of Sciences, China, 2Institute of Automation, Chinese Academy of Sciences, China
Poster Sessions
Poster Session 1 & TMM Poster
Tuesday, July 9, 2019
P-01: Emerging Multimedia Applications and Technologies
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Chun Yuan Tsinghua University, China
[ID:1] PAY BY SHOWING YOUR PALM: A STUDY OF PALMPRINT VERIFICATION ON MOBILE PLATFORMS
Yingyi Zhang1, Lin Zhang1, Xiao Liu1, Shengjie Zhao1, Ying Shen1, Yukai Yang2
1Tongji University, China, 2Uppsala University, Sweden
[ID:2] REGULARIZE NETWORK SKIP CONNECTIONS BY GATING MECHANISMS FOR ELECTRON MICROSCOPY IMAGE SEGMENTATION
Yuze Guo, Wenjing Huang, Yajing Chen, Shikui Tu
Shanghai Jiao Tong University, China
[ID:3] CROSS MODALITY ALIGNMENT OF MEDICAL VOLUMES USING SPATIO-SEMANTIC ATTENTIVE CYCLE-GAN
Xiaohui Lin1, Yi Xu1, Mingda Wang1, Bingbing Ni1, Xiaokang Yang1, Guangyu Tao2, Xiaodan Ye2
1Shanghai Jiao Tong University, China, 2Shanghai Chest Hospital, China
[ID:4] A NEW APPROACH TO AUTOMATIC CLOTHING MATTING FROM MANNEQUINS
Bin Yuan1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1
1Tsinghua University Graduate School at Shenzhen, China, 2University College London, UK
[ID:5] CLUSTERING AND DYNAMIC SAMPLING BASED UNSUPERVISED DOMAIN ADAPTATION FOR PERSON RE-IDENTIFICATION
Jinlin Wu1,2, Shengcai Liao3, Zhen Lei1,2, Xiaobo Wang4, Yang Yang1,2, Stan Z. Li1,2
1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Inception Institute of Artificial Intelligence, China, 4JD AI Research, China
[ID:6] SEMANTIC-EMBEDDING AND SHAPE-AWARE U-NET FOR ULTRASOUND EYEBALL SEGMENTATION
Fanchao Lin, Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang
University of Science and Technology of China, China
[ID:7] LECTURE2NOTE: AUTOMATIC GENERATION OF LECTURE NOTES FROM SLIDE-BASED EDUCATIONAL VIDEOS
Chengpei Xu1, Ruomei Wang1, Shujin Lin1, Xiaonan Luo2, Baoquan Zhao2, Lijie Shao1, Mengqiu Hu1
1Sun Yat-sen University, China, 2Guilin University of Electronic Technology, China
[ID:8] DATA-ADAPTIVE PACKING METHOD FOR COMPRESSION OF DYNAMIC POINT CLOUD SEQUENCES
Jianqiang Liu, Jian Yao, Jingmin Tu, Junhao Cheng
Wuhan University, China
[ID:9] SEMANTIC GAN: APPLICATION FOR CROSS-DOMAIN IMAGE STYLE TRANSFER
Pengfei Li, Meng Yang
Sun Yat-sen University, China
[ID:10] IMPROVING EXTREME LOW-LIGHT IMAGE DENOISING VIA RESIDUAL LEARNING
Paras Maharjan1, Li Li1, Zhu Li1, Ning Xu2, Chongyang Ma3, Yue Li4
1University of Missouri-Kansas City, USA, 2Amazon Go, USA, 3Kwai Inc., China, 4University of Science and Technology of China, China
[ID:11] USER PROFILING WITH CAMPUS WI-FI ACCESS TRACE AND NETWORK TRAFFIC
Yang Gao, Jun Tao, Li Zeng, Xiaoming Fang, Qian Fang, Xiaoyan Li
Southeast University, China
[ID:12] A NEW VISUAL INTERFACE FOR SEARCHING AND NAVIGATING SLIDE-BASED LECTURE VIDEOS
Baoquan Zhao1, Songhua Xu2, Shujin Lin3, Ruomei Wang3 and Xiaonan Luo1
1Guilin University of Electronic Technology, China, 2University of South Carolina, Columbia, USA, 3Sun Yat-sen University, China
Tuesday, July 9, 2019
P-02: Media Classification and Segmentation I
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Toshihiko Yamasaki University of Tokyo, Japan
[ID:13] SELF-ATTENTIVE NETWORKS FOR ONE-SHOT IMAGE RECOGNITION
Pin Fang1, Yisen Wang2, Yuan Luo1
1Shanghai Jiao Tong University, China, 2JD AI Research, China
[ID:14] TREE-STRUCTURED KRONECKER CONVOLUTIONAL NETWORK FOR SEMANTIC SEGMENTATION
Tianyi Wu1,2, Sheng Tang1, Rui Zhang1,2, Juan Cao1, Jintao Li1
1Institute of Computing Technology, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:15] PART-BASED CONVOLUTIONAL NETWORK FOR IMBALANCED AGE ESTIMATION
Yixin Zhu, Jun-Yong Zhu, Wei-Shi Zheng
Sun Yat-sen University, China
[ID:16] LEARNING TO DISTINGUISH: A GENERAL METHOD TO IMPROVE COMPARE-BASED ONE-SHOT LEARNING FRAMEWORKS FOR SIMILAR CLASSES
Qiuzheng Chen, Ruoyu Yang
Nanjing University, China
[ID:17] PREDICTABILITY ANALYZING: DEEP REINFORCEMENT LEARNING FOR EARLY ACTION RECOGNITION
Xiaokai Chen1,2, Ke Gao1, Juan Cao1
1Institute of Computing Technology, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:18] A FAST END-TO-END METHOD WITH STYLE TRANSFER FOR ROOM LAYOUT ESTIMATION
Junming Chen, Jie Shao, Dongyang Zhang, Xuehui Wu
University of Electronic Science and Technology of China, China
[ID:19] RESOLVING INTRA-CLASS IMBALANCE FOR GAN-BASED IMAGE AUGMENTATION
Lijyun Huang, Kate Ching-Ju Lin, Yu-Chee Tseng
National Chiao Tung University, Taiwan
[ID:20] END-TO-END PANOPTIC SEGMENTATION WITH PIXEL-LEVEL NON-OVERLAPPING EMBEDDING
Weitong Zhang1,2,4, Qieshi Zhang1,2, Jun Cheng1,2, Cong Bai3, Pengyi Hao3
1Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China, 2The Chinese University of Hong Kong, China, 3Zhejiang University of Technology, China, 4Shaanxi Normal University, China
[ID:21] ROBUST EMBEDDING FRAMEWORK WITH DYNAMIC HYPERGRAPH FUSION FOR MULTI-LABEL CLASSIFICATION
Kaixiang Wang
Nanjing Normal University, China
Tuesday, July 9, 2019
P-03: Oral-05 to Oral-12
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Jian Zhang University of Technology Sydney, Australia
[ID:22] PARTICLE SWARM LOSS FOR LIGHTWEIGHT OBJECT DETECTION
Peizhen Zhang1,2, Feng Zheng3, Junlong Du2, Jun Zhang2, Xiaowei Guo2, Wei-Shi Zheng1
1Sun Yat-sen University, China, 2Youtu Lab, Tencent, China, 3Southern University of Science and Technology, China
[ID:23] INCORPORATING CATEGORY TAXONOMY IN DEEP REINFORCEMENT LEARNING BASED IMAGE HASHING
Qiang Fu1, Linsen Dong2, Ziyuan Liu2, Yong Luo2, Yonggang Wen2, Ying Li1, Ling-Yu Duan3
1Peking University, China, 2Nanyang Technological University, Singapore, 3Peking University, China
[ID:24] TRUNCATED GRADIENT CONFIDENCE-WEIGHTED BASED ONLINE LEARNING FOR IMBALANCE STREAMING DATA
Ji Hu1, Chenggang Yan1, Xing Liu1, Jiyong Zhang1, Dongliang Peng1, Yi Yang2
1Hangzhou Dianzi University, China, 2UTS, Australia
[ID:25] UAV TARGET TRACKING BY DETECTION VIA DEEP NEURAL NETWORKS
Mohamed A. Kassab1, Ali Maher2, Fathy Elkazzaz3, Zhang Baochang1,4
1Beihang University, China, 2Military Technical College, Egypt, 3Benha University, Egypt, 4Shenzhen Academy of Aerospace Technology, China
[ID:26] QUARTER-POINT CODEWORD EXPANSION FOR PRODUCT QUANTIZATION
Shan An1,2, Zhibiao Huang1, Guangfu Che1, Xianglong Liu2, Xin Ma3, Yu Chen1
1Department of Data Intelligence, JD.com, China, 2Beihang University, China, 3Shandong University, China
[ID:27] CONTEXT-AWARE AFFECTIVE GRAPH REASONING FOR EMOTION RECOGNITION
Minghui Zhang, Yumeng Liang, Huadong Ma
Beijing University of Posts and Telecommunications, China
[ID:28] SPL: EXPLOITING UNLABELED DATA FOR MULTI-LABEL IMAGE CLASSIFICATION
Weibo Zhang1,2, Fuqing Zhu1, Jiao Dai1, Songlin Hu1, Jizhong Han1, Tao Guo1
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2School of Cyber Security, University of Chinese Academy of Sciences, China
[ID:29] MLTS: A MULTI-LANGUAGE SCENE TEXT SPOTTER
Yu Zhou1, Shancheng Fang2, Hongtao Xie1, Zheng-Jun Zha1, Yongdong Zhang1
1University of Science and Technology of China, China, 2Institute of Information Engineering, Chinese Academy of Sciences, China
[ID:30] UNSUPERVISED MONOCULAR DEPTH ESTIMATION BASED ON DUAL ATTENTION MECHANISM AND DEPTH-AWARE LOSS
Xinchen Ye1, Mingliang Zhang1,2, Rui Xu1, Wei Zhong1, Xin Fan1, Zhu Liu1, Jiaao Zhang1
1Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China, 2Dalian University of Technology of Liaoning Province, China
[ID:31] TOWARDS HIGH-QUALITY INTRINSIC IMAGES IN THE WILD
Gang Fu1, Qing Zhang2, Chunxia Xiao1
1Wuhan University, China, 2Sun Yat-sen University, China
[ID:32] UNSUPERVISED LEARNING FOR OPTICAL FLOW ESTIMATION USING PYRAMID CONVOLUTION LSTM
Shuosen Guan1,3, Haoxin Li2,3, Wei-Shi Zheng1,3
1School of Data and Computer Science, Sun Yat-sen University, China, 2School of Electronics and Information Technology, Sun Yat-sen University, China, 3Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
[ID:33] MAST: MASK-ACCELERATED SHEARLET TRANSFORM FOR DENSELY-SAMPLED LIGHT FIELD RECONSTRUCTION
Yuan Gao1, Robert Bregovic2, Atanas Gotchev2, Reinhard Koch1
1Kiel University, Germany, 2Tampere University, Finland
[ID:34] CODA: COUNTING OBJECTS VIA SCALE-AWARE ADVERSARIAL DENSITY ADAPTION
Li Wang1, Yongbo Li2, Xiangyang Xue1
1Fudan University, China, 2Megvii Inc (Face++), China
[ID:35] PDNET: PRIOR-MODEL GUIDED DEPTH-ENHANCED NETWORK FOR SALIENT OBJECT
Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H Li, Ge Li
SECE, Shenzhen Graduate School, Peking University, China
[ID:36] CONTINUOUS SCALE ADAPTION FOR EFFICIENT BOX-BASED SCENE TEXT
Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong
Dalian University of Technology, China
[ID:37] MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK
Xiaobao Guo1, Jinxing Li2, Bingzhi Chen1, Guangming Lu1
1Harbin Institute of Technology (Shenzhen), China, 2The Chinese University of Hong Kong (Shenzhen), China
[ID:38] DMPR-PS: A NOVEL APPROACH FOR PARKING-SLOT DETECTION USING DIRECTIONAL MARKING-POINT REGRESSION
Junhao Huang1, Lin Zhang1, Ying Shen1, Huijuan Zhang1, Shengjie Zhao1, Yukai Yang2
1Tongji University, China, 2Uppsala University, Sweden
[ID:39] ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK-AWARE GATED DISCRIMINATOR
Yong-Xiang Lin1, Daniel Stanley Tan1, Wen-Huang Cheng2, Kai-Lung Hua1
1National Taiwan University of Science and Technology, Taiwan, 2National Chiao Tung University, Taiwan
[ID:40] STOCHASTIC VIDEO GENERATION WITH DISENTANGLED REPRESENTATIONS
Maomao Li1, Chun Yuan1, Zhihui Lin1,2, Zhuobin Zheng1,2, Yangyang Cheng1,2
1Graduate School at Shenzhen, Tsinghua University, China, 2Tsinghua University, China
[ID:41] Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION
Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, and Philip S. Yu
Tsinghua University, China
[ID:42] ENERGY-BASED RECURRENT MODEL FOR STOCHASTIC MODELING OF MUSIC
Yingru Liu1, Dongliang Xie2, Xin Wang1
1Stony Brook University, USA, 2Beijing University of Posts and Telecommunications, China
[ID:43] RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS
Huaixuan Zhang1, Yuhai Lan3, Tao Dai1,2, Ruizhi Qiao4, Ying Xu1, Yao Yao1, Shu-Tao Xia1,2
1Graduate School at Shenzhen, Tsinghua University, China, 2PCL Research Center of Networks and Communications, Peng Cheng Laboratory, China, 3Harbin Institute of Technology, China, 4Tencent Youtu Lab, China
[ID:44] RESIDUAL DILATED NETWORK WITH ATTENTION FOR IMAGE BLIND DENOISING
Guanqun Hou1, Yujiu Yang1, Jing-Hao Xue2
1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK
[ID:45] COLLABORATIVE DEEP REINFORCEMENT LEARNING FOR IMAGE CROPPING
Zhuopeng Li, Xiaoyan Zhang
Shenzhen University, China
[ID:46] SIMILARITY-AWARE DEEP ADVERSARIAL LEARNING FOR FACIAL AGE ESTIMATION
Penghui Sun, Hao Liu, Xing Wang, Zhenhua Yu, Suping Wu
Ningxia University, China
[ID:47] LEARNING TRANSMISSION FILTERING NETWORK FOR IMAGE-BASED PM2.5 ESTIMATION
Yinghong Liao1, Bin Qiu1, Zhuo Su1, Ruomei Wang1, Xiangjian He2,3
1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia
[ID:48] VIDEO-BASED EARLY ASD DETECTION VIA TEMPORAL PYRAMID NETWORKS
Yuan Tian, Xiongkuo Min, Guangtao Zhai, Zhiyong Gao
Shanghai Jiao Tong University, China
[ID:49] AUTOMATIC USER CATEGORIZATION THROUGH LARGE TRANSACTION DATA
Ying Zhang, YinJia Zhang, Qinpei Zhao, Weixiong Rao
Tongji University, China
[ID:50] FEATURE PRESERVING AND UNIFORMITY-CONTROLLABLE POINT CLOUD SIMPLIFICATION ON GRAPH
Junkun Qi, Wei Hu, Zongming Guo
Peking University, China
[ID:51] 360SRL: A SEQUENTIAL REINFORCEMENT LEARNING APPROACH FOR ABR TILE-BASED 360 VIDEO STREAMING
Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen
University of Science and Technology of China, China
[ID:52] CONTENT-AWARE PERSPECTIVE PROJECTION OPTIMIZATION FOR VIEWPORT RENDERING OF 360° IMAGES
Falah Jabar, Joao Ascenso, Maria Paula Queluz
Universidade de Lisboa, Portugal
[ID:53] AN AR BENCHMARK SYSTEM FOR INDOOR PLANAR OBJECT TRACKING
Ziming Wu1, Jiabin Guo2, Shuangli Zhang2, Chen Zhao2, Xiaojuan Ma1
1Hong Kong University of Science and Technology, China, 2Netease AR, China
Tuesday, July 9, 2019
TMM Poster
Time: 13:30 - 15:00 PM
Room: 3rd Floor
[ID:54] ENHANCING IMAGE WATERMARKING WITH ADAPTIVE EMBEDDING PARAMETER AND PSNR GUARANTEE
Baoning Niu
Taiyuan University of Technology, China
Poster Session 2
Tuesday, July 9, 2019
P-04: Multimedia Analysis, Search and Recommendation
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Kate Ching-Ju Lin National Chiao Tung University, Taiwan
[ID:1] MULTI-SCALE SCENE TEXT DETECTION VIA RESOLUTION TRANSFORM
Peirui Cheng, Weiqiang Wang, Yuanqiang Cai
University of Chinese Academy of Sciences, China
[ID:2] TOWARDS ACCURATE INSTANCE-LEVEL TEXT SPOTTING WITH GUIDED ATTENTION
Haiyan Wang, Xuejian Rong, Yingli Tian
The City College of New York, USA
[ID:3] MULTI-SCALE GEM POOLING WITH N-PAIR CENTER LOSS FOR FINE-GRAINED IMAGE SEARCH
Youming Deng, Xianming Lin, Run Li, Rongrong Ji
Xiamen University, China
[ID:4] SEMI-SUPERVISED SEMANTIC-PRESERVING HASHING FOR EFFICIENT CROSS-MODAL RETRIEVAL
Xingzhi Wang1,2, Xin Liu1,2, Zhikai Hu1, Nannan Wang2, Wentao Fan1, Ji-Xiang Du1
1Huaqiao University, China, 2Xidian University, China
[ID:5] ROBUST MULTI-VIEW HASHING FOR CROSS-MODAL RETRIEVAL
Haitao Wang, Hui Chen, Min Meng, JiGang Wu
Guangdong University of Technology, China
[ID:6] SCENE TEXT RECOGNITION VIA GATED CASCADE ATTENTION
Siwei Wang, Yongtao Wang, Xiaoran Qin, Qijie Zhao, Zhi Tang
Peking University, China
[ID:7] TEXT-ATTENTIONAL CONDITIONAL GENERATIVE ADVERSARIAL NETWORK FOR SUPER-RESOLUTION OF TEXT IMAGES
Yuyang Wang, Feng Su and Ye Qian
Nanjing University, China
[ID:8] ONLINE LEARNING TO RANK IN A LISTWISE APPROACH FOR INFORMATION RETRIEVAL
Fan Ma1, Haoyun Yang2, Haibing Yin2, Xiaofeng Huang2, Chenggang Yan2, Xiang Meng2
1University of Technology Sydney, Australia, 2Hangzhou Dianzi University, China
Tuesday, July 9, 2019
P-05: Pose and Action Recognition I
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Yicong Zhou University of Macau, China
[ID:9] RECOGNIZING MICRO ACTIONS IN VIDEOS: LEARNING MOTION DETAILS VIA SEGMENT-LEVEL TEMPORAL PYRAMID
Yang Mi1, Song Wang1,2
1University of South Carolina, USA, 2Tianjin University, China
[ID:10] ENTANGLEMENT LOSS FOR CONTEXT-BASED STILL IMAGE ACTION RECOGNITION
Miao Xin1, Shuhang Wang2, Jian Cheng1
1Institute of Automation, Chinese Academy of Sciences, China, 2Harvard University, USA
[ID:11] BI-DIRECTIONAL MESSAGE PASSING BASED SCANET FOR HUMAN POSE ESTIMATION
Lu Zhou, Yingying Chen, Jinqiao Wang, Ming Tang and Hanqing Lu
Institute of Automation, Chinese Academy of Sciences, China
[ID:12] SPATIAL MASK CONVLSTM NETWORK AND INTRA-CLASS JOINT TRAINING METHOD FOR HUMAN ACTION RECOGNITION IN VIDEO
Jingjun Chen, Yonghong Song, Yuanlin Zhang
Xi’an Jiaotong University, China
[ID:13] SELF-ATTENTION GUIDED DEEP FEATURES FOR ACTION RECOGNITION
Renyi Xiao1, Yonghong Hou1, Zihui Guo1, Chuankun Li1, Pichao Wang2, Wanqing Li3
1Tianjin University, China, 2Alibaba Group (U.S.) Inc., USA, 3University of Wollongong, Australia
[ID:14] LEARNING SHAPE-MOTION REPRESENTATIONS FROM GEOMETRIC ALGEBRA SPATIO-TEMPORAL MODEL FOR SKELETON-BASED ACTION RECOGNITION
Yanshan Li1, Rongjie Xia1, Xing Liu1, Qinghua Huang2
1Shenzhen University, China, 2Northwestern Polytechnical University, China
[ID:15] ACPNET: ANCHOR-CENTER BASED PERSON NETWORK FOR HUMAN POSE ESTIMATION AND INSTANCE SEGMENTATION
Yang Bai, Weiqiang Wang
University of Chinese Academy of Sciences, China
[ID:16] SPATIO-TEMPORAL MULTI-SCALE SOFT QUANTIZATION LEARNING FOR SKELETON-BASED HUMAN ACTION RECOGNITION
Jianyu Yang1, Chen Zhu1, Junsong Yuan2
1Soochow University, China, 2State University of New York at Buffalo, USA
[ID:17] LPHD: A LARGE-SCALE HEAD POSE DATASET FOR RGB IMAGES
Wei Sun1, Yezhao Fan1, Xiongkuo Min1, Shihao Peng1, Siwei Ma2 and Guangtao Zhai1
1Shanghai Jiao Tong University, China, 2Peking University, China
Tuesday, July 9, 2019
P-06: Person and Emotion Understanding
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Zheng Wang National Institute of Informatics, Japan
[ID:18] HUMAN-CENTERED EMOTION RECOGNITION IN ANIMATED GIFS
Zhengyuan Yang, Yixuan Zhang, Jiebo Luo
University of Rochester, USA
[ID:19] DEEP SEMI-SUPERVISED PERSON RE-IDENTIFICATION WITH EXTERNAL MEMORY
Qize Yang, Ancong Wu, Wei-Shi Zheng
Sun Yat-sen University, China
[ID:20] CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION
Tanzila Rahman1, Mrigank Rochan2, Yang Wang2
1University of British Columbia, Canada, 2University of Manitoba, Canada
[ID:21] POOLING MAP ADAPTATION IN CONVOLUTIONAL NEURAL NETWORK FOR FACIAL EXPRESSION RECOGNITION
Zhiyuan Li1, Shizhong Han2, Ahmed Shehab Khan1, Jie Cai1, Zibo Meng3, James O’Reilly1, Yan Tong1
1University of South Carolina, USA, 212Sigma Technologies, China, 3Innopeak Technology Inc., USA
[ID:22] FAST PERSON SEARCH PIPELINE
Jianheng Li, Fuhang Liang, Yuanxun Li, Wei-Shi Zheng
Sun Yat-sen University, China
[ID:23] DYNAMIC REGION DIVISION FOR ADAPTIVE LEARNING PEDESTRIAN COUNTING
Gaoqi He1, Zhenwei Ma2, Binhao Huang2, Bin Sheng3, Yubo Yuan2
1East China Normal University, China, 2East China University of Science and Technology, China, 3Shanghai Jiao Tong University, China
[ID:24] ANOTHER DIMENSION: TOWARDS MULTI-SUBNET NEURAL NETWORK FOR IMAGE SENTIMENT ANALYSIS
Jing Zhang, Han Sun, Zhe Wang, Tong Ruan
East China University of Science and Technology, China
[ID:25] TWO-STAGE MODEL FOR SOCIAL RELATIONSHIP UNDERSTANDING FROM VIDEOS
Pilin Dai, Jinna Lv, Bin Wu
Beijing University of Posts and Telecommunications, China
[ID:26] FPN++: A SIMPLE BASELINE FOR PEDESTRIAN DETECTION
Junhao Hu, Lei Jin, Shenghua Gao
ShanghaiTech University, China
[ID:27] AN END-TO-END LEARNING APPROACH FOR MULTIMODAL EMOTION RECOGNITION: EXTRACTING COMMON AND PRIVATE INFORMATION
Fei Ma, Wei Zhang, Yang Li, Shao-Lun Huang, Lin Zhang
Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China
Tuesday, July 9, 2019
P-07: Best Papers and Oral-01 to Oral-04
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Xiaoping Zhang Ted Rogers School of Management, Ryerson University, Canada
[ID:28] AN END-TO-END ARCHITECTURE FOR CLASS-INCREMENTAL OBJECT DETECTION WITH KNOWLEDGE DISTILLATION
Yu Hao1, Yanwei Fu1, Yu-Gang Jiang1,2, Qi Tian3
1Fudan University, China, 2Jilian Technology Group (Video++), China, 3Huawei Noah’s Ark Lab, China
[ID:29] REAL-TIME INDOOR SCENE RECONSTRUCTION WITH RGBD AND INERTIAL INPUT
Zunjie Zhu1, Feng Xu2, Chenggang Yan1, Xinhong Hao3, Xiangyang Ji2, Yongdong Zhang4, Qionghai Dai2
1Hangzhou Dianzi University, China, 2Tsinghua University, China, 3Beijing Institute of Technology, China, 4University of Science and Technology of China, China
[ID:30] DOUBLY SEMI-SUPERVISED MULTIMODAL ADVERSARIAL LEARNING FOR CLASSIFICATION, GENERATION AND RETRIEVAL
Changde Du1, Changying Du2, Huiguang He1
1Institute of Automation, Chinese Academy of Sciences, China, 2Huawei Noah’s Ark Lab, China
[ID:31] TOWARDS DIGITAL RETINA IN SMART CITIES: A MODEL GENERATION, UTILIZATION AND COMMUNICATION PARADIGM
Yihang Lou1, Ling-Yu Duan1, Yong Luo1, Ziqian Chen1, Tongliang Liu2, Shiqi Wang3, Wen Gao1
1Peking University, China, 2University of Sydney, Australia, 3City University of Hong Kong, China, 4The Peng Cheng Laboratory, China
[ID:32] SDP: AN IMPROVED BASELINE ESTIMATION MODEL BASED ON STANDARD DEVIATION PROPORTION
Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang
Northeastern University, China
[ID:33] CITATION RECOMMENDATION BASED ON WEIGHTED HETEROGENEOUS INFORMATION NETWORK CONTAINING SEMANTIC LINKING
Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang
Anhui University, China
[ID:34] FUSION-SUPERVISED DEEP CROSS-MODAL HASHING
Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang
Shandong Normal University, China
[ID:35] DOMAIN UNCERTAINTY BASED ON INFORMATION THEORY FOR CROSS-MODAL HASH RETRIEVAL
Wei Chen1, Nan Pu1, Yu Liu2, Erwin M. Bakker1, Michael S. Lew1
1Leiden University, Holland, 2ESAT-PSI, KU Leuven, Belgium
[ID:36] ADAPTIVE PLANE PROJECTION FOR VIDEO-BASED POINT CLOUD CODING
Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira
Instituto Superior Técnico, Universidade de Lisboa - Instituto de Telecomunicações, Lisboa, Portugal
[ID:37] FAST CU PARTITIONING ALGORITHM FOR H.266/VVC INTRA-FRAME CODING
Ting Fu1, Hao Zhang1, Fan Mu1, Huanbang Chen2
1Central South University, China, 2Huawei Base, China
[ID:38] TWO-STAGE FAST MULTIPLE TRANSFORM SELECTION ALGORITHM FOR VVC INTRA CODING
Ting Fu1, Hao Zhang1, Fan Mu1, Huanbang Chen2
1Central South University, China, 2Huawei Base, China
[ID:39] HISTORY-BASED MOTION VECTOR PREDICTION FOR FUTURE VIDEO CODING
Junru Li1, Meng Wang2, Li Zhang3, Kai Zhang3, Hongbin Liu3, Shiqi Wang2, Siwei Ma1, Wen Gao1
1Peking University, China, 2City University of Hong Kong, China, 3Bytedance Inc., USA
[ID:40] AMS-SFE: TOWARDS AN ALIGNMENT OF MANIFOLD STRUCTURES VIA SEMANTIC FEATURE EXPANSION FOR ZERO-SHOT LEARNING
Jingcai Guo, Song Guo
The Hong Kong Polytechnic University, China
[ID:41] LOW-SHOT PALMPRINT RECOGNITION BASED ON META-SIAMESE NETWORK
Xuefeng Du1, Dexing Zhong1,2, Pengna Li1
1Xi’an Jiaotong University, China, 2Research Institute of Xi’an Jiaotong University, China
[ID:42] SR-GAN: SEMANTIC RECTIFYING GENERATIVE ADVERSARIAL NETWORK FOR ZERO-SHOT LEARNING
Zihan Ye1,5, Fan Lyu1,2, Linyan Li3, Qiming Fu1,6, Jinchang Ren4, Fuyuan Hu1,7
1Suzhou University of Science and Technology, China, 2Tianjin University, China, 3Suzhou Institute of Trade & Commerce, China, 4University of Strathclyde, UK, 5Virtual Reality Key Laboratory of Intelligent Interaction and Application Technology of Suzhou, China, 6Key Laboratory of Intelligent Building Energy Efficiency, China, 7Suzhou Key Laboratory for Big Data and Information Service, China
[ID:43] COMPARE MORE NUANCED: PAIRWISE ALIGNMENT BILINEAR NETWORK FOR FEW-SHOT FINE-GRAINED LEARNING
Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu
University of Technology Sydney, Australia
[ID:44] FEATURE-AWARE AND CONTENT-WISE DENOISING OF 3D STATIC AND DYNAMIC MESHES USING DEEP AUTOENCODERS
Gerasimos Arvanitis1, Aris S. Lalos2, and Konstantinos Moustakas1
1University of Patras, Greece, 2"ATHENA" Research Center, Greece
[ID:45] REAL-TIME MONOCULAR VISUAL SLAM BY COMBINING POINTS AND LINES
Xinyu Wei, Jun Huang, Xiaoyuan Ma
Shanghai Advanced Research Institute, China
[ID:46] F-NUMBER ADAPTATION FOR MAXIMIZING THE SENSOR USAGE OF LIGHT FIELD CAMERAS
Chuanpu Li, Xin Jin, Junke Li and Qionghai Dai
Graduate School at Shenzhen, Tsinghua University, China
[ID:47] BLIND CALIBRATION FOR FOCUSED PLENOPTIC CAMERAS
Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen and Qionghai Dai
Graduate School at Shenzhen, Tsinghua University, China
Poster Session 3 & Demo Session 1
Wednesday, July 10, 2019
P-08: Multimedia Creation and Enhancement
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Jing-Hao Xue University College London, UK
[ID:1] BOUNDARY AWARE MULTI-FOCUS IMAGE FUSION USING DEEP NEURAL NETWORK
Haoyu Ma, Juncheng Zhang, Shaojun Liu, Qingmin Liao
Graduate School at Shenzhen, Tsinghua University, China
[ID:2] A MULTI-LEVEL AGGREGATED NETWORK FOR IMAGE RESTORATION
Chenxi Ma, Weimin Tan, Bahetiyaer Bare, and Bo Yan
Fudan University, China
[ID:3] UNSUPERVISED FACIAL IMAGE SYNTHESIS USING TWO-DISCRIMINATOR ADVERSARIAL AUTOENCODER NETWORK
Xuehui Wu, Jie Shao, Dongyang Zhang, Junming Chen
University of Electronic Science and Technology of China, China
[ID:4] FACIAL IMAGE INPAINTING USING MULTI-LEVEL GENERATIVE NETWORK
Jie Liu, Cheolkon Jung
Xidian University, China
[ID:5] A VIDEO POST-FILTER DEBLOCKING METHOD BASED ON TEMPORAL BOOSTING RESIDUAL NETWORKS
Jianyu Wang1, Shaohui Liu1,2, Feng Jiang1,2, Xiaoshuai Sun1, Yongliang Liu3
1Harbin Institute of Technology, China, 2Pengcheng Laboratory, China, 3Alibaba Group, China
[ID:6] DISTILLING WITH RESIDUAL NETWORK FOR SINGLE IMAGE SUPER RESOLUTION
Xiaopeng Sun, Wen Lu, Rui Wang, Furui Bai
Xidian University, China
[ID:7] RDGAN: RETINEX DECOMPOSITION BASED ADVERSARIAL LEARNING FOR LOW-LIGHT ENHANCEMENT
Junyi Wang, Weimin Tan, Xuejing Niu and Bo Yan
Fudan University, China
[ID:8] SINGLE IMAGE DE-RAINING VIA GENERATIVE ADVERSARIAL NETS
Shichao Li, Yonghong Hou, Huanjing Yue, Zihui Guo
Tianjin University, China
[ID:9] SWITCHGAN FOR MULTI-DOMAIN FACIAL IMAGE TRANSLATION
Yuanlue Zhu, Mengchao Bai, Linlin Shen, Zhiwei Wen
Shenzhen University, China
[ID:10] A FEATURE-BASED APPROACH FOR LIGHT FIELD VIDEO ENHANCEMENT
Michele Brizzi, Federica Battisti, Alessandro Neri
Roma Tre University, Italy
Wednesday, July 10, 2019
P-09: Multimedia and Vision I
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Qixiang Ye University of Chinese Academy of Sciences, China
[ID:11] EASY TRANSFER LEARNING BY EXPLOITING INTRA-DOMAIN STRUCTURES
Jindong Wang1, Yiqiang Chen1, Han Yu2, Meiyu Huang3, Qiang Yang4
1Chinese Academy of Sciences, China, 2Nanyang Technological University, Singapore, 3China Academy of Space Technology, China, 4Hong Kong University of Science and Technology, China
[ID:12] SKELETON-BASED ACTION RECOGNITION WITH SYNCHRONOUS LOCAL AND NON-LOCAL SPATIO-TEMPORAL LEARNING AND FREQUENCY ATTENTION
Guyue Hu1,2, Bo Cui1,2, Shan Yu1,2
1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:13] DEEP GEOMETRY EMBEDDING NETWORKS FOR ROBUST FACIAL LANDMARK DETECTION
Meilu Zhu, Daming Shi
Shenzhen University, China
[ID:14] JOINT PROJECTION AND SUBSPACE LEARNING FOR ZERO-SHOT RECOGNITION
Guangzhen Liu, Jiechao Guan, Manli Zhang, Jianhong Zhang, Zihao Wang, Zhiwu Lu
Renmin University of China, China
[ID:15] PART-PRESERVING POSE MANIPULATION FOR PERSON IMAGE SYNTHESIS
Haoye Dong1, Xiaodan Liang1, Chenxing Zhou1, Hanjiang Lai1, Jia Zhu2, Jian Yin1
1Sun Yat-sen University, China, 2South China Normal University, China
[ID:16] BREGMAN-TANIMOTO BASED METHOD FOR CONTRAST PRESERVING DECOLORIZATION
He Chen, Faming Fang
East China Normal University, China
[ID:17] TDCC: TOP-DOWN SEMANTIC AGGREGATION FOR COLOR CONSTANCY
Xiaoqiang Li, Yaqin Zhu, Jiayue Han, Jide Li, Weiqin Tong
Shanghai University, China
[ID:18] FROM MARKET TO DISH: MULTI-INGREDIENT IMAGE RECOGNITION FOR PERSONALIZED RECIPE RECOMMENDATION
Lin Zhang1, Jianbo Zhao2, Si Li2, Boxin Shi1,3, Ling-Yu Duan1,3
1Peking University, China, 2Beijing University of Posts and Telecommunications, China, 3Peng Cheng Laboratory, China
[ID:19] IMPROVING OPEN SET DOMAIN ADAPTATION USING IMAGE-TO-IMAGE TRANSLATION
Hongjie Zhang1, Ang Li2, Xu Han1, Zhaoming Chen1, Yang Zhang1, Yanwen Guo1
1Nanjing University, China, 2DeepMind, Mountain View, USA
[ID:20] STRUCTURE GENERATION AND GUIDANCE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
Chaoqun Wang, Xuejin Chen, Shaobo Min, Feng Wu
University of Science and Technology of China, China
[ID:21] A CONDITIONAL BAYESIAN BLOCK STRUCTURE INFERENCE MODEL FOR OPTIMIZED AV1 ENCODING
Xinyao Chen1, Bichuan Guo1, Minhao Tang1, Yuxing Han2, Jiangtao Wen1
1Tsinghua University, China, 2South China Agricultural University, China
[ID:22] LEARNING TO REMOVE REFLECTIONS FOR TEXT IMAGES
Ce Wang1, Renjie Wan2, Feng Gao3, Boxin Shi1,4, Ling-Yu Duan1,4
1Peking University, China, 2Nanyang Technological University, Singapore, 3Tsinghua University, China, 4Peng Cheng Laboratory, China
Wednesday, July 10, 2019
P-10: Oral-17 to Oral-24
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Roger Zimmermann National University of Singapore, Singapore
[ID:23] VIDEO EMOTION RECOGNITION WITH CONCEPT SELECTION
Baohan Xu1, Yingbin Zheng2, Hao Ye2, Caili Wu3, Heng Wang1, Gufei Sun1
1Zhongan Technology, China, 2Videt Tech, USA, 3East China Normal University, China
[ID:24] GRAPH CONVOLUTIONAL LSTM MODEL FOR SKELETON-BASED ACTION RECOGNITION
Han Zhang, Yonghong Song, Yuanlin Zhang
Xi’an Jiaotong University, China
[ID:25] LEARNING RECURRENT STRUCTURE-GUIDED ATTENTION NETWORK FOR MULTI-PERSON POSE ESTIMATION
Zhongwei Qiu1, Kai Qiu2, Jianlong Fu2, Dongmei Fu1
1University of Science and Technology Beijing, China, 2Microsoft Research, China
[ID:26] PCPCAD: PROPOSAL COMPLEMENTARY ACTION DETECTOR
Zhenying Fang1, Suguo Zhu1, Jun Yu1, Qi Tian2,3
1Hangzhou Dianzi University, China, 2Huawei Noah’s Ark Lab, China, 3The University of Texas at San Antonio, USA
[ID:27] PERSONALITY DRIVEN MULTI-TASK LEARNING FOR IMAGE AESTHETIC ASSESSMENT
Leida Li1,2, Hancheng Zhu2, Sicheng Zhao3, Guiguang Ding4, Hongyan Jiang2, Allen Tan5
1Xidian University, China, 2China University of Mining and Technology, China, 3University of California Berkeley, USA, 4Tsinghua University, China, 5Tencent, China
[ID:28] VIDEO QUALITY TEMPORAL POOLING USING A VISIBILITY MEASURE
Chen Bai, Amy R. Reibman
Purdue University, USA
[ID:29] IMAGE QUALITY ASSESSMENT OF MULTI-EXPOSURE IMAGE FUSION FOR BOTH STATIC AND DYNAMIC SCENES
Yuming Fang1, Yan Zeng1, Hanwei Zhu1, Guangtao Zhai2
1Jiangxi University of Finance and Economics, China, 2Shanghai Jiao Tong University, China
[ID:30] NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON LOCAL TO GLOBAL FEATURE REGRESSION
Sumei Li, Jianwei Xue, Yongtian Han
Tianjin University, China
[ID:31] HERDING EFFECT BASED ATTENTION FOR PERSONALIZED TIME-SYNC VIDEO RECOMMENDATION
Wenmian Yang1,2, Wenyuan Gao1, Xiaojie Zhou1, Weijia Jia1,2, Shaohua Zhang1,2, Yutao Luo1
1Shanghai Jiao Tong University, China, 2University of Macau, China
[ID:32] SEQUENTIAL BEHAVIOR MODELING FOR NEXT MICRO-VIDEO RECOMMENDATION WITH COLLABORATIVE TRANSFORMER
Shang Liu, Zhenzhong Chen
Wuhan University, China
[ID:33] BUTTONTIPS: DESIGNING WEB BUTTONS WITH SUGGESTIONS
Dawei Liu, Ying Cao, Rynson W.H. Lau, Antoni B. Chan
City University of Hong Kong, China
[ID:34] KNOWING USER BETTER: MICRO-VIDEO RECOMMENDER SYSTEM BY JOINTLY OPTIMIZING TO CLICK-THROUGH AND PLAYTIME
Shengjie Ma, Zhengjun Zha, Feng Wu
University of Science and Technology of China, China
[ID:35] ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MODAL SIMILARITIES
Xin Wen1, Zhizhong Han1,2, Xinyu Yin1, Yu-Shen Liu1
1Tsinghua University, China, 2University of Maryland, USA
[ID:36] SEMI-SUPERVISED COMPATIBILITY LEARNING ACROSS CATEGORIES FOR CLOTHING MATCHING
Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang
Chinese Academy of Sciences, China
[ID:37] ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH
Kevin Lin1, Fan Yang2, Qiaosong Wang2, Robinson Piramuthu2
1University of Washington, USA, 2eBay Inc., USA
[ID:38] A MASK BASED DEEP RANKING NEURAL NETWORK FOR PERSON RETRIEVAL
Lei Qi1, Jing Huo1, Lei Wang2, Yinghuan Shi1, Yang Gao1
1Nanjing University, China, 2University of Wollongong, Australia
[ID:39] DISCO: DEPTH INFERENCE FROM STEREO USING CONTEXT
Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluriy, Rituparna Sarkar, Pankaj Bajpai
Samsung Research Institute Bangalore, India
[ID:40] PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION
Yunian Chen1, Yanjie Wang3, Yang Zhang1, Yanwen Guo1,2
1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, China, 3Zhejiang University, China
[ID:41] UNTARGETED ADVERSARIAL ATTACK VIA EXPANDING THE SEMANTIC GAP
Aming Wu1, Yahong Han1, Quanxin Zhang2, Xiaohui Kuang3
1Tianjin University, China, 2Beijing Institute of Technology, China, 3National Key Laboratory of Science and Technology on Information System Security, China
[ID:42] LEARNING GOAL-ORIENTED VISUAL DIALOG AGENTS: IMITATING AND SURPASSING ANALYTIC EXPERTS
Yen-Wei Chang, Wen-Hsiao Peng
National Chiao Tung University, Taiwan
[ID:43] GAN-BASED MULTI-LEVEL MAPPING NETWORK FOR SATELLITE IMAGERY SUPER-RESOLUTION
Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu
Wuhan University, China
[ID:44] QUALITY-GATED CONVOLUTIONAL LSTM FOR ENHANCING COMPRESSED VIDEO
Ren Yang3, Xiaoyan Sun1, Mai Xu2 and Wenjun Zeng1
1Microsoft Research, USA, 2Beihang University, China, 3ETH, Switzerland
[ID:45] COMPOUNDED LAYER-PRIOR UNROLLING: A UNIFIED TRANSMISSION-BASED IMAGE ENHANCEMENT FRAMEWORK
Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo
Dalian University of Technology, China
[ID:46] DEEP PYRAMID VARIATION LEARNING FOR IMAGE INTERPOLATION
Fu Qiang, Wenhan Yang, Ying Li, and Jiaying Liu
Peking University, China
[ID:47] CLOTHES KEYPOINTS LOCALIZATION AND ATTRIBUTE RECOGNITION VIA PRIOR KNOWLEDGE
Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang
Shanghai Jiao Tong University, China
[ID:48] SPATIO-TEMPORAL MULTI-FACTOR DISCRIMINANT ANALYSIS FOR INDIVIDUAL IDENTIFICATION
Yong Su, Zhiyong Feng
Tianjin University, China
[ID:49] CHANNEL-WISE TEMPORAL ATTENTION NETWORK FOR VIDEO ACTION RECOGNITION
Jianjun Lei1, Yalong Jia1, Bo Peng1, Qingming Huang2
1Tianjin University, China, 2University of Chinese Academy of Sciences, China
[ID:50] LOCALIZATION GUIDED FIGHT ACTION DETECTION IN SURVEILLANCE VIDEOS
Qichao Xu1, John See2, Weiyao Lin1
1Shanghai Jiao Tong University, China, 2Multimedia University, Malaysia
[ID:51] RECURSIVE MULTI-STAGE UPSCALING NETWORK WITH DISCRIMINATIVE FUSION FOR SUPER-RESOLUTION
Yue Lu1, Zhuqing Jiang1, Guodong Ju2, Liangheng Shen2, Aidong Men1
1Beijing University of Posts and Telecommunications, China, 2GuangDong TUS-TuWei Technology Co., Ltd, China
[ID:52] IMPROVING IMAGE SUPER-RESOLUTION VIA FEATURE RE-BALANCING FUSION
Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu
Xidian University, China
[ID:53] DIFFICULTY-AWARE IMAGE SUPER RESOLUTION VIA DEEP ADAPTIVE DUAL-NETWORK
Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen
Sun Yat-sen University, China
[ID:54] DENSE-CONNECTED RESIDUAL NETWORK FOR VIDEO SUPER-RESOLUTION
Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang and Dou Jin
Tianjin University, China
Wednesday, July 10, 2019
Demo Session 1
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Dong Liu University of Science and Technology of China, China
[ID:55] BASEBALL PLAYER BEHAVIOR RECOGNITION SYSTEM USING MULTIMODAL FEATURES WITH AN AUGMENTED REALITY DISPLAY ON A SMART GLASS
Wei-Chen Yen1, Chih-Chieh Fang2, Shih-Wei Sun1, Kai-Lung Hua3, and Huang-Chia Shih4
1Department of New Media Art, Taipei National University of the Arts, Taiwan, 2Graduate Institute of Dance Theory, Taipei National University of the Arts, Taiwan, 3Dept. Computer Science and Information Engineering, Natl. Taiwan Univ. of Sci. and Tech., Taiwan, 4Dept. Electrical Engineering, Yuan Ze University, Taiwan
[ID:56] PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY
Liyue Fan
University at Albany SUNY, USA
[ID:57] SMART ADVERTISING IN VIDEOS BASED ON COMPREHENSIVE CONTENT ANALYTICS
Yi Zhang, Fan Luan, Yu-Gang Jiang
Jilian Technology Group (Video++), Shanghai, China
[ID:58] LIVE DEMONSTRATION: HIGH PERFORMANCE FOCUSED PLENOPTIC CAMERA
Chuanpu Li, Xufu Sun, Xin Jin, Qionghai Dai
Graduate School at Shenzhen, Tsinghua University, Shenzhen, China
Poster Session 4 & Demo Session 2
Wednesday, July 10, 2019
P-11: Multimedia and Language I
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Liyue Fan University at Albany, USA
[ID:1] DYNAMIC PSEUDO LABEL DECODING FOR CONTINUOUS SIGN LANGUAGE RECOGNITION
Hao Zhou, Wengang Zhou, Houqiang Li
University of Science and Technology of China, China
[ID:2] DECOUPLING LOCALIZATION AND CLASSIFICATION IN SINGLE SHOT TEMPORAL ACTION DETECTION
Yupan Huang1, Qi Dai2, Yutong Lu1
1Sun Yat-sen University, China, 2Microsoft Research, USA
[ID:3] IMAGE-TO-TREE: A TREE-STRUCTURED DECODER FOR IMAGE CAPTIONING
Zhiming Ma, Chun Yuan, Yangyang Cheng, Xinrui Zhu
Tsinghua University, China
[ID:4] MULTIMODAL SEMANTIC ATTENTION NETWORK FOR VIDEO CAPTIONING
Liang Sun1, Bing Li2, Chunfeng Yuan2, Zhengjun Zha1, Weiming Hu2
1University of Science and Technology of China, China, 2Chinese Academy of Sciences, China
[ID:5] CONCRETE IMAGE CAPTIONING BY INTEGRATING CONTENT SENSITIVE AND GLOBAL DISCRIMINATIVE OBJECTIVE
Jie Wu1, Tianshui Chen1,2, Hefeng Wu1,3, Zhi Yang1, Qing Wang1,2, Liang Lin1,2
1Sun Yat-sen University, China, 2DarkMatter AI Research, United Arab Emirates, 3Guangdong University of Foreign Studies, China
[ID:6] REVNET: BRING REVIEWING INTO VIDEO CAPTIONING FOR A BETTER DESCRIPTION
Huidong Li, Dandan Song, Lejian Liao, Cuimei Peng
Beijing Institute of Technology, China
[ID:7] MULTIMODAL IMAGE CAPTIONING THROUGH COMBINING REINFORCED CROSS ENTROPY LOSS AND STOCHASTIC DEPRECATION
Xi Meng, Hao Kong, Dongqi Tang, Tong Lu
Nanjing University, China
Wednesday, July 10, 2019
P-12: Advances in Artificial Intelligence
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Baoning Niu Taiyuan University of Technology, China
[ID:8] INVERSENET: SOLVING INVERSE PROBLEMS WITH SPLITTING NETWORKS
Qi Wei1, Kai Fan2, Wenlin Wang3, Tianhang Zheng4, Amit Chakraborty5, Katherine Heller3, Changyou Chen4, Kui Ren4,6
1J.P. Morgan, USA, 2Alibaba DAMO Academy, China, 3Duke University, USA, 4SUNY at Buffalo, USA, 5Siemens Corporate Technology, China, 6Zhejiang University, China
[ID:9] HIGH-RESOLUTION DRIVING SCENE SYNTHESIS USING STACKED CONDITIONAL GANS AND SPECTRAL NORMALIZATION
Shaobo Lin1, Long Chen1, Qin Zou2, Wei Tian3
1Sun Yat-sen University, China, 2Wuhan University, China, 3Karlsruhe Institute of Technology, Germany
[ID:10] ZERO-SHOT LEARNING WITH FEW SEEN CLASS SAMPLES
Yuqi Huo, Jiechao Guan, Jianhong Zhang, Manli Zhang, Ji-Rong Wen, Zhiwu Lu
Renmin University of China, China
[ID:11] ATTENTIONDROP FOR CONVOLUTIONAL NEURAL NETWORKS
Zhihao Ouyang1, Yan Feng1,2, Zihao He1, Tianbo Hao1, Tao Dai1,2, Shu-Tao Xia1,2
1Tsinghua University, China, 2Peng Cheng Laboratory, China
[ID:12] MULTI-VIEW CLUSTERING VIA SIMULTANEOUSLY LEARNING GRAPH REGULARIZED LOW-RANK TENSOR REPRESENTATION AND AFFINITY MATRIX
Yongyong Chen, Xiaolin Xiao, Yicong Zhou
University of Macau, China
[ID:13] DATA AUGMENTATION FOR MONAURAL SINGING VOICE SEPARATION BASED ON VARIATIONAL AUTOENCODER-GENERATIVE ADVERSARIAL NETWORK
Boxin He1, Shengbei Wang1, Weitao Yuan1, Jianming Wang1, Masashi Unoki2
1Tianjin Polytechnic University, China, 2Japan Advanced Institute of Science and Technology, Japan
[ID:14] TOWARDS BETTER UNCERTAINTY SAMPLING: ACTIVE LEARNING WITH MULTIPLE VIEWS FOR DEEP CONVOLUTIONAL NEURAL NETWORK
Tao He1, Xiaoming Jin1, Guiguang Ding1, Lan Yi2, Chenggang Yan3
1Tsinghua University, China, 2Cisco, USA, 3Hangzhou Dianzi University, China
[ID:15] LOCAL METRIC LEARNING BASED ON ANCHOR POINTS FOR MULTIMEDIA ANALYSIS
Chunbin Gu, Jiajun Bu, Keyue Shi, Zhi Yu, Beidou Wang, Liangcheng Li
Zhejiang University, China
[ID:16] SPARSE REGRESSION-BASED MULTIPLE SEQUENCE ALIGNMENT
Tung Doan1, Takasu Atsuhiro2
1SOKENDAI (The Graduate University for Advanced Studies), Japan, 2National Institute of Informatics, Japan
Wednesday, July 10, 2019
P-13: Multimedia Security, Privacy and Forensics I
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Qieshi Zhang Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
[ID:17] SINGLE IMAGE DERAINING USING A RECURRENT MULTI-SCALE AGGREGATION AND ENHANCEMENT NETWORK
Youzhao Yang, Hong Lu
Fudan University, China
[ID:18] NEURAL NETWORK BASED PHASE COMPENSATION METHODS ON MONAURAL SPEECH SEPARATION
Chunpeng Wang, Jie Zhu
Shanghai Jiao Tong University, China
[ID:19] PALMGAN FOR CROSS-DOMAIN PALMPRINT RECOGNITION
Huikai Shao, Dexing Zhong, Yuhan Li
Xi’an Jiaotong University, China
[ID:20] EVENT-BASED VISION ENHANCED: A JOINT DETECTION FRAMEWORK IN AUTONOMOUS DRIVING
Jianing Li1, Siwei Dong1, Zhaofei Yu1,2, Yonghong Tian1,2, Tiejun Huang1,2
1Peking University, China, 2Pengcheng Laboratory, China
[ID:21] MULTI-DOMAIN EMBEDDING STRATEGIES FOR VIDEO STEGANOGRAPHY BY COMBINING PARTITION MODES AND MOTION VECTORS
Liming Zhai, Lina Wang, Yanzhen Ren
Wuhan University, China
[ID:22] REAL-TIME INDOOR 3D HUMAN IMAGING BASED ON MIMO RADAR SENSING
Hanqing Guo1, Nan Zhang1, Wenjun Shi1, Saeed AlQarni1, Shaoen Wu1, Honggang Wang2
1Ball State University, USA, 2University of Massachusetts Dartmouth, USA
[ID:23] MULTI-SPEAKERS SPEECH SEPARATION BASED ON MODIFIED ATTRACTOR POINTS ESTIMATION AND GMM CLUSTERING
Shanfa Ke1, Ruimin Hu1, Gang Li1, Tingzhao Wu1, Xiaochen Wang1,2, Zhongyuan Wang1
1Wuhan University, China, 2Collaborative Innovation Center of Geospatial Technology, China
[ID:24] LEARNING A DEEP CONVOLUTIONAL NETWORK FOR SUBBAND IMAGE DENOISING
Jing Zhao1, Ruiqin Xiong1, Jizheng Xu2, Feng Wu3, Tiejun Huang1
1Peking University, China, 2ByteDance, USA, 3University of Science and Technology of China, China
[ID:25] MULTI-TASK CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Zhijie Lin, Sen Jia, Bin Deng
Shenzhen University, China
[ID:26] A RETINA-INSPIRED SAMPLING METHOD FOR VISUAL TEXTURE RECONSTRUCTION
Lin Zhu1, Siwei Dong1, Tiejun Huang1,2, Yonghong Tian1,2
1Peking University, China, 2Pengcheng Laboratory, China
Wednesday, July 10, 2019
P-14: Machine Learning Applications in Image and Video Coding II
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Dan Zeng Shanghai University, China
[ID:27] LEARNED SCALABLE IMAGE COMPRESSION WITH BIDIRECTIONAL CONTEXT DISENTANGLEMENT NETWORK
Zhizheng Zhang, Zhibo Chen, Jianxin Lin, Weiping Li
University of Science and Technology of China, China
[ID:28] AN ATTENTION RESIDUAL NEURAL NETWORK WITH RECURRENT GREEDY APPROACH AS LOOP FILTER FOR INTER FRAMES
Jiabao Yao, Li Wang, Fangdong Chen, Chaoyi Lin, Shiliang Pu
Hikvision Research Institute, China
[ID:29] BAYESIAN NONNEGATIVE MATRIX FACTORIZATION WITH A TRUNCATED SPIKE-AND-SLAB PRIOR
Yuhang Liu1,2, Wenyong Dong1, Wanjuan Song1, Lei Zhang3
1Wuhan University, China, 2The University of Adelaide, Australia, 3Inception Institute of Artificial Intelligence, United Arab Emirates
[ID:30] ENCODING COMPLEXITY CONTROL FOR LIVE VIDEO APPLICATIONS: AN INTERPRETABLE MACHINE LEARNING APPROACH
Chao Huang, Zongju Peng, Fen Chen, Qiuping Jiang, Xin Cui, Gangyi Jiang
Ningbo University, China
[ID:31] ENHANCED RESIDUAL DENSE INTRINSIC NETWORK FOR INTRINSIC IMAGE DECOMPOSITION
Risheng Liu1,2,3, Cheng Yang1,3, Long Ma1,3, Miao Zhang1,3, Xin Fan1,3, Zhongxuan Luo1,3
1Dalian University of Technology, China, 2Xidian University, China, 3Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, China
[ID:32] NON-CONVEX TRANSFER SUBSPACE LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION
Zhipeng Lin1, Zhenyu Zhao1, Tingjin Luo1, Wenjing Yang1, Yongjun Zhang2, Yuhua Tang1
1National University of Defense Technology, China, 2National Innovation Institute of Defense Technology, China
[ID:33] DISCRIMINATIVE GROUP COLLABORATIVE COMPETITIVE REPRESENTATION FOR VISUAL CLASSIFICATION
Jianping Gou1, Lei Wang1, Zhang Yi2, Yunhao Yuan3, Weihua Ou4, Qirong Mao1
1Jiangsu University, China, 2Sichuan University, China, 3Yangzhou University, China, 4Guizhou Normal University, China
Wednesday, July 10, 2019
P-15: Multimedia and Vision II
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Shao-Yi Chien National Taiwan University, Taiwan
[ID:34] LARGE-SCALE DATASETS FOR GOING DEEPER IN IMAGE UNDERSTANDING
Jiahong Wu1,2, He Zheng3, Bo Zhao4, Yixin Li4, Baoming Yan4, Rui Liang1, Wenjia Wang4, Shipei Zhou4,5, Guosen Lin2, Yanwei Fu6, Yizhou Wang4, Yonggang Wang1
1Sinovation Ventures, China, 2Ainnovation Technology Ltd, China, 3University of Chinese Academy of Sciences, China, 4Peking University, China, 5Carnegie Mellon University, USA, 6Fudan University, China
[ID:35] REVISIT SURROUND-VIEW CAMERA SYSTEM CALIBRATION
Xuan Shao1, Xiao Liu1, Lin Zhang1, Shengjie Zhao1, Ying Shen1, Yukai Yang2
1Tongji University, China, 2Uppsala University, Sweden
[ID:36] DECOUPLING SEMANTIC CONTEXT AND COLOR CORRELATION WITH MULTI-CLASS CROSS BRANCH REGULARIZATION
Vishal Keshav, Tejpratap GVSL
Samsung Research Institute Bangalore, India
[ID:37] CROWD COUNTING VIA MULTI-VIEW SCALE AGGREGATION NETWORKS
Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin
Sun Yat-sen University, China
[ID:38] PASIAM: PREDICTING ATTENTION INSPIRED SIAMESE NETWORK, FOR SPACE-BORNE SATELLITE VIDEO TRACKING
Jia Shao1, Bo Du1, Chen Wu1, Pingkun Yan2
1Wuhan University, China, 2Rensselaer Polytechnic Institute, USA
[ID:39] A RELATION NETWORK EMBEDDED WITH PRIOR FEATURES FOR FEW-SHOT CARICATURE RECOGNITION
Wenbo Zheng1,2, Lan Yan2,3, Chao Gou2,4, Wenwen Zhang1,2, Fei-Yue Wang2,4
1Xi’an Jiaotong University, China, 2Chinese Academy of Sciences, China, 3University of Chinese Academy of Sciences, China, 4Qingdao Academy of Intelligent Industries, China
[ID:40] A SINGLE-SHOT ORIENTED SCENE TEXT DETECTOR WITH LEARNABLE ANCHORS
Fenfen Sheng1,2, Zhineng Chen1, Tao Mei3, Bo Xu1
1Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3JD AI Research, China
[ID:41] CONTEXT-CONSTRAINED ACCURATE CONTOUR EXTRACTION FOR OCCLUSION EDGE DETECTION
Rui Lu1, Menghan Zhou1, Anlong Ming1, Yu Zhou2
1Beijing University of Posts and Telecommunications, China, 2Huazhong University of Science and Technology, China
[ID:42] LEARNING SIMULTANEOUS FACE SUPER-RESOLUTION USING MULTISET PARTIAL LEAST SQUARES
Yun-Hao Yuan1, Jin Li1, Jianping Gou2, Yun Li1, Jipeng Qiang1, Bin Li1
1Yangzhou University, China, 2Jiangsu University, China
[ID:43] CROPPING REGION PROPOSAL NETWORK BASED FRAMEWORK FOR EFFICIENT OBJECT DETECTION ON LARGE SCALE REMOTE SENSING IMAGES
Qifeng Lin, Jianhui Zhao, Qianqian Tong, Guian Zhang, Gang Fu, Zhiyong Yuan
Wuhan University, China
[ID:44] SPECTRAL ANALYSIS NETWORK FOR DEEP REPRESENTATION LEARNING AND IMAGE CLUSTERING
Jinghua Wang1, Adrian Hilton2, Jianmin Jiang1
1Shenzhen University, China, 2University of Surrey, UK
Wednesday, July 10, 2019
P-16: Oral-13 to Oral-16
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Lei Zhang Microsoft AI & Research, USA
[ID:45] GLOBAL AS-CONFORMAL-AS-POSSIBLE NON-RIGID REGISTRATION OF MULTI-VIEW SCANS
Zhenchao Wu1, Kun Li1, Yu-Kun Lai2, Jingyu Yang1
1Tianjin University, China, 2Cardiff University, UK
[ID:46] A LIGHT-WEIGHTED NETWORK FOR FACIAL LANDMARK DETECTION VIA COMBINED HEATMAP AND COORDINATE REGRESSION
Zhengning Wang1, Longfei Feng1, Fanwei Zeng1, Guang Hu1, Xiang Zhang1, Xia Lv1, Fengjun Zhang2
1University of Electronic Science and Technology of China, China, 2No.30 Institute of CETC, China
[ID:47] LIGHT WEIGHT STEREO MATCHING VIA DEEP EXTRACTION AND INTEGRATION OF LOW AND HIGH LEVEL INFORMATION
Xianzhe Xu1, Yonghong Hou1, Pichao Wang2, Zhongyu Jiang1, Wanqing Li3
1Tianjin University, China, 2Alibaba Group (U.S.) Inc., China, 3University of Wollongong, Australia
[ID:48] JUSTLOOKUP: ONE MILLISECOND DEEP FEATURE EXTRACTION FOR POINT CLOUDS BY LOOKUP TABLES
Hongxin Lin1,2, Zelin Xiao1,2, Yang Tan1,2, Hongyang Chao1, Shengyong Ding1
1Sun Yat-sen University, China, 2Pixtalks Tech, China
[ID:49] MULTIPLE GRAPH CONVOLUTIONAL NETWORKS FOR CO-SALIENCY DETECTION
Bo Jiang1, Xingyue Jiang1, Jin Tang1, Bin Luo1, Shilei Huang2
1Anhui University, China, 2PKU-HKUST Shenzhen Hong Kong Institution, China
[ID:50] QUANNET: JOINT IMAGE COMPRESSION AND CLASSIFICATION OVER CHANNELS WITH LIMITED BANDWIDTH
Lahiru Dulanjana Chamain Hewa Gamage1, Sen-ching S Cheung2, Zhi Ding1
1University of California Davis, USA, 2University of Kentucky, USA
[ID:51] HIGH EFFICIENCY LIGHT FIELD COMPRESSION VIA VIRTUAL REFERENCE AND HIERARCHICAL MV-HEVC
Jiawen Gu, Bichuang Guo, Jiangtao Wen
Tsinghua University, China
[ID:52] SELF-PACED SUBSPACE CLUSTERING
Youfa Liu, Bo Du, Lefei Zhang
Wuhan University, China
[ID:53] COLLOQUIAL IMAGE CAPTIONING
Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji
Xiamen University, China
[ID:54] IMPROVING CAPTIONING FOR LOW-RESOURCE LANGUAGES BY CYCLE CONSISTENCY
Yike Wu1, Shiwan Zhao2, Jia Chen3, Ying Zhang1, Xiaojie Yuan1, Zhong Su2
1Nankai University, China, 2IBM Research, USA, 3Carnegie Mellon University, USA
[ID:55] FRAMERANK: A TEXT PROCESSING APPROACH TO VIDEO SUMMARIZATION
Zhuo Lei1,2, Chao Zhang1,2, Qian Zhang2, Guoping Qiu3,4
1International Doctoral Innovation Center, China, 2The University of Nottingham Ningbo China, China, 3Shenzhen University, China, 4University of Nottingham, UK
[ID:56] CHARACTER IMAGE SYNTHESIS BASED ON SELECTED CONTENT AND REFERENCED STYLE EMBEDDING
Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong
Wuhan University of Technology, China
[ID:57] QUERY-FREE EMBEDDING ATTACK AGAINST DEEP LEARNING
Yujia Liu, Weiming Zhang, Nenghai Yu
University of Science and Technology of China, China
[ID:58] GRAPH ATTENTION NEURAL NETWORKS FOR POINT CLOUD RECOGNITION
Zongmin Li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li
China University of Petroleum (East China), China
[ID:59] MAXIMAL CORRELATION EMBEDDING NETWORK FOR MULTILABEL LEARNING WITH MISSING LABELS
Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang
Tsinghua University, China
[ID:60] SELF-ADAPTION MULTI-CLASSIFIER FUSION NETWORKS FOR IMAGE RECOGNITION
Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang
Dalian University of Technology, China
Wednesday, July 10, 2019
Demo Session 2
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Dong Liu University of Science and Technology of China, China
[ID:61] DEMONSTRATION OF APPLICATIONS IN COMPUTER VISION AND NLP ON ULTRA POWER-EFFICIENT CNN DOMAIN SPECIFIC ACCELERATOR WITH 9.3TOPS/WATT
Baohua Sun, Lin Yang, Wenhan Zhang, Patrick Dong, Charles Young, Jason Dong, Michael Lin
Gyrfalcon Technology Inc., USA
[ID:62] LIGHT FIELD RECONSTRUCTION USING SHEARLET TRANSFORM IN TENSORFLOW
Yuan Gao1, Reinhard Koch1, Robert Bregovic2, Atanas Gotchev2
1Kiel University, Germany, 2Tampere University, Finland
[ID:63] AUTOMATIC LONG-TERM DECEPTION DETECTION IN GROUP INTERACTION VIDEOS
Chongyang Bai1, Maksim Bolonkin1, Judee Burgoon3, Chao Chen1, Norah Dunbar4, Bharat Singh2, V. S. Subrahmanian1, Zhe Wu2
1Dartmouth College, USA, 2University of Maryland, USA, 3University of Arizona, USA, 4University of California Santa Barbara, USA
Poster Session 5 & Grand Challenge
Thursday, July 11, 2019
P-17: Multimedia Understanding and Mixed Reality
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Robert Bregovic Tampere University of Technology, Finland
[ID:1] SALIENT OBJECT DETECTION VIA RECURRENTLY AGGREGATING SPATIAL ATTENTION WEIGHTED CROSS-LEVEL DEEP FEATURES
Chang Tang1, Xinzhong Zhu2, Xinwang Liu3, Pichao Wang4
1China University of Geosciences, China, 2Zhejiang Normal University, China, 3National University of Defense Technology, China, 4Alibaba Group (U.S.), China
[ID:2] FAST REGISTRATION FOR CROSS-SOURCE POINT CLOUDS BY USING WEAK REGIONAL AFFINITY AND PIXEL-WISE REFINEMENT
Xiaoshui Huang1, Lixin Fan2, Qiang Wu1, Jian Zhang1,4, Chun Yuan3
1University of Technology Sydney, Australia, 2Nokia Technologies, Finland, 3Graduate School at Shenzhen, Tsinghua University, China, 4Peng Cheng Laboratory, China
[ID:3] 3D FACE REPRESENTATION AND RECONSTRUCTION WITH MULTI-SCALE GRAPH CONVOLUTIONAL AUTOENCODERS
Cunkuan Yuan1, Kun Li1, Yu-Kun Lai2, Yebin Liu3, Jingyu Yang1
1Tianjin University, China, 2Cardiff University, UK, 3Tsinghua University, China
[ID:4] VISUAL DIALOG WITH TARGETED OBJECTS
Qiang Wang, Yahong Han
Tianjin University, China
[ID:5] A NEW OBJECT SCENE FLOW ALGORITHM BASED ON SUPPORT POINTS SELECTION AND ROBUST MOVING OBJECT PROPOSAL
Zhengyang Sun1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1
1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK
[ID:6] REFINING PROPOSALS WITH NEIGHBORING CONTEXTS FOR TEMPORAL ACTION DETECTION
Dashan Guo, Wei Li, Ning Xu, Jianhui Sun and Xiangzhong Fang
Shanghai Jiao Tong University, China
[ID:7] A DATA-DRIVEN FRAMEWORK FOR APPEARANCE EDITING OF MEASURED MATERIALS
Yanjun Chen1, Jie Guo1, Bingyang Hu1, Yanwen Guo1,2, Jingui Pan1
1Nanjing University, China, 2The 28th Research Institute of China Electronics Technology Group Corporation, China
[ID:8] ACTIVE SEMANTIC LABELING OF STREET VIEW POINT CLOUDS
Yang Zhou1,2, Shuhan Shen1,2, Zhanyi Hu1,2
1NLPR, Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:9] VIDEO PREDICTION WITH TEMPORAL-SPATIAL ATTENTION MECHANISM AND DEEP PERCEPTUAL SIMILARITY BRANCH
Qian Wu, Wenmin Wang, Xiongtao Chen, Weimian Li
Peking University, China
[ID:10] AUTOMATIC LONG-TERM DECEPTION DETECTION IN GROUP INTERACTION VIDEOS
Chongyang Bai1, Maksim Bolonkin1, Judee Burgoon3, Chao Chen1, Norah Dunbar4, Bharat Singh2, V. S. Subrahmanian1, Zhe Wu2
1Dartmouth College, USA, 2University of Maryland, USA, 3University of Arizona, USA, 4University of California Santa Barbara, USA
[ID:11] A NEW ROTATION-INVARIANT DEEP NETWORK FOR 3D OBJECT RECOGNITION
Yachi Zhang1, Zongqing Lu1, Jing-Hao Xue2, Qingmin Liao1
1Graduate School at Shenzhen, Tsinghua University, China, 2University College London, UK
[ID:12] LOCAL OPTICAL FLOW CONSIDERING OBJECT BOUNDARIES BY ADAPTIVE WINDOW POSITIONING
Andreas Kah, Matthias Narroschke
RheinMain University of Applied Sciences, Germany
[ID:13] MULTI-GRANULARITY REASONING FOR SOCIAL RELATION RECOGNITION FROM IMAGES
Meng Zhang1, Xinchen Liu2, Wu Liu2, Anfu Zhou1, Huadong Ma1, Tao Mei2
1Beijing University of Posts and Telecommunications, China, 2JD AI Research, China
Thursday, July 11, 2019
P-18: Media Classification and Segmentation IV
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Jianyu Yang Soochow University, China
[ID:14] MULTI-TIMESCALE CONTEXT ENCODING FOR SCENE PARSING PREDICTION
Xin Chen, Yahong Han
Tianjin University, China
[ID:15] PORTRAIT INSTANCE SEGMENTATION FOR MOBILE DEVICES
Lingyu Zhu1, Tinghuai Wang2, Emre Aksu2, Joni-Kristian Kamarainen1
1Tampere University, Finland, 2Nokia Technologies, Finland
[ID:16] LEARNING TO SEGMENT UNSEEN CATEGORY OBJECTS USING GRADIENT GAUSSIAN ATTENTION
Pengbo Zhang1, Zhihui Wang1, Xinzhu Ma1, Haojie Li1, Jianjun Li2
1Dalian University of Technology, China, 2Hangzhou Dianzi University, China
[ID:17] TEMPORAL SEGMENT CONVOLUTIONAL KERNEL NETWORKS FOR SEQUENCE MODELING OF VIDEOS
Fei Pan1, Yanwen Guo1, Zhicheng Yan2, Jie Guo1
1Nanjing University, China, 2Facebook Research, USA
[ID:18] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
Shaoshuai Li, Fuyan Liu
Shanghai University, China
[ID:19] SPFUSIONNET: SKETCH SEGMENTATION USING MULTI-MODAL DATA FUSION
Fei Wang1, Shujin Lin1, Hefeng Wu2, Hanhui Li3, Ruomei Wang1, Xiaonan Luo3, Xiangjian He4
1Sun Yat-sen University, China, 2Guangdong University of Foreign Studies, China, 3Guilin University of Electronic Technology, China, 4University of Technology Sydney, Australia
[ID:20] ADAPTIVE COMPONENT EMBEDDING FOR UNSUPERVISED DOMAIN ADAPTATION
Mengmeng Jing1, Jingjing Li1, Ke Lu1, Jieyan Liu1, Zi Huang2
1University of Electronic Science and Technology of China, China, 2The University of Queensland, Australia
[ID:21] ACOUSTIC SCENE CLASSIFICATION WITH MISMATCHED RECORDING DEVICES USING MIXTURE OF EXPERTS LAYER
Truc Nguyen, Franz Pernkopf
Graz University of Technology, Austria
[ID:22] SELF-REPRESENTATION CONVOLUTIONAL NEURAL NETWORKS
Hongchao Gao1,2, Xi Wang1, Yujia Li1,2, Jizhong Han1, Songlin Hu1, Ruixuan Li3
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Huazhong University of Science and Technology, China
Thursday, July 11, 2019
P-19: Oral-29 to Oral-35
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Honggang Wang University of Massachusetts (UMass) Dartmouth, USA
[ID:23] PEDESTRIAN RE-IDENTIFICATION BASED ON TREE BRANCH NETWORK WITH LOCAL AND GLOBAL LEARNING
Hui Li1, Meng Yang2, Zhihui Lai1, Weishi Zheng2, Zitong Yu3
1Shenzhen University, China, 2Sun Yat-sen University, China, 3University of Oulu, Finland
[ID:24] ADVERSARIAL BINARY CODING FOR EFFICIENT PERSON RE-IDENTIFICATION
Zheng Liu1, Jie Qin2, Annan Li1, Yunhong Wang1, and Luc Van Gool3
1Beihang University, China, 2Inception Institute of Artificial Intelligence, United Arab Emirates, 3Computer Vision Laboratory, ETH Zurich, Switzerland
[ID:25] PERSON RE-IDENTIFICATION WITH GRADUAL BACKGROUND SUPPRESSION
Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao
Xidian University, China
[ID:26] MULTI-BRANCH CONTEXT-AWARE NETWORK FOR PERSON RE-IDENTIFICATION
Yingxin Zhu1, Xiaoqiang Guo2, Jianlei Liu1, Zhuqing Jiang1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China
[ID:27] POST-PROCESSING OF WORD REPRESENTATIONS VIA VARIANCE NORMALIZATION AND DYNAMIC EMBEDDING
Bin Wang1, Fenxiao Chen1, Angela Wang2 and C.-C. Jay Kuo1
1University of Southern California, USA, 2University of California, Berkeley, USA
[ID:28] MULTI-MODAL LANGUAGE ANALYSIS WITH HIERARCHICAL INTERACTION-LEVEL AND SELECTION-LEVEL ATTENTION
Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou
Soochow University, China
[ID:29] MODELING THE CLAUSE-LEVEL STRUCTURE TO MULTIMODAL SENTIMENT ANALYSIS VIA REINFORCEMENT LEARNING
Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou
Soochow University, China
[ID:30] TWICE OPPORTUNITY KNOCKS SYNTACTIC AMBIGUITY: A VISUAL QUESTION ANSWERING MODEL WITH YES/NO FEEDBACK
Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin
Tianjin Polytechnic University, China
[ID:31] GEOCAPSNET: GROUND TO AERIAL VIEW IMAGE GEO-LOCALIZATION USING CAPSULE NETWORK
Bin Sun1, Chen Chen2, Yingying Zhu1, Jianmin Jiang1
1Shenzhen University, China, 2University of North Carolina at Charlotte, USA
[ID:32] IMPROVING ROBUSTNESS OF DASH AGAINST NETWORK UNCERTAINTY
Bo Wang1,2, Fengyuan Ren1,2
1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China
[ID:33] HYBRID CONTROL-BASED ABR: TOWARDS LOW-DELAY LIVE STREAMING
Bo Wang1,2, Fengyuan Ren1,2, Chao Zhou3
1Beijing National Research Center for Information Science and Technology, China, 2Tsinghua University, China, 3Beijing Kuaishou Technology Co., Ltd., China
[ID:34] TAXI ORIGIN-DESTINATION DEMAND PREDICTION WITH CONTEXTUALIZED SPATIAL-TEMPORAL NETWORK
Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin
Sun Yat-sen University, China
[ID:35] FAST IMAGE CLUSTERING BASED ON CAMERA FINGERPRINT ORDERING
Sahib Khan, Tiziano Bianchi
Politecnico di Torino, Italy
[ID:36] ENFORCING ACCESS CONTROL IN DISTRIBUTED VERSION CONTROL SYSTEMS
Xin Xu1,2, Quanwei Cai1,2, Jingqiang Lin1,2, Shiran Pan1,2, Liangqin Ren1,2
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:37] ATTRIBUTE-BASED ACCOUNTABLE ACCESS CONTROL FOR MULTIMEDIA CONTENT WITH IN-NETWORK CACHING
Peixuan He1, Kaiping Xue1, Jie Xu1, Qiudong Xia1, Jianqing Liu2, Hao Yue3
1University of Science and Technology of China, China, 2University of Alabama in Huntsville, USA, 3San Francisco State University, USA
[ID:38] PRACTICAL IMAGE OBFUSCATION WITH PROVABLE PRIVACY
Liyue Fan
University at Albany, State University of New York, USA
[ID:39] JOINTLY SOLVING DEBLURRING AND SUPER-RESOLUTION PROBLEMS WITH DUAL SUPERVISED NETWORK
Zhenwen Liang, Dongyang Zhang, Jie Shao
University of Electronic Science and Technology of China, China
[ID:40] TWO-STAGED ACOUSTIC MODELING ADAPTION FOR ROBUST SPEECH RECOGNITION BY THE EXAMPLE OF GERMAN ORAL HISTORY INTERVIEWS
Michael Gref1,2, Christoph Schmidt1, Sven Behnke1,3, Joachim Köhler1
1Fraunhofer Institute for Intelligent Analysis and Information Systems, Germany, 2Niederrhein University of Applied Sciences, Germany, 3University of Bonn, Germany
[ID:41] AN ADAPTIVE AFFINITY GRAPH WITH SUBSPACE PURSUIT FOR NATURAL IMAGE SEGMENTATION
Yang Zhang1, Huiming Zhang1, Yanwen Guo1, Kai Lin2, Jingwu He1
1Nanjing University, China, 2Hubei University of Technology, China
[ID:42] PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY
Li He, Yi Zhou, Hongqing Liu
Chongqing University of Posts and Telecommunications, China
[ID:43] LOCALITY-CONSTRAINED SPATIAL TRANSFORMER NETWORK FOR VIDEO CROWD COUNTING
Yanyan Fang1, Biyun Zhan1, Wandi Cai1, Shenghua Gao2, Bo Hu1
1Fudan University, China, 2ShanghaiTech University, China
[ID:44] SPATIAL-AWARE NON-LOCAL ATTENTION FOR FASHION LANDMARK DETECTION
Yixin Li1, Shengqin Tang2, Yun Ye3, Jinwen Ma1
1Peking University, China, 2Xi’an Jiaotong University, China, 3JD AI Research, China
[ID:45] RELATIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
Wu Zheng1,2, Lin Li1,2, Zhaoxiang Zhang1,2, Yan Huang1,2, Liang Wang1,2
1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
[ID:46] MULTI-VIEW LEARNING FOR VEHICLE RE-IDENTIFICATION
Weipeng Lin1, Yidong Li1, Xiaoliang Yang1, Peixi Peng2, Junliang Xing2
1Beijing Jiaotong University, China, 2Institute of Automation, Chinese Academy of Sciences, China
[ID:47] MANY COULD BE BETTER THAN ALL: A NOVEL INSTANCE-ORIENTED ALGORITHM FOR MULTI-MODAL MULTI-LABEL PROBLEM
Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang
Nanjing University, China
[ID:48] AFFECTIVE VIDEO CONTENT ANALYSES BY USING CROSS-MODAL EMBEDDING LEARNING FEATURES
Benchao Li1,3, Zhenzhong Chen2, Shan Li3, WeiShi Zheng1,4
1Sun Yat-Sen University, China, 2Wuhan University, China, 3Tencent, China, 4Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
[ID:49] LEARNING A 3D GAZE ESTIMATOR WITH IMPROVED ITRACKER COMBINED WITH BIDIRECTIONAL LSTM
Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen
Zhejiang University of Technology, China
[ID:50] DETECTION OF OCCLUDED ROAD SIGNS ON AUTONOMOUS DRIVING VEHICLES
Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang
University of North Texas, USA
Thursday, July 11, 2019
Grand Challenge
Time: 13:30 - 15:00 PM
Room: 3rd Floor
Chair: Jiaying Liu Peking University, China
[ID:51] SALIENCY PREDICTION VIA MULTI-LEVEL FEATURES AND DEEP SUPERVISION FOR CHILDREN WITH AUTISM SPECTRUM DISORDER
Weijie Wei1, Zhi Liu1, Lijin Huang1, Alexis Nebout2, Olivier Le Meur2
1Shanghai University, China, 2University of Rennes 1, France
[ID:52] VISUAL ATTENTION MODELING FOR AUTISM SPECTRUM DISORDER BY U-NET
Yuming Fang, Hanqin Huang, Boyang Wan, and Yifan Zuo
Jiangxi University of Finance and Economics, China
[ID:53] PREDICTING SALIENCY MAPS FOR ASD PEOPLE
Alexis Nebout1, Weijie Wei2, Zhi Liu2, Lijin Huang2, Olivier Le Meur1
1University of Rennes 1, France, 2Shanghai University, China
[ID:54] CLASSIFYING AUTISM SPECTRUM DISORDER BASED ON SCANPATHS AND SALIENCY
Mikhail Startsev, Michael Dorr
Technical University of Munich, Germany
[ID:55] EXPLOITING VISUAL BEHAVIOUR FOR AUTISM SPECTRUM DISORDER IDENTIFICATION
Giuliano Arru, Pramit Mazumdar, Federica Battisti
Roma Tre University, Italy
[ID:56] SP-ASDNET: CNN-LSTM BASED ASD CLASSIFICATION MODEL USING OBSERVER SCANPATHS
Yudong Tao, Mei-Ling Shyu
University of Miami, USA
[ID:57] PREDICTING AUTISM DIAGNOSIS USING IMAGE WITH FIXATIONS AND SYNTHETIC SACCADE PATTERNS
Chongruo Wu1, Sidrah Liaqat2, Sen-ching Cheung2, Chen-Nee Chuah1, Sally Ozonoff1
1University of California, Davis, USA, 2University of Kentucky, USA
[ID:58] A SIMPLE BUT USEFUL MODEL FOR CLASSIFYING ASD AND NORMAL VIEWERS USING GAZE DATA AND LINEAR REGRESSION
S. Xu, J. Yan, M. Hu
Shanghai Key Laboratory of Multidimensional Information Processing, East China Normal University, China
Poster Session 6
Thursday, July 11, 2019
P-20: Multimedia Communications, Networking and Mobility
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Tsung-Jung Liu National Chung Hsing University, Taiwan
[ID:1] TIYUNTSONG: A SELF-PLAY REINFORCEMENT LEARNING APPROACH FOR ABR VIDEO STREAMING
Tianchi Huang, Xin Yao, Chenglei Wu, Rui-Xiao Zhang, Zhengyuan Pang, Lifeng Sun
Tsinghua University, China
[ID:2] EDGE-BOOST: ENHANCING MULTIMEDIA DELIVERY WITH MOBILE EDGE CACHING IN 5G-D2D NETWORKS
Venkatraman Balasubramanian1, Mu Wang1, Martin Reisslein1, Changqiao Xu2
1Arizona State University, USA, 2University of Posts and Telecommunications, China
[ID:3] 3D MESH BASED INTER-IMAGE PREDICTION FOR IMAGE SET COMPRESSION
Hao Wu1, Xiaoyan Sun2, Jingyu Yang1, Feng Wu3
1Tianjin University, China, 2Microsoft Research Asia, China, 3University of Science and Technology of China, China
[ID:4] FAST INTER MODE PREDICTIONS FOR SHVC
Dayong Wang1, Yu Sun2, Weisheng Li1, Ce Zhu3, Frederic Dufaux4
1Chongqing University of Posts and Telecommunications, China, 2University of Central Arkansas, USA, 3University of Electronic Science and Technology of China, China, 4CNRS - CentraleSupélec – Université Paris-Sud, France
[ID:5] HIT RATIO DRIVEN MOBILE EDGE CACHING SCHEME FOR VIDEO ON DEMAND SERVICES
Xing Chen, Lijun He, Shang Xu, Shibo Hu, Qingzhou Li, Guizhong Liu
Xi’an Jiao Tong University, China
[ID:6] QOE-DRIVEN MOBILE STREAMING: A LOCATION-AWARE APPROACH
Fang Liu, Wei Zhang, Yonggang Wen
Nanyang Technological University, Singapore
[ID:7] ENERGY EFFICIENT TRANSMISSION OF 3D MESHES OVER MMWAVE-BASED MASSIVE MIMO SYSTEMS
Aris Lalos1, Gerasimos Arvanitis2, Evangelos Vlachos3, Konstantinos Moustakas2
1Industrial System Institute, Greece, 2University of Patras, Greece, 3University of Edinburgh, UK
[ID:8] IDENTIFYING INFLUENTIAL USERS IN MOBILE DEVICE-TO-DEVICE SOCIAL NETWORKS TO PROMOTE OFFLINE MULTIMEDIA CONTENT PROPAGATION
Hao Fan, Xu Tong, Qing Zhang, Tianxiang Zhang, Chenyang Wang and Xiaofei Wang
Tianjin University, China
Thursday, July 11, 2019
P-21: Object Detection II
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Ye Luo Tongji University, China
[ID:9] ACCURATE AND EFFICIENT OBJECT DETECTION WITH CONTEXT ENHANCEMENT BLOCK
Yuhao Chen1, Min Zhao1, Xin Tan2, Hong Tang1, Dihua Sun1
1Chongqing University, China, 2Shanghai Jiao Tong University, China
[ID:10] MASK GUIDED KNOWLEDGE DISTILLATION FOR SINGLE SHOT DETECTOR
Yousong Zhu1,2, Chaoyang Zhao1,2, Chenxia Han3, Jinqiao Wang1,2, Hanqing Lu1,2
1Institute of Automation, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Wuhan University, China
[ID:11] VIDEO TEXT DETECTION WITH FULLY CONVOLUTIONAL NETWORK AND TRACKING
Yang Wang, Lan Wang, Feng Su, Jiahao Shi
Nanjing University, China
[ID:12] CASCADE REGION PROPOSAL NETWORKS FOR OBJECT DETECTION IN THE WILD
DongMing Yang1,2, YueXian Zou1,2
1Peking University, China, 2Peng Cheng Laboratory, China
[ID:13] TRACKING ASSISTED FASTER VIDEO OBJECT DETECTION
Wenfei Yang, Bin Liu, Weihai Li, Nenghai Yu
University of Science and Technology of China, China
[ID:14] REFINETEXT: REFINING MULTI-ORIENTED SCENE TEXT DETECTION WITH A FEATURE REFINEMENT MODULE
Pengyuan Xie, Jing Xiao, Yang Cao, Jia Zhu, Asad Khan
South China Normal University, China
[ID:15] MULTI-SCALE CAPSULE ATTENTION-BASED SALIENT OBJECT DETECTION WITH MULTI-CROSSED LAYER CONNECTIONS
Qi Qi1, Sanyuan Zhao1, Jianbing Shen1, Kin-Man Lam2
1Beijing Institute of Technology, China, 2The Hong Kong Polytechnic University, China
Thursday, July 11, 2019
P-22: Artificial Intelligence for Multimedia
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Kunal Swami Samsung, Korea
[ID:16] CONTINUOUS BIDIRECTIONAL OPTICAL FLOW FOR VIDEO FRAME SEQUENCE INTERPOLATION
Donghao Gu, Zhaojing Wen, Wenxue Cui, Rui Wang, Feng Jiang, Shaohui Liu
Harbin Institute of Technology, China
[ID:17] ROBUST DEEP TRACKING WITH TWO-STEP AUGMENTATION DISCRIMINATIVE CORRELATION FILTERS
Chunhui Zhang1,2, Shiming Ge1, Yingying Hua1,2, Dan Zeng3
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China, 3Shanghai University, China
[ID:18] EFFICIENT IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORKS WITH END TO END INTEGER-ONLY DATAFLOW
Yiwu Yao, Bin Dong, Yuke Li, Weiqiang Yang, Haoqi Zhu
Yidun Lab, NetEase Inc, China
[ID:19] LEARNING MOTION-AWARE POLICIES FOR ROBUST VISUAL TRACKING
Qianqian Wang, Liansheng Zhuang, Ning Wang, Wengang Zhou, Houqiang Li
University of Science and Technology of China, China
[ID:20] KNOWLEDGE DISTILLATION WITH CATEGORY-AWARE ATTENTION AND DISCRIMINANT LOGIT LOSSES
Lei Jiang, Wengang Zhou, Houqiang Li
University of Science and Technology of China, China
[ID:21] UNSUPERVISED LEARNING OF DEPTH AND EGO-MOTION WITH SPATIAL-TEMPORAL GEOMETRIC CONSTRAINTS
Anjie Wang1, Yongbin Gao1, Zhijun Fang1, Xiaoyan Jiang1, Shanshe Wang2, Siwei Ma2, Jenq-Neng Hwang3
1Shanghai University of Engineering Science, China, 2Peking University, China, 3University of Washington, USA
[ID:22] LEARNING MINIMAL INTRA-GENRE MULTIMODAL EMBEDDING FROM TRAILER CONTENT AND REACTOR EXPRESSIONS FOR BOX OFFICE PREDICTION
Ming-Ya Ko, Jeng-Lin Li, Chi-Chun Lee
National Tsing Hua University, Taiwan
[ID:23] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
Yangwo Jian, Jing Xiao, Yang Cao, Asad Khan, Jia Zhu
South China Normal University, China
[ID:24] CORRELATION FILTER TRACKING WITH ADAPTIVE PROPOSAL SELECTION FOR ACCURATE SCALE ESTIMATION
Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang
Xiamen University, China
[ID:25] SUPERVISED CONSISTENT AND SPECIFIC HASHING
Haitao Wang, Min Meng, Hui Chen, JiGang Wu
Guangdong University of Technology, China
[ID:26] MOMENTUM BASED ON ADAPTIVE BOLD DRIVER
Shengdong Li1,2, Xueqiang Lv3
1Renmin University of China, China, 2Langfang Yanjing Vocational Technical College, China, 3Beijing Information Science and Technology University, China
[ID:27] A LIGHTWEIGHT NEURAL NETWORK BASED HUMAN DEPTH RECOVERY METHOD
Meiyu Huang1, Xueshuang Xiang1, Yao Xu1, Yiqiang Chen2
1Qian Xuesen Laboratory of Space Technology, China Academy of Space Technology, China, 2Institute of Computing Technology, Chinese Academy of Sciences, China
Thursday, July 11, 2019
P-23: Multimedia Quality Assessment and Metrics
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Federica Battisti Roma Tre University, Italy
[ID:28] EVALUATION OF DEFOGGING: A REAL-WORLD BENCHMARK DATASET, A NEW CRITERION AND BASELINES
Shiyu Zhao1, Lin Zhang1, Shuaiyi Huang2, Ying Shen1, Shengjie Zhao1, Yukai Yang3
1Tongji University, China, 2ShanghaiTech University, China, 3Uppsala University, Sweden
[ID:29] RESA: A REAL-TIME EVALUATION SYSTEM FOR ABR
Yanan Wang, Haili Wang, Jiaoyang Shang, Hu Tuo
iQIYI, Inc, China
[ID:30] BLIND IMAGE SHARPNESS ASSESSMENT AND ENHANCEMENT VIA DEEP AUXILIARY LEARNING
Qingbo Wu, Rui Ma, King N. Ngan, Hongliang Li, and Fanman Meng
University of Electronic Science and Technology of China, China
[ID:31] END-TO-END BLIND IMAGE QUALITY ASSESSMENT WITH CASCADED DEEP FEATURES
Jinjian Wu, Jupo Ma, Fuhu Liang, Weisheng Dong, Guangming Shi
Xidian University, China
[ID:32] ENCODING DISTORTIONS FOR MULTI-TASK FULL-REFERENCE IMAGE QUALITY ASSESSMENT
Chen Huang1,2, Tingting Jiang1, Ming Jiang1
1Peking University, China, 2Baidu Inc., China
[ID:33] CAUSAL ANALYSIS OF THE UNSATISFYING EXPERIENCE IN REALTIME MOBILE MULTIPLAYER GAMES IN THE WILD
Yuan Meng1,4, Shenglin Zhang2, Zijie Ye1, Benliang Wang2, Zhi Wang1, Yongqian Sun2, Qitong Liu3, Shuai Yang3, Dan Pei1,4
1Tsinghua University, China, 2Nankai University, China, 3Tencent, China, 4Beijing National Research Center for Information Science and Technology, China
Thursday, July 11, 2019
P-24: Oral-25 to Oral-28
Time: 15:30 - 17:00 PM
Room: 3rd Floor
Chair: Kuan-Hsien Liu National Taichung University of Science and Technology, Taiwan
[ID:34] DYNAMIC CASCADED REGRESSION NETWORK WITH REINFORCEMENT LEARNING FOR ROBUST FACE ALIGNMENT
Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li
University of Science and Technology of China, China
[ID:35] DEEP LEARNING FACE HALLUCINATION VIA ATTRIBUTES TRANSFER AND ENHANCEMENT
Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie and Jun Yu
University of Science and Technology of China, China
[ID:36] EMOTION RECOGNITION FROM PHYSIOLOGICAL SIGNALS USING MULTI-HYPERGRAPH NEURAL NETWORKS
Junjie Zhu1, Xibin Zhao1, Han Hu2, Yue Gao1
1Tsinghua University, China, 2Beijing Institute of Technology, China
[ID:37] GPS: GROUP PEOPLE SEGMENTATION WITH DETAILED PART INFERENCE
Yue Liao1, Tianrui Hui1, Chen Gao1, Si Liu2, Yao Sun3, Hefei Ling4, Bo Li2
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2Beihang University, China, 3iie, China, 4Huazhong University of Science and Technology, China
[ID:38] MULTI-LABEL IMAGE RECOGNITION WITH JOINT CLASS-AWARE MAP DISENTANGLING AND LABEL CORRELATION EMBEDDING
Zhao-Min Chen1,2, Xiu-Shen Wei2, Xin Jin2, Yanwen Guo1,3
1Nanjing University, China, 2Megvii Technology, China, 3Science and Technology on Information Systems Engineering Laboratory, China
[ID:39] REAL TIME COMPRESSED VIDEO OBJECT SEGMENTATION
Zhentao Tan, Bin Liu, Weihai Li, Nenghai Yu
University of Science and Technology of China, China
[ID:40] ACCURATE AND FAST FINE-GRAINED IMAGE CLASSIFICATION VIA DISCRIMINATIVE LEARNING
Zhihui Wang1, Shijie Wang1, Pengbo Zhang1, Haojie Li1, Bo Liu2
1Dalian University of Technology, China, 2Shanghai Jiao Tong University, China
[ID:41] POSE2BODY: POSE-GUIDED HUMAN PARTS SEGMENTATION
Zhong Li1, Xin Chen2, Wangyiteng Zhou2, Yingliang Zhang2, Jingyi Yu2
1University of Delaware, USA, 2ShanghaiTech University, China
[ID:42] RESIDUAL MAGNIFIER: A DENSE INFORMATION FLOW NETWORK FOR SUPER RESOLUTION
Zhan Shu1, Mengcheng Cheng1, Biao Yang1, Zhuo Su1, Xiangjian He2,3
1Sun Yat-sen University, China, 2Minjiang University, China, 3University of Technology Sydney, Australia
[ID:43] EVERYONE IS A CARTOONIST: SELFIE CARTOONIZATION WITH ATTENTIVE ADVERSARIAL NETWORKS
Xinyu Li, Wei Zhang, Tong Shen, Tao Mei
JD AI Research, China
[ID:44] SCALE-AWARE DEEP NETWORK WITH HOLE CONVOLUTION FOR BLIND MOTION DEBLURRING
Jichun Li, Ke Li, Bo Yan
Fudan University, China
[ID:45] REMOVING RAIN IN VIDEOS: A LARGE-SCALE DATABASE AND A TWO-STREAM CONVLSTM APPROACH
Tie Liu, Mai Xu and Zulin Wang
Beihang University, China
[ID:46] TOWARDS QOS-AWARE CLOUD LIVE TRANSCODING: A DEEP REINFORCEMENT LEARNING APPROACH
Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang
Tsinghua University, China
[ID:47] HIGH SPEED RECURRENT REGRESSION NETWORK FOR VISUAL TRACKING
Ding Ma, Xiangqian Wu
Harbin Institute of Technology, China
[ID:48] PAAE: A UNIFIED FRAMEWORK FOR PREDICTING ANCHOR LINKS WITH ADVERSARIAL EMBEDDING
Yanmin Shang1, Zhezhou Kang1, Yanan Cao1, Dongjie Zhang1, Yangxi Li2, Yang Li3, Yanbing Liu1
1Institute of Information Engineering, Chinese Academy of Sciences, China, 2National Computer Network Emergency Response Technical Team, China, 3State Information Center, China
[ID:49] MANIFOLD ALIGNMENT AND DISTRIBUTION ADAPTATION FOR UNSUPERVISED DOMAIN ADAPTATION
Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying
Shanghai University, China
Workshops
Monday, July 8, 2019
W-01: Multimedia Services and Technologies for Smart-health (MUST-SH)
Time: 8:30 - 17:00
Room: 5F
Organizers: Shamim Hossain King Saud University, Saudi Arabia
Stefan Goebel KOM, TU Darmstadt, Germany
Yin Zhang Zhongnan University of Economics and Law, China
8:30 - 8:35 Opening Remarks:
Yin Zhang Zhongnan University of Economics and Law, China
8:35 - 9:30 Keynote Talk:
Huimin Lu Kyushu Institute of Technology, Japan
9:30 - 10:00 Oral Session 1:
Session Chair: Shamim Hossain King Saud University, Saudi Arabia
FULLY CONVOLUTIONAL NETWORK FOR 3D HUMAN SKELETON ESTIMATION FROM A SINGLE VIEW FOR ACTION ANALYSIS
Wen-Nung Lie1, Guan-Han Lin1, Lung-Sheng Shih1, YuLing Hsu1, Thang Huu Nguyen2, Quynh Nguyen Quang Nhu2
1National Chung Cheng University, Taiwan, 2The University of Danang, University of Science and Technology, Vietnam
10:00 - 10:30 Coffee Break
10:30 - 12:00 Oral Session 2:
Session Chair: Stefan Goebel KOM, TU Darmstadt, Germany
10:30 - 11:00
ATTENTION BASED SEMI-SUPERVISED DICTIONARY LEARNING FOR DIAGNOSIS OF AUTISM SPECTRUM DISORDERS
Meng Yang1,2, Qin Zhong1, Lin Chen3, Fanglin Huang4, Baiying Lei4
1Sun Yat-sen University, Guangzhou, China, 2Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Education, 3Sogou, China, 4Shenzhen University, China
11:00 - 11:30
RT-ADI: FAST REAL-TIME VIDEO REPRESENTATION FOR MULTI-VIEW HUMAN FALL DETECTION
Qianggang Ding, Fan Yang, Jiawei Li, Sifan Wu, Bowen Zhao, Zhi Wang, Shutao Xia
Tsinghua University, China
11:30 - 12:00
A NEW IMAGE WATERMARKING SCHEME FOR EFFICIENT TAMPER DETECTION, LOCALIZATION AND RECOVERY
Faranak Tohidi, Manoranjan Paul
Charles Sturt University, Australia
12:00 - 13:30 Lunch Break
13:30 - 15:00 Oral Session 3:
Session Chair: Yin Zhang Zhongnan University of Economics and Law, China
13:30 - 14:00
PREDICTING HUMAN GRASP LOCATIONS ON CUP HANDLES BY USING DEEP NEURAL NETWORKS TO INFER HEAT SIGNATURES FROM DEPTH DATA
Yijun Jiang, Sean Banerjee, Natasha Kholgade Banerjee
Clarkson University, USA
14:00 - 14:30
HIERARCHICAL FUZZY INFERENCE SYSTEM FOR DIAGNOSING DENGUE DISEASE
Mubarak Alrashoud
King Saud University, Saudi Arabia
14:30 - 15:00
HUMAN-INTERACTION WEAKLY-SUPERVISED DEEP NETWORKS FOR SEMANTIC SEGMENTATION
Wenfeng Luo1, Meng Yang1,2
1Sun Yat-sen University, China, 2Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Education, China
15:00 - 15:30 Coffee Break
15:30 - 17:00 Oral Session 4:
Session Chair: Shamim Hossain King Saud University, Saudi Arabia
15:30 - 16:15
THE PREDICTION MODEL OF BLOOD GLUCOSE CONCENTRATION FOR SMART HEALTH
Han Yu, Jianmin Lu, Yue Jin, Binglei Yue, Xiao Ma
Zhongnan University of Economics and Law, China
16:15 - 17:00
PREDICTING SPINE SURGERY COMPLICATIONS USING MACHINE LEARNING
Mohamad Hoda1, Abdulmotaleb El Saddik1, Eugene Wai2, Philippe Phan3
1University of Ottawa, Canada, 2The Ottawa Hospital, Canada, 3The Ottawa Hospital, Canada
Monday, July 8, 2019
W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)
Time: 8:30 AM - 12:00 PM
Room: 5H
Organizers: Wei-Ta Chu National Chung Cheng University, Taiwan
Norimichi Tsumura Graduate School of Engineering, Chiba University, Japan
Shoji Yamamoto Tokyo Metropolitan College of Industrial Technology, Japan
Toshihiko Yamasaki University of Tokyo, Japan
8:30 - 8:35 Opening Remarks:
Session Chair: Toshihiko Yamasaki
8:35 - 9:50 Oral Session 1: Multimedia Artworks Analysis
Session Chair: Norimichi Tsumura, Toshihiko Yamasaki
8:35 - 8:50
DEEPIR: A DEEP SEMANTICS DRIVEN FRAMEWORK FOR IMAGE RETARGETING
Jianxin Lin, Tiankuang Zhou, Zhibo Chen
University of Science and Technology of China, China
8:50 - 9:05
MULTI-DEPTH DILATED NETWORK FOR FASHION LANDMARK DETECTION
Zeng Kai, Jun Feng, Richard F E Sutcliffe, Wang Xiaoyu, Bu Qirong
Northwest University, China
9:05 - 9:20
SALIENCY-GUIDED IMAGE STYLE TRANSFER
Xiuwen Liu, Zhi Liu, Xiaofei Zhou, Minyu Chen
Shanghai University, China
9:20 - 9:35
A MULTIMEDIA-BASED MOVIE STYLE MODEL
Priyankar Choudhary, Neeraj Goel, Mukesh Saini
Indian Institute of Technology Ropar, India
9:35 - 9:50
NEURAL STYLE TRANSFER WITH CONTENT DISCRIMINATION
Xiyu Yan, Yeli Xing, Zihao He, Tao Dai, Yong Jiang, Shutao Xia
Tsinghua University, China
10:00 - 10:30 Coffee Break
10:30 - 11:30 Keynote talk by Prof. Jia Jia
Session Chair: Toshihiko Yamasaki
11:30 - 12:00 Oral Session 2: Attractiveness Computing in Multimedia
Session Chair: Wei-Ta Chu
11:30 - 11:45
PREDICTING THE ATTRACTIVENESS OF REAL-ESTATE IMAGES BY PAIRWISE COMPARISON USING DEEP LEARNING
Xueting Wang, Yuki Takada, Youiti Kado, Toshihiko Yamasaki
The University of Tokyo, Japan
11:45 - 12:00
VIDEO-BASED STRESS LEVEL MEASUREMENT USING IMAGING PHOTOPLETHYSMOGRAPHY
Ryota Mitsuhashi1, Kaito Iuchi1, Takashi Goto2, Akira Matsubara2, Takahiro Hirayama2, Hideki Hashizume2, Norimichi Tsumura1
1Chiba University, Japan, 2Daikin Industries LTD, Japan
Monday, July 8, 2019
W-03: Visual Emotion Analysis: Theories and Applications
Time: 13:30 - 17:30
Room: 5H
Organizers: Lifang Wu Beijing University of Technology, China
Jufeng Yang Nankai University, China
Rongrong Ji Xiamen University, China
13:30 - 13:35 Opening Remarks
13:35 - 14:30 Keynote: Computation of Emotion (Jiebo Luo)
14:30 - 15:00 Invited Talk 1: Affective and aesthetic computing on social images (Jia Jia)
15:00 - 15:30 Coffee Break
15:30 - 16:00 Invited Talk 2: Visual sentiment analysis and beyond (Yanwei Fu)
16:00 - 16:30 Invited Talk 3: Weakly supervised coupled networks for visual sentiment analysis (Dongyu She)
16:30 - 16:50
FEAFA: A WELL ANNOTATED DATABASE FOR FACIAL EXPRESSION ANALYSIS AND 3D FACIAL ANIMATION
Yanfu Yan1, Ke Lu1, Jian Xue1, Pengcheng Gao1, Jiayi Lyu2
1University of Chinese Academy of Sciences, China, 2Capital Normal University, China
16:50 - 17:10
CROSS-DATABASE MICRO-EXPRESSION RECOGNITION: A STYLE AGGREGATED AND ATTENTION TRANSFER APPROACH
Ling Zhou, Qirong Mao, Luoyang Xue
Jiangsu University, China
17:10 - 17:30
THE FUSION KNOWLEDGE OF FACE, BODY AND CONTEXT FOR EMOTION RECOGNITION
Jingjing Wu, Yong Zhang, Li Ning
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China
Monday, July 8, 2019
W-04: 1st International Workshop on Big Surveillance Data Analysis and Processing
Time: 8:30 AM - 12:00 PM
Room: 5I
Organizers: Weiyao Lin Shanghai Jiao Tong University, China
John See Multimedia University, Malaysia
Michael Ying Yang University of Twente, the Netherlands
8:30 - 10:00 Oral Session 1: Object Motion Analysis in Big Surveillance Videos
Session Chair: Weiyao Lin, Michael Ying Yang
8:30 - 8:45
DEFORMATION SAMPLE GENERATED NETWORK FOR ROBUST VISUAL TRACKING
Zizi Li, Yuan Zhou, Chunping Hou
Tianjin University, China
8:45 - 9:00
PRESERVING STRUCTURAL RELATIONSHIPS FOR PERSON RE-IDENTIFICATION
Liqiang Bao1, Bingpeng Ma1, Hong Chang2, Xilin Chen2
1University of Chinese Academy of Sciences, China, 2Chinese Academy of Sciences, China
9:00 - 9:15
ADAPTIVE UPDATING SIAMESE NETWORK WITH LIKE-HOOD ESTIMATION FOR SURVEILLANCE VIDEO OBJECT TRACKING
Zhenxian Zheng, Yang Yi, Jinlong Shen, Jiahao Zhang
Sun Yat-sen University, China
9:15 - 9:30
A MULTIMODAL LOSSLESS CODING METHOD FOR SKELETONS IN VIDEOS
Xiaoyi He, Mingzhou Liu, Weiyao Lin, Xintong Han, Yanmin Zhu, Hongtao Lu, Hongkai Xiong
Shanghai Jiao Tong University, China
9:30 - 9:45
EFFICIENT SEMANTIC-BASED VEHICLE RETRIEVAL IN LONG-TERM CAR PARK VIDEOS
Clarence Weihan Cheong, Ryan Woei-Sheng Lim, John See, Lai-Kuan Wong, Ian Kim Teck Tan
Multimedia University, Malaysia
9:45 - 10:00
SINGLE IMAGE HAZE REMOVAL BY FEATURE MAPPING
Feiniu Yuan1, Yu Zhou2, Xue Xia2, Ya Li2
1Shanghai Normal University, China, 2Jiangxi University of Finance and Economics, China
10:00 - 10:30 Coffee Break
10:30 - 12:00 Oral Session 2: Human & Action Sensing for Big Surveillance Videos
Session Chair: Weiyao Lin, Michael Ying Yang
10:30 - 10:45
MOTION-LET CLUSTERING FOR SKELETON-BASED ACTION RECOGNITION
Jianyu Yang1, Chen Zhu1, Junsong Yuan2
1Soochow University, China, 2State University of New York at Buffalo, USA
10:45 - 11:00
DEEP KEY CLIPS-VIDEO FEATURE FUSION FRAMEWORK FOR ACTION RECOGNITION
Chao Li1, Yue Ming1, Yuan Shen2, Hui Yu3
1Beijing University of Posts and Telecommunications, China, 2Tencent Technology (Beijing) Co., Ltd, China, 3University of Portsmouth, UK
11:00 - 11:15
HUMAN IDENTIFICATION RECOGNITION IN SURVEILLANCE VIDEOS
Kai Jin, Xuemei Xie, Fangyu Wang, Xiao Han, Guangming Shi
Xidian University, China
11:15 - 11:30
AGE ESTIMATION FOR LOW-QUALITY FACIAL IMAGES: FROM SEPARATE DCNNS TO A DECISION FUSER
Kuan-Hsien Liu1, Pak Ki Chan2, Tsung-Jung Liu3, Hsiu-An Her1
1National Taichung University of Science and Technology, Taiwan, 2China Medical University Hospital, China, 3National Chung Hsing University, Taiwan
11:30 - 11:45
SEMANTIC SEGMENTATION OF SATELLITE IMAGES USING A U-SHAPED FULLY CONNECTED NETWORK WITH DENSE RESIDUAL BLOCKS
Eric R Narciso Molina, Zenghui Zhang
Shanghai Jiao Tong University, China
11:45 - 12:00
MTCNN WITH WEIGHTED LOSS PENALTY AND ADAPTIVE THRESHOLD LEARNING FOR FACIAL ATTRIBUTE PREDICTION
Xingting He, Pingyu Wang, Zhicheng Zhao, Yanyun Zhao, Fei Su
Beijing University of Posts and Telecommunications, China
Monday, July 8, 2019
W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car
Time: 13:30 - 17:00
Room: 5I
Organizers: Dong Zhao Beijing University of Posts and Telecommunications, China
Chenqiang Gao Chongqing University of Posts and Telecommunications, China
Jiayi Ma Wuhan University, China
Quan Zhou Nanjing University of Posts and Telecommunications, China
Ji Zhao TuSimple, China
Yu Zhou Beijing University of Posts and Telecommunications, China
13:30 - 13:35 Opening Remarks:
Yu Zhou Huazhong University of Science and Technology, China
13:35 - 14:10 Keynote Talk:
Yiqun Li Huazhong University of Science and Technology, China
14:10 - 14:45 Keynote Talk:
Chen Chen University of North Carolina at Charlotte, USA
14:45 - 15:05 Oral Session 1:
Session Chair: Dong Zhao
14:45 - 15:05
MULTI-PATH FUSION NETWORK FOR HIGH-RESOLUTION HEIGHT ESTIMATION FROM A SINGLE ORTHOPHOTO
Yiteng Zhang, Xuejin Chen
University of Science and Technology of China, China
15:05 - 15:25 Coffee Break
15:25 - 16:00 Keynote Talk:
Lin Zhang Tongji University, China
16:00 - 17:00 Oral Session 2:
Session Chair: Jiayi Ma
16:00 - 16:20
FACE ANTI-SPOOFING BASED ON MULTI-LAYER DOMAIN ADAPTATION
Fengshun Zhou1,2, Chenqiang Gao1,2, Fang Chen1,2, Chaoyu Li1,2, Xindou L1,2, Feng Yang1,2, Yue Zhao1,2
1Chongqing University of Posts and Telecommunications, Chongqing, China, 2Chongqing Key Laboratory of Signal and Information Processing, Chongqing 400065, China
16:20 - 16:40
SELF-ATTENTION RELATION NETWORK FOR FEW-SHOT LEARNING
Binyuan Hui, Pengfei Zhu, Qinghua Hu, Qilong Wang
Tianjin University, China
16:40 - 17:00
BISE-RESNET: COMBINE SEGMENTATION AND CLASSIFICATION NETWORKS FOR ROAD FOLLOWING ON UNMANNED AERIAL VEHICLE
Dian Lyu, Peng Cheng, Ruizhou Liu, Liang Liu
Beijing University of Posts and Telecommunications, China
Monday, July 8, 2019
W-06: Information Theory and Multimedia Computing (ITMC)
Time: 8:30 - 16:30
Room: 5J
Organizers: Ran He Chinese Academy of Sciences, China
Xiaotong Yuan Nanjing University, China
Jitao Sang Beijing Jiaotong University, China
8:50 - 9:00 Opening
9:00 - 10:00 Keynote Talk: Ran He
10:00 - 10:15 Coffee Break
10:15 - 11:45 Oral Session 1:
Session Chair: Ran He
10:15 – 10:30
HYBRID DEFENSE FOR DEEP NEURAL NETWORKS: AN INTEGRATION OF DETECTING AND CLEANING ADVERSARIAL PERTURBATIONS
Weiqi Fan, Guangling Sun, Yuying Su, Zhi Liu, Xiaofeng Lu
Shanghai University, China
10:30 – 10:45
SKETCH-BASED IMAGE RETRIEVAL VIA A SEMI-HETEROGENEOUS CROSS-DOMAIN NETWORK
Chuo Li, Yuan Zhou, Jianxing Yang
Tianjin University, Tianjin, China
10:45 – 11:00
QUESTION SPLITTING AND UNBALANCED MULTI-MODAL POOLING FOR VQA
Mengfei Li, Huan Shao, Yi Ji, Yang Yang, ChunPing Liu
Soochow University Suzhou, Jiangsu, China
11:00 – 11:15
AI-GAN: SIGNAL DE-INTERFERENCE VIA ASYNCHRONOUS INTERACTIVE GENERATIVE ADVERSARIAL NETWORK
Xin Jin, Zhibo Chen, Jianxin Lin, Wei Zhou, Jiale Chen, Chaowei Shan
University of Science and Technology of China, Hefei, China
11:15 – 11:30
VISUAL OBJECT TRACKING VIA GRAPH CONVOLUTIONAL REPRESENTATION
Zhengzheng Tu, Ajian Zhou, Bo Jiang, Bin Luo
Anhui University, China
11:30 – 11:45
MOIRE PATTERN REMOVAL WITH MULTI-SCALE FEATURE ENHANCING NETWORK
Tian yu Gao1, Yanqing Guo1, Xin Zheng1, Qianyu Wang1, Xiangyang Luo2
1Dalian University of Technology, China, 2The State Key Laboratory of Mathematical Engineering and Advanced Computing, China
12:00 - 13:30 Lunch Break
13:30 - 15:00 Oral Session 2:
Session Chair: Yi Li
13:30 – 13:45
DEEP COLOR IMAGE DEMOSAICKING WITH FEATURE PYRAMID CHANNEL ATTENTION
Qi Kang, Ying Fu, Hua Huang
Beijing Institute of Technology, China
13:45 – 14:00
REAL-WORLD IMAGE DENOISING VIA WEIGHTED LOW RANK APPROXIMATION
Yuenan Guo, Ying Fu, Hua Huang
Beijing Institute of Technology, China
14:00 – 14:15
TWO-STREAM SPARSE NETWORK FOR ACCURATE IMAGE SUPER-RESOLUTION
Ling Hu1,2, Shuhui Wang1, Liang Li1, Qingming Huang1,2
1Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, China, 2University of Chinese Academy of Sciences, Beijing, 100049, China
14:15 – 14:30
EMBEDDING NON-LOCAL MEAN IN SQUEEZE-AND-EXCITATION NETWORK FOR SINGLE IMAGE DERAINING
Cong Wang, Hongyan Wang, Zhixun Su, Yan Yang
Dalian University of Technology, China
14:30 – 14:45
RELATIVE DEPTH ESTIMATION PRIOR FOR SINGLE IMAGE DEHAZING
Jinbao Wang1, Ke Lu1, Jian Xue1, Yutong Kou2
1University of Chinese Academy of Sciences, China, 2Huazhong University of Science & Technology, China
14:45 – 15:00
LOW-LIGHT IMAGE ENHANCEMENT WITH ATTENTION AND MULTI-LEVEL FEATURE FUSION
Lei Wang1, Guangtao Fu2, Zhuqing Jiang1, Guodong Ju3, Aidong Men1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China, 3GuangDong TUS-TuWei Technology Co, Ltd, China
15:00 - 15:30 Coffee Break
15:30 - 16:30 Oral Session 3:
Session Chair: Yi Li
15:30 – 15:45
BLIND MESH QUALITY ASSESSMENT METHOD BASED ON CONCAVE, CONVEX AND STRUCTURAL FEATURES ANALYSES
Yaoyao Lin, Mei Yu, Ken Chen, Gangyi Jiang, Zongju Peng, Fen Chen
Faculty of Information Science and Engineering, Ningbo University, Ningbo, China
15:45 – 16:00
K-COVERS FOR ACTIVE LEARNING IN IMAGE CLASSIFICATION
Yeji Shen1, Yuhang Song1, Hanhan Li2, Shahab Kamali2, Bin Wang1, C.-C. Jay Kuo1
1University of Southern California, USA, 2Google Research, USA
16:00 – 16:15
DISTRIBUTION DISCREPANCY MAXIMIZATION FOR IMAGE PRIVACY PRESERVING
Sen Liu, Jianxin Lin, Zhibo Chen
University of Science and Technology of China, China
16:15 – 16:30
A NOVEL DISTANCE LEARNING FOR ELASTIC CROSS MODAL AUDIO-VISUAL MATCHING
Rui Wang1, Huaibo Huang2,3, Xufeng Zhang1, Jixin Ma4, Aihua Zheng1
1Anhui University, China, 2University of Chinese Academy of Sciences, China, 3CASIA, China, 4University of Greenwich, UK
Friday, July 12, 2019
W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)
Time: 8:30 AM - 12:00 PM
Room: 5F
Organizers: Tian Gan Shandong University, China
Wen-Huang Cheng National Chiao Tung University, Taiwan
Kai-Lung Hua National Taiwan University of Science and Technology, Taiwan
Klaus Schoeffmann Klagenfurt University, Austria
Vladan Velisavljevic University of Bedfordshire, UK
Christian von der Weth National University of Singapore, Singapore
8:30 - 9:00 Opening & Keynotes
9:00 - 10:00 Oral Session 1:
Session Chair: Wen-Huang Cheng
9:00 - 09:15
FINE DETECTION AND CLASSIFICATION OF MULTI-CLASS BARCODE IN COMPLEX ENVIRONMENTS
Jiahe Zhang1, Jun Jia1, Zehao Zhu1, Xiongkuo Min1, Guangtao Zhai1, Xiao-Ping Zhang2
1Shanghai Jiao Tong University, China, 2Ryerson University, Canada
9:15 - 09:30
DEEP LEARNING BASED METHOD FOR PRUNING DEEP NEURAL NETWORKS
Lianqiang Li1, Jie Zhu1, Ming-Ting Sun2
1Shanghai Jiao Tong University, China, 2University of Washington, USA
9:30 - 09:45
ALPS 1.0: TOWARDS AUTOMATED LECTURE PROFILING SYSTEM
Pratibha Kumari1, Prakhar Jain1, Swarna Sahay1, Gan Tian2, Mukesh Saini1
1Indian Institute of Technology Ropar, India, 2Shandong University, China
9:45 - 10:00
VAS360: QOE-DRIVEN VIEWPORT ADAPTIVE STREAMING FOR 360 VIDEO
Yuxiang Hu, Yu Liu, Yumei Wang
Beijing University of Posts and Telecommunications, China
10:00 - 10:30 Coffee Break
10:30 - 11:30 Oral Session 2:
Session Chair: Tian Gan
10:30 - 10:45
FUSING GEOGRAPHIC INFORMATION INTO LATENT FACTOR MODEL FOR PICK-UP REGION RECOMMENDATION
Zhuhua Liao, Jian Zhang, Yizhi Liu
Hunan University of Science & Technology, China
10:45 - 11:00
A FLEXIBLE VIEWPORT-ADAPTIVE PROCESSING MECHANISM FOR REAL-TIME VR VIDEO TRANSMISSION
Anyue Xu, Xinyu Chen, Yu Liu, Yumei Wang
Beijing University of Posts and Telecommunications, China
11:00 - 11:15
OBJECTIVE QUALITY ASSESSMENT METHOD FOR STEREOSCOPIC IMAGE RETARGETING
Salah Addin Mohammed M Mohammed, Ya Zhou, Zhibo Chen, Houqiang Li
University of Science and Technology of China, China
11:15 - 11:30
OPTIMAL MULTI-CODEC ADAPTIVE BITRATE STREAMING
Yuriy Reznik, Xiangbo Li, Karl Lillevold, Abhijith Jagannath, Justin Greer
Brightcove Inc., USA
11:30 - 12:00
Best Paper Award Announcement
Friday, July 12, 2019
W-08: Time-sequenced Multimedia Computing
Time: 13:30 - 17:45
Room: 5F
Organizers: Wei Li Fudan University, China
Mengyao Zhu Shanghai University, China
Bing-Kun Bao Nanjing University of Posts and Telecommunications, Nanjing, China
Min Xu University of Technology Sydney, Australia
Xi Shao Nanjing University of Posts and Telecommunications, Nanjing, China
13:30 - 13:55
AUDIO SCENE CLASSIFICATION WITH DISCRIMINATIVELY-TRAINED SEGMENT-LEVEL FEATURES
Haichuan Bai1,2, Hangting Chen1,2, Yonghong Yan1,2
1Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
13:55 - 14:20
EFFICIENT IMPLICIT FOURIER COMPRESSION BASED CONVOLUTIONAL FEATURES FOR VISUAL TRACKING
Ridong Zhu, Xiaoyuan Yang, Jingkai Wang, Zhengze Li
Beihang University, China
14:20 - 14:45
AUDIO2FACE: GENERATING SPEECH/FACE ANIMATION FROM SINGLE AUDIO WITH ATTENTION-BASED BIDIRECTIONAL LSTM NETWORKS
Guanzhong Tian1, Yi Yuan2, Yong Liu1
1Zhejiang University, China, 2Fuxi AI Lab, Netease, China
14:45 - 15:10
DEEP VOCODER: LOW BIT RATE COMPRESSION OF SPEECH WITH DEEP AUTOENCODER
Gang Min1, Changqing Zhang1, Xiongwei Zhang2, Wei Tan1
1National University of Defense Technology, China, 2Army Engineering University of PLA, China
15:10 - 15:30 Coffee Break
15:30 - 15:55
BLIND ESTIMATION OF REVERBERATION TIME USING BINAURAL COMPLEX IDEAL RATIO MASK
MingYang Chai1, TianTian Li1, MengYao Zhu1, Tao Wang1, Wen Zhang2
1Shanghai University, China, 2Northwestern Polytechnical University, China
15:55 - 16:20
OPV: BIAS CORRECTION BASED OPTIMAL PROBABILISTIC VIEWPORT-ADAPTIVE STREAMING FOR 360-DEGREE VIDEO
Weihong Lin, Xinggong Zhang, Zongming Guo, Wei Hu
Peking University, China
16:20 - 16:45
SVD-BASED CHANNEL PRUNING FOR CONVOLUTIONAL NEURAL NETWORK IN ACOUSTIC SCENE CLASSIFICATION MODEL
Jun Wang1, Shengchen Li1, Wenwu Wang2
1Beijing University of Posts and Telecommunications, China, 2University of Surrey, UK
16:45 - 17:10
MULTI-LEVEL ATTENTION MODEL WITH DEEP SCATTERING SPECTRUM FOR ACOUSTIC SCENE CLASSIFICATION
Zhitong Li1, Yuanbo Hou2, Xiang Xie1,3, Shengchen Li2, Liqiang Zhang1, Shixuan Du1, Wei Liu1
1Beijing Institute of Technology, China, 2Beijing University of Posts and Telecommunications, China, 3Beijing Institute of Technology, China
17:10 - 17:45
A MULTI-CRITERIA SUBJECTIVE EVALUATION METHOD FOR BINAURAL AUDIO RENDERING TECHNIQUES IN VIRTUAL REALITY APPLICATIONS
Zhaoyu Yan, Jing Wang, Zhuoran Li
Beijing Institute of Technology, China
Friday, July 12, 2019
W-09: Smart Camera Gigavision
Time: 8:30 AM - 12:00 PM
Room: 5I
Organizers: Lu Fang Associate Professor, Tsinghua-Berkeley Shenzhen Institute, China
David J. Brady Duke University, USA
Shenghua Gao Assistant Professor, ShanghaiTech University, China
Yuchen Guo Tsinghua University, China
8:30 - 8:35 Opening Remarks:
Lu Fang Tsinghua University, China
8:35 - 9:15 Plenary Talk:
David J. Brady Duke University, USA
9:15 - 9:40 Keynote Talk:
Lu Fang Tsinghua University, China
9:40 - 10:05 Oral Session 1:
Session Chair: Lu Fang
SCALE-ADAPTIVE CNN BASED CROWD COUNTING AND DYNAMIC SUPERVISION
Zhengxin Li1, Jing Li1, Ling Xie1, Jianli Liu2
1ShanghaiTech University, Shanghai, China, 2Jiangnan University, Wuxi, China
SPATIAL-TEMPORAL CODEC ACCURACY CALIBRATION FOR MULTI-SCALE GIGA-PIXEL MACROSCOPE
Lei WANG, Jinli SUO, Jingtao FAN
Tsinghua University, China
10:05 - 10:20 Coffee Break
10:20 - 10:45 Keynote Talk:
Zhan Ma Nanjing University, China
10:45 - 11:10 Keynote Talk:
Shenghua Gao ShanghaiTech University, China
11:10 - 11:35 Keynote Talk:
Xing Lin Tsinghua University, China
11:35 - 12:00 Oral Session 2:
Session Chair: Lu Fang
SEGMENTATION OF BUILDING FOOTPRINTS WITH XCEPTION AND IOULOSS
Kepeng Xu1, Yunye Zhang1, Wenxin Yu1, Zhiqiang Zhang1, Jingwei Lu2, Yibo Fan3, Gang He4, Zhuo Yang5
1Southwest University of Science and Technology, China, 2Cadence Design Systems, Inc, 3Fudan University, China, 4Xidian University, China, 5Guangdong University of Technology, China
GIGAPIXEL-LEVEL IMAGE CROWD COUNTING USING CSRNET
Zhijie Cao1, Renyou Yan2, Yiyong Huang3, Zhiru Shi4
1Shanghai Jiao Tong University, China, 2ShanghaiTech University, China, 3Shanghai University, China, 4Yoke Intelligence, China
Friday, July 12, 2019
W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding
Time: 13:30 - 17:45
Room: 5I
Organizers: Yang Yang University of Electronic Science and Technology of China, China.
Yang Wang Dalian University of Technology, China.
Xing Xu University of Electronic Science and Technology of China, China.
Zi Huang University of Queensland, Australia.
13:30 - 13:35 Opening Remarks
13:35 - 14:05 Keynote 1: Tentative
14:05 - 15:35 Oral Session 1: Knowledge Transfer Methods in Vision and Language
Session Chair: Yang Yang
14:05 - 14:20
MASK-GUIDED STYLE TRANSFER NETWORK FOR PURIFYING REAL IMAGES
Tongtong Zhao, Yuxiao Yan, Jinjia Peng, Huibing Wang, Xianping Fu
Dalian Maritime University, China
14:20 - 14:35
IMITATION LEARNING FOR SENTENCE GENERATION WITH DILATED CONVOLUTIONS USING ADVERSARIAL TRAINING
JianWei Peng1, MinChun Hu1, ChuanWang Chang2
1National Cheng Kung University, Taiwan, 2Kun Shan University, Taiwan
14:35 - 14:50
NON-RIGID 3D SHAPE RETRIEVAL BASED ON MULTI-VIEW METRIC LEARNING
Haohao Li, Shengfa Wang, Nannan Li, Zhixun Su, Ximin
Dalian University of Technology, China
14:50 - 15:05
WHAT TOPICS DO IMAGES SAY: A NEURAL IMAGE CAPTIONING MODEL WITH TOPIC REPRESENTATION
Feng Chen, Songxian Xie, Xinyi Li, Shasha Li, Jintao Tang, Ting Wang
National University of Defense Technology, China
15:05 - 15:30 Coffee Break
15:30 - 16:00 Keynote 2: Tentative
16:00 - 16:30 Oral Session 1: Knowledge Transfer Methods in Vision and Language
Session Chair: Yang Yang
16:00 - 16:15
CROSS DOMAIN KNOWLEDGE TRANSFER FOR UNSUPERVISED VEHICLE RE-IDENTIFICATION
Jinjia Peng, Huibing Wang, Tongtong Zhao and Xianping Fu
Dalian Maritime University, China
16:15 - 16:30
CYCLE-CONSISTENT DIVERSE IMAGE SYNTHESIS FROM NATURAL LANGUAGE
Zhi Chen, Yadan Luo
The University of Queensland, Australia
16:30 - 18:00 Session 2: Knowledge Transfer Related Application
Session chair: Yang Wang
16:30 - 16:45
SELF-WEIGHTED MULTIVIEW METRIC LEARNING BY MAXIMIZING THE CROSS CORRELATIONS
Huibing Wang, Jinjia Peng and Xianping Fu
Dalian Maritime University, China
16:45 - 17:00
CAUSATION-DRIVEN VISUALIZATIONS FOR INSURANCE RECOMMENDATION
Zhixiu Liu1, Chengxi Zang2, Kun Kuang1, Hao Zou1, Hu Zheng3, Peng Cui1
1Tsinghua University, China, 2Cornell University, USA, 3Datebao Insurance Ltd, China
17:00 - 17:15
CROSS-MODAL TRANSFER HASHING BASED ON COHERENT PROJECTION
En Yu1,2, Jiande Sun1, Li Wang1, Xiaojun Chang3, Huaxiang Zhang1, Alexander G. Hauptmann2
1Shandong Normal University, China, 2Carnegie Mellon University, USA, 3Monash University, Australia
17:15 - 17:30
RELATION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Bin Deng (Shenzhen University)*; Daming Shi (College of Computer Science and Software Engineering, Shenzhen University)
Tianjin University, China
17:30 - 17:45
ANNOTATING 3D MODELS AND THEIR PARTS VIA DEEP FEATURE EMBEDDING
Kouki Omata, Takahiko Furuya, Ryutarou Ohbuchi
University of Yamanashi, Japan
Friday, July 12, 2019
W-11: AI Technology for Visual Fashion Computing
Time: 8:30 - 9:50 AM
Room: 5J
Organizers: Wei Zhang JD AI Research, China
Ting Yao JD AI Research, China
Wen-Huang Cheng National Chiao Tung University, Taiwan
8:30 - 8:35 Opening Remarks
Session Chairs: Wei Zhang JD AI Research, China
8:35 - 9:00
DISENTANGLED HUMAN ACTION VIDEO GENERATION VIA DECOUPLED LEARNING
Lingbo Yang1, Zhenghui Zhao1, Shiqi Wang2, Shanshe Wang1, Siwei Ma1, Wen Gao1
1Peking University, China, 2City University of Hong Kong, China
9:00 - 9:25
PERSONALIZED IMAGE RECOMMENDATION WITH PHOTO IMPORTANCE AND USER-ITEM INTERACTIVE ATTENTION
Wan Zhang, Zepeng Wang, Tao Chen
Hefei University of Technology, China
9:25 - 9:50
PARTIALLY OCCLUDED HEAD POSTURE ESTIMATION FOR 2D IMAGES USING PYRAMID HOG FEATURES
Jun Wu1, Z. Shang1, K. Wang1, J. Zhai1, Y. Wang1, F. Xia1, W. Li1, J. Zhang1, Fan Zhang2
1Northwestern Polytechnical University, China, 2Zhejiang University, China
Friday, July 12, 2019
W-12: 2nd IEEE International Workshop on Faces in Multimedia (FacesMM)
Time: 10:30 AM - 12:00 PM
Room: 5J
Organizers: Yun Fu Northeastern University, USA
Joseph P Robinson Northeastern University, USA
Ming Shao University of Massachusetts, Dartmouth
Siyu Xia Southeast University, China
10:30 - 10:35 Opening Remarks: Joseph P Robinson
10:35 - 11:15 Keynote Talk: Di Huang Beihang University, China
11:15 - 11:30
ADAPTIVE SALIENCE PRESERVING POOLING FOR DEEP CONVOLUTIONAL NEURAL NETWORKS
Yu Zhenyu1, Dai Shiyu1, Xing Yuxiang2
1Nuctech Company Limited, China, 2Tsinghua University, China
11:30 - 11:45
FULLY AUTOMATIC PHOTOREALISTIC FACIAL EXPRESSION AND EYE GAZE TRANSFER WITH A SINGLE IMAGE
Wanxin Xu, Sen-ching Cheung
University of Kentucky, USA
11:45 - 12:00
DEEP DOMAIN ADAPTATION FOR ASIAN FACE RECOGNITION VIA ADA-IBN
Chen Qian, Yi Jin, Yidong Li, Congyan Lang, Songhe Feng, Tao Wang
Beijing Jiaotong University, China
Friday, July 12, 2019
W-13: The Third Workshop on Human Identification in Multimedia (HIM)
Time: 13:30 - 17:30
Room: 5J
Organizers: Liangliang Ren Department of Automation, Tsinghua University, China
Guangyi Chen Department of Automation, Tsinghua University, China
Dr. Jiwen Lu Department of Automation, Tsinghua University, China
13:30 - 13:35 Introduction
13:35 - 14:25 Invited Talk: Person Re-identification
Weishi Zheng
14:25 - 14:55 Oral Session 1: Human Identification
Session chair: Liangliang Ren
14:25 - 14:40
SIMILARITY PRESERVED CAMERA-TO-CAMERA GAN FOR PERSON RE-IDENTIFICATION
Jianlei Liu1, Yun Zhou2, Lingchuan Sun1, Zhuqing Jiang1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China
14:40 - 14:55
UNSUPERVISED DOMAIN ADAPTATION FOR DISGUISED FACE RECOGNITION
Fangyu Wu1,2, Shiyang Yan3, Jeremy S. Smith2, Wenjin Lu1, Bailing Zhang4
1Xi’an Jiaotong-Liverpool University, China, 2University of Liverpool, UK, 3Queen’s University Belfast, UK, 4Zhejiang University, China
15:00 - 15:30 Coffee Break
15:30 - 16:45 Oral Session 2: Detection and Tracking
Session chair: Guangyi Chen
15:30 - 15:45
DUAL-CYCLE DEEP REINFORCEMENT LEARNING FOR STABILIZING FACE TRACKING
Congcong Zhu, Zhenhua Yu, Suping Wu, Hao Liu
Ningxia University, China
15:45 - 16:00
MULTI-TASK LEARNING FOR PEDESTRIAN BODY PARTS DETECTION AND MULTI-ATTRIBUTE CLASSIFICATION
Miaomiao Lou1,2, Lin Chen1, Feng Guo2
1Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, China, 2Chengdu University of Information Technology, China
16:00 - 16:15
CONTEXT ATTENTION MODULE FOR HUMAN HAND DETECTION
Zhihuai Xie1, Shaojie Wang2, Wentian Zhao2, Zhenhua Guo1
1Department of Information Science and Technology, Graduate School at Shenzhen, Tsinghua University, China, 2Department of Computer Science, University of Rochester, USA
16:15 - 16:30
TOWARD ROBUST ONLINE ADAPTIVE VISUAL TRACKING VIA PYRAMIDAL FEATURES EXTRACTION
Shuai Bai1, Yuan Dong1, Ting-Bing Xu2, Hongliang Bai3
1Beijing University of Posts and Telecommunications, China, 2Institute of Automation of Chinese Academy of Sciences, China, 3Beijing FaceAll Co., China
16:30 - 16:45
IMPROVING HUMAN POSE ESTIMATION WITH SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORKS
Zhongzheng Cao, Rui Wang, Xiangyang Wang, Zhi Liu, Xiaoqiang Zhu
Shanghai University, China
16:45 - 17:30 Oral Session 3: Multimedia Processing
Session chair: Liangliang Ren
16:45 - 17:00
COLLABORATIVE REPRESENTATION GUIDED GRAPH LEARNING FOR VISUAL CLASSIFICATION
Sheng Huang, Yongxin Ge, Feiyu Chen, Kewen He, Xiaohong Zhang
Chongqing University, China
17:00 - 17:15
SPORTS HIGHLIGHTS GENERATION USING DECOMPOSED AUDIO INFORMATION
Muhammad Rafiqul Islam, Manoranjan Paul, Michael Antolovich, Ashad Kabir
Charles Sturt University, Australia
17:15 - 17:30
NEW BENCHMARK DATASETS AND A CHARACTER IDENTIFICATION SYSTEM ON TV SERIES
Zhuo Lei1, Qian Zhang2, Guoping Qiu3,4
1The University of Nottingham Ningbo China, 2University of Nottingham Ningbo China, 3Shenzhen University, China, 4University of Nottingham, UK
Student Program
Wednesday, July 10, 2019
Student Career Lunch
Time: 12:30 - 14:00
Room: 5I
Chair: Weiyao Lin Shanghai Jiao Tong University, China
Xiaoyan Sun Microsoft Research Asia, China
Shaoen Wu Ball State University, USA
3MT Competition
Time: 14:00 - 15:30
Room: 5I
Chair: Weiyao Lin Shanghai Jiao Tong University, China
Xiaoyan Sun Microsoft Research Asia, China
Shaoen Wu Ball State University, USA
14:00 ENHANCING QUALITY FOR COMPRESSED VIDEO
Ren Yang
ETH Zurich, Switzerland
14:05 STUDENT PARTICIPATION IN ICME2019
Qiyang Zhang
Wuhan University of Technology, China
14:10 RESEARCH ON LONG-TERM STABLE VISUAL TRACKING
Yuqi Han
Beijing Institute of Technology, China
14:15 PORTRAIT INSTANCE SEGMENTATION FOR MOBILE DEVICES
Lingyu Zhu
Tampere University, Finland
14:20 MODELING BOTH CONTEXT AND SPEAKER-SENSITIVE DEPENDENCE FOR EMOTION DETECTION IN MULTI-SPEAKER CONVERSATIONS
Dong Zhang
Soochow University, China
14:25 FOCUSED PLENOPTIC CAMERA AND CALIBRATION
Xufu Sun
Tsinghua University, China
14:30 ADVERSARIAL CROSS-MODAL RETRIEVAL VIA LEARNING AND TRANSFERRING SINGLE-MODAL SIMILARITIES
Xin Wen
Tsinghua University, China
14:35 DATA AUGMENTATION FOR MONAURAL SINGING VOICE SEPARATION BASED ON VARIATIONAL AUTOENCODER-GENERATIVE ADVERSARIAL NETWORK
Boxin He
Tianjin Polytechnic University, China
14:40 MACHINE LEARNING FOR ACOUSTIC SCENE CLASSIFICATION
TrucThi Kim Nguyen
Graz University of Technology, Austria
14:45 STUDENT CAREER LUNCH
Tie Liu
Beihang University, China
14:50 AN ADAPTIVE PARAMETER MODEL FOR DCT-BASED WATERMARKING SCHEMES
Ying Huang
Taiyuan University of Technology, China
14:55 NEWS-ORIENTED STOCK MOVEMENT PREDICTION ON DENSE TEMPORAL SEQUENCE USING IMPLICIT NEWS
Tsun-Hsien Tang
National Taiwan University, Taiwan
15:00 Evaluation and Q&A
Social Events
ICME 2019 Reception
Time: 18:00 - 21:00, Monday, July 8th, 2019
Room: Pearl Hall (7F)
ICME 2019 Student Career Dinner
Time: 18:00 - 21:00, Wednesday, July 10th, 2019
Room: Grand Ballroom (7F)
ICME 2019 Banquet
Time: 18:00 - 21:00, Wednesday, July 10th, 2019
Room: Grand Ballroom (7F)
Side Meetings
Tuesday, July 9, 2019
Time: 12:30 - 14:00 PM
Room 5J: TC meeting 1 (TMM SC)
Room 5F: TC meeting 2 (MMSP TC)
Room 5H: TC meeting 3 (ICME SC)
Wednesday, July 10, 2019
Time: 12:30 - 14:00 PM
Room 5J: TC meeting 4 (TMM EB)
Room 5F: TC meeting 5 (ComSoc MMTC)
Room 5H: TC meeting 6 (TCMC)
Thursday, July 11, 2019
Time: 12:30 - 14:00 PM
Room 5J: TC meeting 7 (IEEE MM-MAG EB)
Room 5F: TC meeting 8 (MSA TC)
Room 5H: TC meeting 9 (ICME 2019/2020 OC)
Area Chairs
Pradeep K. Atrey, University at Albany, SUNY, USA
Cunjian Chen, Michigan State University, USA
Ngai Man Cheung, Singapore University of Technology and Design, Singapore
Peng Cui, Tsinghua University, China
Tasos Dagiuklas, London South Bank University, UK
Guiguang Ding, Tsinghua University, China
Ming Dong, Wayne State University
Lingyu Duan, Peking University, China
Frederic Dufaux, Centre national de la recherche scientifique, France
Lu Fang, Tsinghua University, China
Jianlong Fu, Microsoft Research, USA
Chuang Gan, MIT-Watson AI Lab, USA
Yue Gao, Tsinghua University, China
Jingming Guo, National Taiwan University of Science and Technology, Taiwan
Song Guo, The Hong Kong Polytechnic University, China
Jungong Han, Lancaster University, UK
Luis Herranz, Computer Vision Center, Spain
Richang Hong, Hefei University of Technology, China
Wolfgang Hürst, Utrecht University, Netherlands
Wen Ji, Institute of Computing Technology, Chinese Academy of Sciences, China
Cheolkon Jung, Xidian University, China
Andre Kaup, Friedrich-Alexander-Universität, Germany
Patrick Le Callet, Universite de Nantes, France
Ge Li, Peking University, China
Houqiang Li, University of Science and Technology of China, China
Wanqing Li, University of Wollongong, Australia
Xiaolin Li, University of Florida, USA
Weiyao Lin, Shanghai Jiaotong University, China
Jianquan Liu, NEC Corporation, Japan
Wu Liu, AI Research of JD.com, China
Jiwen Lu, Tsinghua University, China
Yan Lu, Microsoft Research Asia, China
Bo Luo, University of Kansas, USA
Sanjeev Mehrotra, Microsoft, USA
Jingjing Meng, State University of New York at Buffalo, USA
Yuxin Peng, Peking University, China
Marius Preda, Université Paris-Sud, France
GuoJun Qi, Huawei Cloud, China
Rajiv Ratn Shah, Indraprastha Institute of Information Technology, India
Amy R. Reibman, Purdue University, USA
Jian Ren, Michigan State University, USA
Yong Man Ro, Korea Advanced Institute of Science and Technology, Korea
Sebastian Schwarz, Nokia, Finland
Ju Shen, Department of Computer Science, University of Dayton, USA
Hailin Shi, JD AI Research, China
Vladimir Stankovic, University of Strathclyde, UK
Jinhui Tang, Nanjing University of Science and Technology, China
Jelena Tešić, Texas State University, USA
Yonghong Tian, Peking University, China
Farzad Toutounchi, Queen Mary University of London, UK
Vladan Velisavljevic, University of Bedfordshire, UK
Ruiping Wang, ICT, CAS, China
Wei Wang, Aginome Scientific, Singapore
Xinchao Wang, Stevens Institute of Technology, USA
Xin Jing Wang, Wework, USA
Zhangyang Wang, Texas A&M University, USA
Mathias Wien, RWTH Aachen University, Germany
Dalei Wu, University of Technology of Compiegne, France
Shaoen Wu, Ball State University, USA
Feng Xue, Hefei University of Technology, China
Chenggang Yan, Hangzhou Dianzi University, China
Qing Yang, University of North Texas, USA
Wenxian Yang, Aginome Scientific, Singapore
Yi Yang, University of Technology Sydney, Australia
Ting Yao, JD AI Research, China
Cha Zhang, Microsoft Research, USA
Chengcui Zhang, The University of Alabama at Birmingham, USA
Hanwang Zhang, Nanyang Technological University, Singapore
Qianni Zhang, Queen Mary University of London, UK
Sicheng Zhao, University of California Berkeley, USA
Liang Zheng, Australian National University, Australia
Ce Zhu, University of Electronic Science & Technology of China, China
Technical Program Committee Members
Milad Abdollahzadeh Saverio Blasi Guangyi Chen Jaeyoung Choi
Mona Abid Du Bo Haoming Chen Kyoung-Ho Choi
Velibor Adzic Erik Bochinski Homer Chen Taelim Choi
Mariana Afonso Maksim Bolonkin Jingjing Chen Yoojin Choi
Luciano Volcan Agostini Marc Bosch Jun-Cheng Chen Hang Chu
Mohammad Faizal Ahmad Fauzi Imed Bouazizi Shixing Chen Lingyang Chu
Syed Hassan Ahmed Catarina Brites Shu-Ching Chen Wei-Ta Chu
Ali Ak Matthew Broadbent Si-Bao Chen Yung-Yu Chuang
Anique Akhtar Michael S Brown Toly Chen Stelvio Cimato
Hasan Al Marzouqi Michele Buccoli Wei Chen Claudiu Cobarzan
Ghassan Alregib Yujun Cai Wei-Bang Chen Giulio Coluccia
Laurent Amsaleg Roberto Caldelli Wuyang Chen Pedro Comesana-Alfaro
Gerasimos Arvanitis Shaun Canavan Xi Stephen Chen Antoine Coutrot
Joao Ascenso K. Selçuk Candan Xin Chen Dubravko Culibrk
Pedro A. Assuncao Kai Cao Xing Chen Eduardo A. B. Da Silva
Christoph Bachhuber Stefania Cecchi Yuli Chen Luis A Da Silva Cruz
Tom Bäckström Zhenhua Chai Zebin Chen Qi Dai
Chongyang Bai Chee Seng Chan Zhibo Chen Xiyang Dai
Yan Bai Din-Yuen Chan Zhineng Chen Antitza Dantcheva
Anant Baijal Chuan-Wang Chang Zhixiang Chen Mohamed Daoudi
Werner Bailer Chun-Fa Chang Bowen Cheng Carl J Debono
Ivan Bajic Yakun Chang Juntong Cheng Alessio Degani
Yukihiro Bandoh Yao-Jen Chang Shyi-Chyi Cheng Carlos Roberto Del Blanco
Bingkun Bao Marc Chaumont Wen-Huang Cheng Weijian Deng
Qian Bao Berlin Chen Anoop Cherian Weijian Deng
Tom Bashford-Rogers Bo-Wei Chen Boon-Seng Chew Mohamed Deriche
Jordi Mongay Batalla Chen Chen Jui-Chiu Chiang Jian-Jiun Ding
Jenny Benois-Pineau Chun-Chi Chen Jen-Tzung Chien Yu Ding
Marco Bertini Chun-Fu Chen Chih-Yi Chiu Zewei Ding
Zhenpeng Bian Dongdong Chen Nam Ik Cho Jana Dittmann
Tiziano Bianchi Francine Chen Hyomin Choi Marek Domański
Haoye Dong Han Gao Raouf Hamzaoui Lei Huang
Tingting Dong Wei Gao Hong Han Likun Huang
Dejing Dou Yuan Gao Shizhong Han Meiyu Huang
Pengfei Dou Liuhao Ge Xian-Hua Han Rui Huang
Shaoyi Du Shiming Ge Yahong Han Tsung-Wei Huang
Jiali Duan Yongxin Ge Renlong Hang Xiaofeng Huang
Yueqi Duan Guoping Gong Zongbo Hao Xiaohua Huang
Abhimanyu Dubey Mingming Gong Choochart Haruechaiyasak Xiaoming Huang
Pinar Duygulu Xinyu Gong Carlo Harvey Xin Huang
Isao Echizen Jianping Gou Mahmoud Reza Hashemi Yue Huang
Peter Eisert Marco Grangetto Devamanyu Hazarika Tzu-Yi Hung
Hazim Kemal Ekenel Dan Grois Li He Jenq-Neng Hwang
Engin Erzin William I. Grosky Liang He Ichiro Ide
Ralph Ewerth Guanghua Gu Xiangnan He Tomohiro Ikai
Baojie Fan Renshu Gu Yuwen He Bogdan Ionescu
Jianping Fan Jian Guan Shintami C. Hidayati Maria Silvia Ito
Liyue Fan Valia Guerra Ones Lyndon Hill I-Hong Jhuo
Yuchen Fan Hongxing Guo Tuan Hoang Nguyen Anh Rongrong Ji
Zhipeng Fan Jingcai Guo Xiaopeng Hong Wen Ji
Zhiwen Fan Jingcai Guo Mohammad Hosseini Wen Ji
Leyuan Fang Jingda Guo Mohammad Hosseini Chuanmin Jia
Sergio M Faria Shuaishuai Guo Guanqun Hou Wenjing Jia
Reuben Farrugia Song Guo Junhui Hou Meng Jian
Attilio Fiandrotti Xun Guo Li Hou Junjun Jiang
Karel Fliegel Yiluan Guo Cheng-Hsin Hsu Tingting Jiang
Jingjing Fu Yiwen Guo Han Hu Wenbin Jiang
Qingtao Fu Yuanfang Guo Haoji Hu Xi Jiang
Xianping Fu Yuchen Guo Junlin Hu Xiaoyan Jiang
Takahiko Furuya Zongyu Guo Min-Chun Hu Jiren Jin
Neeraj Gadgil Zongyu Guo Tao Hu Xin Jin
Ji Gan Chitralekha Gupta Wei Hu Rolf Jongebloed
Tian Gan Cathal Gurrin Chih-Wei Huang Brendan Jou
Ernest D Ganaa Jesús Gutiérrez Huaxi Huang Kashyap K.R. Kambhatla
Guanyu Gao Paul Haimes Kan Huang Li-Wei Kang
Xiangui Kang Chi-Chun Lee Dalton Lin Miaomiao Lou
Akankshya Kar Hyowon Lee Hsueh-Yi Lin Chun-Shien Lu
Kasun Karunanayaka Sanghoon Lee Jianxin Lin Guoyu Lu
Mohamed Abosaief Kassab Chuankun Li Jingqiang Lin Shao-Ping Lu
Birendra Kathariya Gaoling Li Wei-Yang Lin Yao Lu
Angeliki Katsenou Hongyan Li Suiyi Ling Yue Lu
Marie Katsurai Hongzhi Li Bei Liu Lannan Luo
Mohammad Kazemi Jianwu Li Bo Liu Wenfeng Luo
Naimul Mefraz Khan Jing Li Ding Liu Yong Luo
Changick Kim Jingjing Li Guangchi Liu Yong Luo
Changick Kim Kristen Li Hantao Liu Ryan Lustig
Han-Ul Kim Leida Li Hao Liu Bingpeng Ma
Jongyoo Kim Li Li Haomiao Liu Chongyang Ma
Woojae Kim Liang Li Jiankun Liu Fei Ma
Yeong Jun Koh Lianqiang Li Jingen Liu He Ma
Stefanos Kollias Lin Li Kuan-Hsien Liu Kede Ma
Jan Koloda Lizhong Li Kun Liu Qiang Ma
Takahiro Komamizu Shaozi Li Lingbo Liu Shiheng Ma
Jari Korhonen Shuai Li Peng Liu Siwei Ma
Harald Kosch Shuangqun Li Ping Liu Zhan Ma
Lukas Krasula Shujun Li Qiegen Liu Zhanyu Ma
Gosala Kulupana Site Li Rui Liu Debanjan Mahata
Anurag Kumar Teng Li Ruixu Liu Guangcan Mai
Yaman Kumar Wanhua Li Sheng Liu Emanuele Maiorana
Minoru Kuribayashi Xu Li Tsung-Jung Liu Qirong Mao
Jui-Hsin Lai Yehao Li Weifeng Liu Manuel J. Marín-Jiménez
Yu-Kun Lai Yue Li Xinchen Liu Manuel Martinello
Shang-Hong Lai Yuxi Li Xueliang Liu Marc Masana
Aris Lalos Zhengguo Li Yi Liu Reji Mathew
Long Lan Zhi Li Yinglu Liu Seksan Mathulaprangsan
Xiangyuan Lan Zhuoran Li Yu-Shen Liu Puneet Mathur
Jochen Lang Chia-Kai Liang Chengjiang Long Shaohui Mei
Chaker Larabi Haoyi Liang Zhiling Long Hardik Meisheri
Bowon Lee Chunze Lin Yi Loo Rufael Mekuria
Hongying Meng Dilruk Perera Shin'Ichi Satoh A Subramanyam
Zibo Meng Cristian Perra Peter Schelkens Chang Sun
Olivier Meur Matthieu Perreira Da Silva Klaus Schoeffmann Heming Sun
Vasileios Mezaris Antonio Pinheiro John See Jiande Sun
Qiguang Miao William J.-P. Puech Mustafa Sert Lifeng Sun
Zhenjiang Miao Fei Qi Jie Shao Xiaoxiao Sun
Simone Milani Lei Qi Bo Shen Yangfan Sun
Vahid Mirjalili Na Qi Jianghao Shen Yibao Sun
Philipp Moll Xiaojun Qi Jie Shen Wei lian Suo
Marie-Jose Montpetit Buyue Qian Jun Shen Thomas Swearingen
Yuta Nakashima Xueming Qian Liyue Shen Seishi Takamura
Aous T. Naman Linbo Qing Roger Shen Mengxuan Tan
Manish Narwaria Jiayan Qiu Rui Shen Robby T. Tan
Ambarish Natu Kai Qiu Yeji Shen Chang Tang
Hung Nguyen Zhaofan Qiu Boxin Shi Chih-Wei Tang
Nhu Q Nguyen Maria Paula Queluz Haichao Shi Sheng Tang
Weizhi Nie Georges Quénot Shu Shi Youbao Tang
Xiushan Nie Saimunur Rahman Yuxuan Shi Zheng Tang
Nikos Nikolaidis Aakanksha Rana Huang-Chia Shih Georg Thallinger
Naoko Nitta Yongming Rao Masato Shirai Nikolaos Thomos
Paulo Nunes Yogesh Rawat Carlos N Silla Lei Tian
Makoto Okabe Bappaditya Ray Jae-Young Sim Christian Timmerer
Vincent Oria Kui Ren Ashutosh Singla Ngoc-Trung Tran
Yingwei Pan Liangliang Ren Luis Soares Juan Ramón Troncoso Pastoriza
Wai Man Raymond Pang Nuno Rodrigues Faranak Sobhani Ngoc Trung
Shivam Parikh Hoda Roodaki Houbing Song Chia-Ming Tsai
Shashikant Patil Nina Rosa Li Song Sik-Ho Tsang
Vikram Patil Sankarasrinivasan S Qing Song Pei-Kuei Tsung
Houwen Peng Mukesh Saini Sibo Song Stefano Tubaro
Wen-Hsiao Peng Dimitrios Sakkos Eckehard Steinbach Nkiruka Uzuegbunam
Xi Peng Enrique Sánchez-Lozano Haakon K Stensland Giuseppe Valenzise
Xiulian Peng Maria Santamaria Guan-Ming Su Avinash Varna
Yan-Tsung Peng Nabil Sarhan Haonan Su Stefanos Vrochidis
Yuxin Peng Andrej Satnik Yong Su Ji Wan
Jun Wan Zhen Wang Hongteng Xu Yasin Yazici
Bin Wang Zheng Wang Jianfeng Xu Mao Ye
Gaoang Wang Zhenzhen Wang Xiaozhong Xu Minxiang Ye
Hongxing Wang Zhiyong Wang Yiling Xu Xinchen Ye
Hsin-Min Wang Zhongdao Wang Yuanlu Xu Yun Ye
Huogen Wang Zhongyuan Wang Yuhui Xu Xi Yin
Jianfeng Wang Ziwei Wang Zengmin Xu Yifang Yin
Jiangping Wang Shikui Wei Zongyi Xu Zhenqiang Ying
Jinglu Wang Wei Wei Feng Xue Jianming Yong
Lizhi Wang Xiu-Shen Wei Jing-Hao Xue Satoshi Yoshida
Mea Wang Yingcan Wei Takehiro Yamamoto Atsuo Yoshitaka
Nannan Wang Bihan Wen Toshihiko Yamasaki Ilsun You
Pichao Wang Wei Wen Bo Yan Biting Yu
Qifei Wang Xin Wen Haibin Yan Dongfei Yu
Qin Wang Chaoqun Weng Jun Yan Haichao Yu
Shangfei Wang Fangyu Wu Xiyu Yan Jiahui Yu
Shanshe Wang Gengshen Wu Keiji Yanai Junqing Yu
Shaojie Wang Hefeng Wu Cheng Yang Mali Yu
Shuhui Wang Jinjian Wu Fei Yang Shengtao Yu
Sicheng Wang Junru Wu Fuzheng Yang Tan Yu
Song Wang Wei Wu Jufeng Yang Xiyu Yu
Suyu Wang Xiao Wu Meng Yang Yi Yu
Tinghuai Wang Yuhang Wu Qize Yang Jianbo Yuan
Xiangyu Wang Yuwei Wu Shuai Yang Ye Yuan
Xiaobo Wang Sen Xiang Wenhan Yang Huanjing Yue
Xingzheng Wang Chunxia Xiao Wenmian Yang Inyong Yun
Yaxing Wang Jing Xiao Xiaopeng Yang Pietro Zanuttigh
Yizhou Wang Xiaohua Xie Yang Yang Huanqiang Zeng
Yong Wang Zhihuai Xie Yiding Yang Liaoyuan Zeng
Yong Wang Qi Xin Yi-Hsuan Yang Qiang Zeng
Yongtao Wang Junliang Xing Yujiu Yang Yi-Chong Zeng
Yu Wang Jinbo Xiong Zhengyuan Yang Zhiyuan Zha
Yuantian Wang Baohan Xu Hantao Yao Liming Zhai
Yuanyuan Wang Chang Xu Kim Yap Baochang Zhang
Cheng Zhang Xiangrong Zhang H. Vicky Zhao Chunluan Zhou
Dejun Zhang Xiangrong Zhang Pinghua Zhao Jun Zhou
Fan Zhang Xue Zhang Sicheng Zhao Lijuan Zhou
Fan Zhang Yaping Zhang Tiesong Zhao Wei Zhou
Guigang Zhang Yi Zhang Xibin Zhao Wengang Zhou
Guofeng Zhang Yingxue Zhang Ziping Zhao Xiaolong Zhou
Junjie Zhang Yongbing Zhang Cairong Zhao Xiuzhuang Zhou
Ke Zhang Yuan Zhang Heliang Zheng Yipeng Zhou
Lefei Zhang Zhaobin Zhang Huiru Zheng Zhi Zhou
Lei Zhang Zhao-Xiang Zhang Wei-Shi Zheng Guibo Zhu
Ning Zhang Zhendong Zhang Wenzhao Zheng Lingyu Zhu
Shiliang Zhang Zheng Zhang Xiaozhen Zheng Yanjun Zhu
Tianzhu Zhang Zhizheng Zhang Zhedong Zheng Yingying Zhu
Wei Zhang Zhongyan Zhang Chen Zhong Peixian Zhuang
Weiming Zhang Baoquan Zhao Sheng-Hua Zhong Jeffrey Zou
Weitong Zhang Bin Zhao Zhun Zhong Ivan Zupancic
Sponsors
Organizational Sponsors
Whova Event App User Tutorial
Get the Most out of Your Event
How to Download the Whova App
The Whova event app is free for event attendees. To download the app, follow the steps below:
iOS: open the App Store on your mobile device and search for 'Whova'.
Android: open Google Play and search for 'Whova', or scan the QR code.
(You can click the left gray button to download the Whova App)
(just for Android)
How to Join the Meeting
1. Enter the email address you used for event registration or use your social media account.
2. After logging in, search for 'ICME' to find your event.
3. Click the join button and enter the event invitation code: iicie
How to Vote
On Wednesday, July 10, there is a Star Innovator Session, and you can vote for the finalists from 11:00 am to 5:45 pm. You can find the voting information in 'Messages'; the activity will be shown at the top.
Name: ICME2019 Password: icme2019