The 2nd International Workshop On Robust Understanding of Low-quality Multimedia Data
Unitive Enhancement, Analysis, and Evaluation (UoLMM)
In Conjunction with ACM MM 2022
Lisbon, Portugal | 10-14 October

Introduction

Low-quality multimedia data (e.g., low resolution, low illumination, defects, and blurriness) often pose a challenge to content understanding, as algorithms are mainly developed under ideal conditions (high resolution and good visibility). To alleviate this problem, data enhancement techniques (super-resolution, low-light enhancement, deraining, and inpainting) have been proposed to restore low-quality multimedia data. Efforts are also being made to develop content understanding algorithms that remain robust in adverse weather and poor lighting conditions. Quality assessment techniques aimed at evaluating the analytical quality of data have also emerged. Even though these topics are mostly studied independently, they are tightly related when it comes to ensuring a robust understanding of multimedia content. For example, enhancement should maintain semantic consistency for subsequent analysis, while quality assessment should take into account the comprehensibility of multimedia data.

This workshop aims to bring together researchers working on the three topics of enhancement, analysis, and evaluation of low-quality multimedia data to share ideas and discuss current developments and future directions.

Call for Papers

The main goal of this workshop is to bring together leading experts from academia and industry in the three fields of multimedia data enhancement, multimedia content analysis, and quality evaluation to share ideas and discuss current trends, developments, issues, and future directions, with the vision of bridging the three fields for a robust understanding of low-quality data in the broader context of multimedia applications. Topics of interest include, but are not limited to:

  • Emerging trends in data enhancement, analysis, or evaluation
  • Enhancement for the robust content understanding of low-quality data
  • Quality evaluation for the robust content understanding of low-quality data
  • Assessment-guided visual content enhancement of low-quality data
  • Joint embedding learning for enhancement, analysis, and evaluation
  • Real-world systems and applications

Paper Submission

OpenReview website: https://openreview.net/group?id=acmmm.org/ACMMM/2022/Workshop/UoLMM

After signing in to the ACM MM 2022 submission site as an author, please choose our workshop name to submit the paper. Submissions may vary in length from 4 to 8 pages, plus additional pages for references; i.e., the reference page(s) do not count toward the 4-to-8-page limit. There is no distinction between long and short papers; authors may decide on the appropriate length of their paper. All papers will be peer-reviewed by experts in the field and will receive at least three reviews. Acceptance will be based on relevance to the workshop, scientific novelty, and technical quality. The workshop papers will be published in the ACM Digital Library.

Please refer to the main conference site for submission policies on blinding, originality, author lists, and arXiv publications.

Important Dates

  • Paper submission deadline:    10th July 2022 (extended from 1st July 2022)
  • Notification of acceptance:      29th July 2022
  • Camera ready submission:      14th August 2022
  • Workshop Date (Hybrid):        14th October 2022

Keynote Speakers

Prof. Wenjun Zeng

IEEE Fellow

EIT Institute for Advanced Study

Title: Unlocking the Potential of Disentangled Representation for Robust Media Understanding

Abstract: It has been argued that for AI to fundamentally understand the world around us, it must learn to identify and disentangle the underlying explanatory factors hidden in the observed environment of low-level sensory data. In this talk, I will first provide an overview of the recent developments in disentangled representation learning and identify some major trends. I will then present some applications of this powerful concept for robust media processing and understanding in tasks such as image restoration, super-resolution, classification, person re-ID, depth estimation, etc. I will also discuss some future directions.

Biography: Prof. Wenjun (Kevin) Zeng has been a Chair Professor and Vice President for Research of the Eastern Institute for Advanced Study, Ningbo, China, since Oct. 2021. Prior to that, he was a Sr. Principal Research Manager and a member of the Senior Leadership Team at Microsoft Research Asia, where he led the video analytics research powering the Microsoft Cognitive Services, Azure Media Analytics Services, Microsoft Office, Dynamics, and Windows Machine Learning. He was with the Computer Science Dept. of Univ. of Missouri from 2003 to 2016, most recently as a Full Professor. Prior to that, he worked for PacketVideo Corp, San Diego, CA; Sharp Labs of America, Camas, WA; Bell Labs, Murray Hill, NJ; and Panasonic Technology, Princeton, NJ. He has contributed significantly to the development of international standards (ISO MPEG, JPEG2000, and Open Mobile Alliance). He received his B.E., M.S., and Ph.D. degrees from Tsinghua Univ., the Univ. of Notre Dame, and Princeton Univ., respectively. He is on the Editorial Board of the International Journal of Computer Vision, and was an Associate Editor-in-Chief, Associate Editor, or Steering Committee member for a number of IEEE journals. He has served as the General Chair or TPC Chair for several IEEE conferences (e.g., ICME’2018, ICIP’2017). He is a Fellow of the IEEE.

Prof. Weisi Lin

IEEE Fellow

Nanyang Technological University

Title: Visual Signal Assessment, Analysis and Enhancement for Low-resolution or Varying-illumination Environment

Abstract: More often than not, practical application scenarios require systems to deal with input visual signals captured at low resolution/quality or under varying environmental illumination. This talk will introduce recent related research in super-resolution reconstruction, signal quality assessment, content enhancement, and person re-identification under low resolution or varying illumination. We will also discuss possible new research directions to advance the relevant techniques.

Biography: Prof. Weisi Lin received the Ph.D. degree from King’s College, London University, U.K. He is currently a Professor with the School of Computer Science and Engineering, Nanyang Technological University. His areas of expertise include image processing, perceptual signal modeling, video compression, and multimedia communication, in which he has published over 200 journal papers and over 230 conference papers, filed seven patents, and authored two books. He has been an invited/panel/keynote/tutorial speaker at over 20 international conferences, as well as a Distinguished Lecturer of the IEEE Circuits and Systems Society (2016–2017) and the Asia-Pacific Signal and Information Processing Association (APSIPA) (2012–2013). He is a Fellow of the IET, and an Honorary Fellow of the Singapore Institute of Engineering Technologists. He has been a Technical Program Chair for IEEE ICME 2013, PCM 2012, QoMEX 2014, and IEEE VCIP 2017. He has been an Associate Editor for the IEEE Transactions on Image Processing, the IEEE Transactions on Circuits and Systems for Video Technology, the IEEE Transactions on Multimedia, and the IEEE Signal Processing Letters. He is a Fellow of the IEEE.


Dr. Jinwei Gu

Executive R&D Director

SenseBrain

Title: Computational Imaging on Mobile Phones

Abstract: Computational imaging refers to sensing the real world with optimally designed, task-specific, multi-modality image sensors and optics that actively probe key visual information. Together with advances in AI algorithms and powerful hardware, computational imaging has significantly improved the image and video quality of mobile phones in many aspects, which not only benefits computer vision tasks but also results in novel hardware, such as AI image sensors, AI ISP (Image Signal Processing) chips, and AI camera systems. In this talk, I will present several recent research results, including high-quality image restoration and accurate depth estimation from time-of-flight sensors or monocular videos, as well as some of the latest computational photography products in smartphones, including under-display cameras, AI image sensors, and AI ISP chips. I will also lay out several open challenges and future research directions in this area.

Biography: Dr. Jinwei Gu is the R&D Executive Director of SenseBrain (aka SenseTime USA). His current research focuses on low-level computer vision, computational photography, computational imaging, smart visual sensing and perception, and appearance modeling. He obtained his Ph.D. degree in 2010 from Columbia University, and his B.S. and M.S. from Tsinghua University in 2002 and 2005, respectively. Before joining SenseTime, he was a senior research scientist at NVIDIA Research from 2015 to 2018. Prior to that, he was an assistant professor at the Rochester Institute of Technology from 2010 to 2013, and a senior researcher in the media lab of Futurewei Technologies from 2013 to 2015. He serves as an associate editor for the IEEE Transactions on Computational Imaging (TCI) and the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), and has served as an area chair for ICCV 2019, ECCV 2020, and CVPR 2021, and an industry chair for ICCP 2020. He has been an IEEE Senior Member since 2018. His research has been successfully transferred to many products, such as the NVIDIA CoPilot SDK and DriveIX SDK, as well as super-resolution, super-night, portrait-restoration, and RGBW solutions that are widely used in many flagship mobile phones.

Workshop Organizers

Dr. Liang Liao

liang.liao AT ntu.edu.sg

NTU, Singapore

Dr. Dan Xu

danxu AT cse.ust.hk

HKUST, Hong Kong, China

Dr. Yang Wu

dylanywu AT tencent.com

Tencent, China

Dr. Xiao Wang

wangxiao2021 AT wust.edu.cn

WUST, China

Dr. Jing Xiao

jing AT whu.edu.cn

WHU, China