ECE 590SIP Guest Lecture by Hung-Yi Lee

Title: Recent Progress of Self-supervised Learning for Speech Processing

Abstract: Self-supervised learning (SSL) has shown to be vital for advancing research in natural language processing (NLP), computer vision (CV), and speech processing. The paradigm pre-trains a shared model on large volumes of unlabeled data and achieves state-of-the-art for various tasks with minimal adaptation. This talk first introduces the Speech processing Universal PERformance Benchmark (SUPERB), which is a leaderboard to benchmark the performance of the SSL models across a wide range of speech processing tasks. The results on SUPERB demonstrate that SSL representations show competitive generalizability across speech processing tasks. Then this talk will share the recent advances and findings on SSL models for speech processing done at the 2022 Eighth Frederick Jelinek Memorial Summer Workshop (JSALT). I'll discuss training a better SSL model, including compressing it, making it more robust, etc. We then discuss efficient ways to leverage SSL models in downstream tasks, including adapters and prompts.

Bio: Hung-yi Lee is an associate professor in the Department of Electrical Engineering of National Taiwan University (NTU), with a joint appointment at the Department of Computer Science & Information Engineering of the university. His recent research focuses on developing technology that can reduce the requirement of annotated data for speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He won Salesforce Research Deep Learning Grant in 2019, AWS ML Research Award in 2020, Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, Young Scholar Innovation Award from Foundation for the Advancement of Outstanding Scholarship in 2019, Ta-You Wu Memorial Award from Ministry of Science and Technology of Taiwan in 2019, and The 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan. He owns a YouTube channel teaching deep learning in Mandarin with about 100k Subscribers.