ECE 590SIP Graduate Seminar in Speech, Fall 2024

This is a one-hour graduate seminar on topics in speech processing. Each student is required to give a research presentation, either alone or as part of a small group.

Day and time: Mondays, 11:00-11:50

Place: ECEB 1015 or via Zoom. The Zoom link is posted on CampusWire.

Instructor: Mark Hasegawa-Johnson (jhasegaw)

Discussion: CampusWire

Schedule

| Week | Date | Presenter | Topic |
|------|------|-----------|-------|
| 1 | 26-Aug | Mark Hasegawa-Johnson | Organization |
| 2 | 02-Sep | No class | |
| 3 | 09-Sep | Bornali Phukon | ASR Semantic Scoring |
| 4 | 16-Sep | Steven Guo | Auditory feedback using accent conversion |
| 5 | 23-Sep | Zhongweiyang Xu | Neural Speech Codec |
| 6 | 30-Sep | Jialu Li and Xiuwen Zheng | Enhancing Child Vocalization Classification with Fuzzy Phonetics for Assisting Autism Diagnosis and Fine-Tuning Automatic Speech Recognition for People with Parkinson’s: An Effective Strategy for Enhancing Speech Technology Accessibility |
| 7 | 07-Oct | No class | |
| 8 | 14-Oct | Mingyue Huo | Beyond Speaker Identity: Text-Guided Target Speech Extraction |
| 9 | 21-Oct | Kamila Abdiyeva and Katya Yegorova | Model Explainability |
| 10 | 28-Oct | Jonghwan Na | Cohort-Sensitive Labeling and Stutter Augmentation for Disordered Speech Recognition |
| 11 | 04-Nov | Jocelyn Xu | Separate What You Describe: Language-Queried Audio Source Separation and Separate Anything You Describe |
| 12 | 11-Nov | Mark Hasegawa-Johnson | A Theory of Unsupervised Speech Recognition |
| 13 | 18-Nov | Shreyanka Sinha | |
| 14 | 02-Dec | No class | |
| 15 | 09-Dec | Priyam Mazumdar and Haolong Zheng | |
| 16 | 16-Dec | | |