ECE 590SIP Graduate Seminar in Speech, Fall 2024¶
This is a one-hour graduate seminar on topics of speech processing. Each student will be required to give a research presentation, either alone or together with a small group.
Day and time: Mondays, 11:00-11:50
Place: ECEB 1015 or via Zoom. Zoom link is posted on CampusWire.
Instructor: Mark Hasegawa-Johnson (jhasegaw)
Discussion: CampusWire
Schedule¶
Week |
Date |
Presenter |
Topic |
---|---|---|---|
1 |
26-Aug |
Mark Hasegawa-Johnson |
Organization |
2 |
02-Sep |
No class |
|
3 |
09-Sep |
Bornali Phukon |
ASR Semantic Scoring |
4 |
16-Sep |
Steven Guo |
Auditory feedback using accent conversion |
5 |
23-Sep |
Zhongweiyang Xu |
Neural Speech Codec |
6 |
30-Sep |
Jialu Li and Xiuwen Zheng |
Enhancing Child Vocalization Classification with Fuzzy Phonetics for Assisting Autism Diagnosis and Fine-Tuning Automatic Speech Recognition for People with Parkinson’s: An Effective Strategy for Enhancing Speech Technology Accessibility |
7 |
07-Oct |
No class |
|
8 |
14-Oct |
Mingyue Huo |
Beyond Speaker Identity: Text-Guided Target Speech Extraction |
9 |
21-Oct |
Kamila Abdiyeva and Katya Yegorova |
Model Explainability |
10 |
28-Oct |
Jonghwan Na |
Cohort-Sensitive Labeling and Stutter Augmentation for Disordered Speech Recognition |
11 |
04-Nov |
Jocelyn Xu |
Separate What You Describe: Language-Queried Audio Source Separation and Separate Anything You Describe |
12 |
11-Nov |
Mark Hasegawa-Johnson |
|
13 |
18-Nov |
Shreyanka Sinha |
|
14 |
02-Dec |
No class |
|
15 |
09-Dec |
Priyam Mazumdar and Haolong Zheng |
|
16 |
16-Dec |
||