| Date | Lecture (and links to advance videos when applicable) | Links and Notes |
Reading Suggestions | Comments | |
| Intro and Basics | 1/20 | Introduction and Logistics | Intro & logistics | Slide set | |
| 1/22 | Edge AI and Mitigating Resource Bottlenecks |
Bottlenecks | Slide set | ||
| 1/27 | Overfitting and Self-supervised Learning | Group requests due | |||
| 1/29 | Edge AI and Mitigating the Data Bottleneck | All groups assigned | |||
| The Data Bottleneck: Self- Supervised Data- Efficient Learning for IoT | 2/3 | Class Project Ideas Introduction. | Projects | Slide set | |
| 2/5 | Fundamentals of Self-Supervised Learning: Tokenization, Pre-training, Fine-tuning, Backbone Architectures (e.g., auto-encoders, transformers, etc), and Issues with Scaling Laws for IoT Applications | Self-Supervised Learning (SSL) | Slide set | ||
| 2/10 | Self-supervised Learning Architectures for Time-Series Data: RNNs, LSTMs, and State Space Models | SSL Models for Time- Series Data | 1. Schmidt, Robin M. "Recurrent neural networks (rnns): A gentle introduction and overview." arXiv preprint arXiv:1912.05911 (2019). 2. Kexin Zhang, Qingsong Wen, Chaoli Zhang, Rongyao Cai, Ming Jin, Yong Liu, James Y. Zhang et al. "Self-supervised learning for time series analysis: Taxonomy, progress, and prospects." IEEE transactions on pattern analysis and machine intelligence 46, no. 10 (2024): 6775-6794. 3. Albert Gu, Karan Goel, and Christopher Ré. "Efficiently modeling long sequences with structured state spaces." arXiv preprint arXiv:2111.00396 (2021). 4. Albert Gu, and Tri Dao. "Mamba: Linear-time sequence modeling with selective state spaces." In First conference on language modeling. 2024. |
Note: Project title and abstract due | |
| 2/12 | Representation Learning from Multimodal Sensor Data |
Multimodal Intro HW1 Out |
1. Chao
Zhang, Zichao Yang, Xiaodong He, and Li Deng. "Multimodal
intelligence: Representation learning, information fusion, and
applications." Journal of Selected Topics in Signal Processing, 2020. 2. Dave Vedant, Fotios Lygerakis, and Elmar Rueckert. "Multimodal visual-tactile representation learning through self-supervised contrastive pre-training." In 2024 IEEE ICRA, pp. 8013-8020. IEEE, 2024. 3. Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, and Cordelia Schmid. "Videobert: A joint model for video and language representation learning." In Proceedings of the IEEE/CVF international conference on computer vision, pp. 7464-7473. 2019. |
||
| 2/17 | Representation Learning from Multimodal Sensor Data (Student Led) |
G4 Multimodal Papers |
1. Shengzhong Liu, Tomoyoshi Kimura, Dongxin Liu, Ruijie Wang, Jinyang Li, Suhas Diggavi, Mani Srivastava, and Tarek Abdelzaher. "Focal: Contrastive learning for multimodal time-series sensing signals in factorized orthogonal latent space." Advances in Neural Information Processing Systems 36 (2023): 47309-47338. 2. Chen, Yatong, Chenzhi Hu, Tomoyoshi Kimura, Qinya Li, Shengzhong Liu, Fan Wu, and Guihai Chen. "SemiCMT: Contrastive cross-modal knowledge transfer for iot sensing with semi-paired multi-modal signals." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, no. 4 (2024): 1-30. 3. Xiaomin Ouyang, Jason Wu, Tomoyoshi Kimura, Yihan Lin, Gunjan Verma, Tarek Abdelzaher, and Mani Srivastava. "MMbind: Unleashing the potential of distributed and heterogeneous data for multimodal learning in iot." In Proceedings of the 23rd ACM Conference on Embedded Networked Sensor Systems, pp. 491-503. 2025. 4. Tomoyoshi Kimura, Xinlin Li, Osama Hanna, Yatong Chen, Yizhuo Chen, Denizhan Kara, Tianshi Wang et al. "InfoMAE: Pair-efficient cross-modal alignment for multimodal time-series sensing signals." In Proceedings of the ACM on Web Conference 2025, pp. 3084-3095. 2025. 5. Li, Zechen, Shohreh Deldari, Linyao Chen, Hao Xue, and Flora D. Salim. "SensorLLM: Aligning large language models with motion sensors for human activity recognition." In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pp. 354-379. 2025. 6. Yuwei Zhang, Kumar Ayush, Siyuan Qiao, A. Ali Heydari, Girish Narayanswamy, Maxwell A. Xu, Ahmed A. Metwally et al. "SensorLM: Learning the Language of Wearable Sensors." arXiv preprint arXiv:2506.09108 (2025). |
Debate #1 (20 min) Student led talk (45 min + 10 min Q&A) See note @48 for debate concluding remarks on Piazza. |
|
| 2/19 | Self-supervised Learning
from Frequency Domain Data |
Frequency Domain Intro HW2 Out |
Slide set | Debate #2 (20 min) | |
| 2/24 | Self-supervised Learning from Frequency Domain Data (Student Led) |
G3 Frequency Domain Papers |
1. Shuochao Yao, Ailing Piao, Wenjun Jiang, Yiran Zhao, Huajie Shao, Shengzhong Liu, Dongxin Liu, Jinyang Li, Tianshi Wang, Shaohan Hu, Lu Su, Jiawei Han and Tarek Abdelzaher, "STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks," In Proc. The Web Conference (WWW), San Francisco, CA, May 2019.
2. Dongxin Liu, Tianshi Wang, Shengzhong Liu, Ruijie Wang, Shuochao Yao, and Tarek Abdelzaher. "Contrastive self-supervised representation learning for sensing signals from the time-frequency perspective." In 2021 International Conference on Computer Communications and Networks (ICCCN), pp. 1-10. IEEE, 2021. 3. Yuan Gong, Cheng-I. Lai, Yu-An Chung, and James Glass. "Ssast: Self-supervised audio spectrogram transformer." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 10, pp. 10699-10709. 2022. 4. Setareh Rahimi Taghanaki, Michael Rainbow, and Ali Etemad. "Self-supervised human activity recognition with localized time-frequency contrastive representation learning." IEEE Transactions on Human-Machine Systems 53, no. 6 (2023): 1027-1037. 5. Denizhan Kara, Shengzhong Liu, Jinyang Li, Dongxin Liu, Tianshi Wang, Ruijie Wang, Yizhuo Chen, Yigong Hu, Tarek Abdelzaher, "FreqMAE: Frequency-Aware Masked Autoencoder for Multi-Modal IoT Sensing," In Proc. The Web Conference (WWW), May 2024. 6. Denizhan Kara, Tomoyoshi Kimura, Yatong Chen, Jinyang Li, Ruijie Wang, Yizhuo Chen, Tianshi Wang, Shengzhong Liu, Lance Kaplan, Joydeep Bhattacharyya, Tarek Abdelzaher, "PhyMask: An Adaptive Masking Paradigm for Efficient Self-Supervised Learning in IoT," In Proc. 22nd ACM Conference on Embedded Networked Sensor Systems (SenSys), Hangzhou, China, November 2024. |
HW2 Debate (20 min) Student led talk (45 min + 10 min Q&A) |
|
| 2/26 | Handling Spatial-Temporal IoT Data |
Spatial Temporal Intro HW3 Out |
Slide Set | 2-page project proposal due | |
| 3/3 | Handling Spatial-Temporal IoT Data (Student Led) | G2 | 1. Yinghui Zhang, Hu An, Yaxuan Xing, Yang Liu, and Tiankui Zhang. "Learning temporal and spatial features jointly: A unified framework for space-time data prediction in industrial IoT networks." IEEE Sensors Journal 23, no. 16 (2023): 18752-18764.
2. Liu, Jing, et al. "Distributional and spatial-temporal robust representation learning for transportation activity recognition." Pattern Recognition (2023). 3. Porter Jenkins, Ahmad Farag, Suhang Wang, and Zhenhui Li. "Unsupervised representation learning of spatial data via multimodal embedding." In Proceedings of the 28th ACM international conference on information and knowledge management, pp. 1993-2002. 2019. 4. Yizhuo Chen, Tianchen Wang, You Lyu, Yanlan Hu, Jinyang Li, Tomoyoshi Kimura, Hongjue Zhao, Yigong Hu, Denizhan Kara, and Tarek Abdelzaher. "Spar: Self-supervised placement-aware representation learning for multi-node iot systems." arXiv e-prints (2025): arXiv-2505. 5. Tianchen Wang, Yizhuo Chen, Hongjue Zhao, You Lyu, Jinyang Li, Tomoyoshi Kimura, Yigong Hu et al. "On Network-Efficient Multimodal Multi-Vantage Foundation Models for Distributed Sensing." In 2025 IEEE 22nd International Conference on Mobile Ad-Hoc and Smart Systems (MASS), pp. 19-27. IEEE, 2025. 6. Pengrui Quan, Brian Wang, Kang Yang, Liying Han, and Mani Srivastava. "Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges." arXiv preprint arXiv:2505.11618 (2025). |
HW3 Debate (20 min) br> Student led talk (45 min + 10 min Q&A) |
|
| Data Curation and "Faking" | 3/5 | Physical Data Curation and Augmentation | HW4 Out | Slide Set | |
| 3/10 | Physical Data Curation and Augmentation (Student Led) | G6 | 1. Chenzhi Hu, Yatong Chen, Denizhan Kara, Shengzhong Liu, Tarek Abdelzaher, Fan Wu, Guihai Chen, “OpenMAE: Efficient Masked Autoencoder for Vibration Sensing with Open-Domain Data Enrichment,” Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (ACM IMWUT), also presented in UbiComp, Espoo, Finland, October 2025.
2. Jeongwoo Ju, Heechul Jung, Yoonju Oh, and Junmo Kim. "Extending contrastive learning to unsupervised coreset selection." IEEE Access 10 (2022): 7704-7715 3. Sungnyun KIm, Sangmin Bae, and Se-Young Yun. "Coreset sampling from open-set for fine-grained self-supervised learning." In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 7537-7547. 2023. 4. Haizhong Zheng, Elisa Tsai, Yifu Lu, Jiachen Sun, Brian R. Bartoldson, Bhavya Kailkhura, Atul Prakash, “ELFS: LABEL-FREE CORESET SELECTION WITHPROXY TRAINING DYNAMICS,” ICLR 2025 5. Tianshi Wang, Jinyang Li, Ruijie Wang, Denizhan Kara, Shengzhong Liu, Davis Wertheimer, Antoni Martin, Raghu Ganti, Mudhakar Srivatsa, and Tarek Abdelzaher, “SudokuSens: Enhancing Deep Learning Robustness for IoT Sensing Applications using a Generative Approach,” In Proc. ACM Sensys, Istanbul, Turkey, November 2023. 6. Tianshi Wang, Qikai Yang, Ruijie Wang, Dachun Sun, Jinyang Li, Yizhuo Chen, Yigong Hu, Chaoqi Yang, Tomoyoshi Kimura, Denizhan Kara, Tarek F. Abdelzaher, “Fine-grained Control of Generative Data Augmentation in IoT Sensing,” In Proc. 38th Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, December 2024. |
HW4 Debate (20 min) Student led talk (45 min + 10 min Q&A) |
|
| 3/12 | Project Elevator Talks | ||||
| Break | 3/17 | Spring Break |
|||
| 3/19 | |||||
| The Compute Bottleneck: Efficient Inference at the IoT Edge | 3/24 | Input Data Filtering |
|
||
| 3/26 | Model Reduction: Pruning, Quantization, Distillation |
HW5 Out G1 |
Instructor intro (20 min) Student led talk (45 min + 10 min Q&A) |
||
| 3/31 | Neural Network Architecture Search | HW5 Debate (20 min) | |||
| 4/2 | Mixture of Experts Cascades |
HW6 Out G7 |
Instructor intro (20 min) Student led talk (45 min + 10 min Q&A) |
||
| 4/7 | Timing Guarantees | HW6 Debate (20 min) | |||
| 4/9 | Energy Consumption and Thermal Issues |
HW7 Out |
|||
| 4/14 | Federated Learning, Distributed Fine-Tuning, and Test-Time Adaptation | G8 |
HW7 Debate (20 min) Student led talk (45 min + 10 min Q&A) |
||
| 4/16 | Closed loop control and related foundation models (RT-2, RT-X, etc) | HW8 Out | |||
| Ethics | 4/21 | Ethical and Societal Considerations | HW8 Debate (20 min) | ||
| 4/23 | |||||
| Student Projects | 4/28 | Student-led Final Project Presentations | |||
| 4/30 | Student-led Final Project Presentations | ||||
| 5/5 | Recap |
|
|||