CSCI 4052U: Machine Learning II

Faculty of Science, Ontario Tech University

Author

Ken Pu

This course is for students with a foundational understanding of machine learning. It explores advanced topics including, but not limited to, encoder/decoder architectures, attention mechanisms, and transformer-based models. The course demonstrates the application of these advanced architectures in creating state-of-the-art neural networks for various applications, such as language modeling (both masked and generative), computer vision, text-to-speech, speech recognition, multimodal learning, and Q-learning in agent-based AI systems. Additionally, it covers essential methodologies for developing and deploying AI systems, encompassing aspects like data pipelines, model management, training and fine-tuning, quantization, model distillation, knowledge infusion, and performance monitoring.

Required reading and reference material

Dive into Deep Learning

  • We will use a text, but only selected chapters.
  • The text is intented to provide foundational knowledge required to understand other assigned reading, and course notes to be provided on this website.

Additional material

  • Research papers will be provided on the topics we include in the curriculum.
  • Students are encourage to gain the ability to extract information from research papers on the topic of deep learning.