image

1. Preface

The test of our progress is not whether we add more to the abundance of those who have much, it is whether we provide enough for those who have little. — Franklin D. Roosevelt (\(32nd\) President of the United States of America)

This quote from Franklin D. Roosevelt (FDR) curiously captures the motivation behind designing good machine learning algorithms. Roosevelt was quoted saying this at his inaugural address on January 20, 1937. FDR’s economic policies, titled the New Deal were already in place. The above inaugural address rhetoric was directed towards strengthening and expanding the governmental programs to help US citizens who were struggling during the depression era of the 1930s. Machine learning (ML) deals with extracting meaningful information from a large corpus of data. Modern ML progress is often associated with large-scale learning. This includes larger datasets, larger models, and greater computational resources. However, a more meaningful measure of progress is the ability to learn effectively from limited and noisy data that does not always conform to scale-driven learning. In many real-world settings, data is expensive, scarce, unfair, imbalanced, and impractical, or even immoral to obtain. Traditionally, ML research seeks to right this imbalance by constructing equitable and fair algorithms through better inductive biases, representation learning, or apriori knowledge. Hence, the central challenge of machine learning is not simply to extend the abundant but to equitably use the little data we have by constructing intelligent algorithms.

2. Vision

ML and its applications in Artificial Intelligence (AI) have permeated everyday lives including healthcare, transport, education, energy, entertainment, and finance among others. People with no exposure to data science or engineering principles are aware of ML techniques. Industries use ML to attract customers, market and sell products, recruit for workplace, power software applications, improve efficiency and so on. The significant interest from everyday people and industries has ensured ML a permanent place in undergraduate and graduate curricula.

The applicability of ML in multiple disciplines including electrical and computer engineering, computer science, civil engineering, industrial engineering, biomedical engineering, and mechanical engineering, among many other fields, has given rise to multiple cross-listed courses across the curricula. There are a diverse set of researchers from all the above disciplines, who bring their individuality to ML writing and education. As a growing interdisciplinary field, ML and specifically deep learning (DL) is susceptible to redefining terminologies. Oftentimes, this redefinition is a necessary evolution. However, for beginners learning ML, changing terminologies is a cause for confusion. For instance, the words ‘high-dimensional features’, ‘activations’, ‘embeddings’, ‘latent representation’ and ‘hidden representations’ all refer to the same concept under different contexts. It is necessary to separate ML-research with ML-education, while also keeping up with ML evolution. As such, the purpose of this book is to allow students with no or limited knowledge about data-driven techniques and engineering to comprehend ML theory and techniques as well as acquire the skills to deploy such systems on real data.

3. Why this Book?

ML is often introduced either as a toolbox of algorithms or as a sequence of mathematically dense topics. It is standard practice for introductory courses to linearly discuss regression, regularization, generalization, optimization, and inference. Such a regimented approach to learning does not translate well today since students have already been exposed to ML-driven AI tools from a young age. The book differs from existing works in the following ways:

4. Target Audience

The book is primarily targeted towards undergraduate and graduate students at universities with a basic background in linear algebra, probability, and statistics, as well as python programming. The book provides the necessary prerequisite details for using deep learning tools like Pytorch. Links to additional resources are provided wherever applicable. The chapters are designed with in-person lecture time constraints in mind. Additionally, practitioners who want to engage with a deeper understanding of machine learning will find the book relevant.

5. Resources

The book is a dynamic and multimodal resource of education for students and instructors. The book consisted of lecture notes and slides used in classrooms, links to lecture videos, exercises, and interactive notebooks whenever possible. The book has been designed to function both as a self-study resource for students and practitioners and as a complete teaching toolkit for instructors. The book provides the following resources:

6. How to Use this Book?

The resources in this book caters to both students and educators.

7. Authors

image

Ghassan AlRegib is currently the John and Marilu McCarty Chair Professor in the School of Electrical and Computer Engineering at the Georgia Institute of Technology. In the Omni Lab for Intelligent Visual Engineering and Science (OLIVES), he and his group work on robust and interpretable machine learning algorithms, uncertainty and trust, and human in the loop algorithms. The group has demonstrated their work on a wide range of applications such as Autonomous Systems, Medical Imaging, and Subsurface Imaging. The group is interested in advancing the fundamentals as well as the deployment of such systems in real-world scenarios. He has been issued several U.S. patents and invention disclosures. He is a Fellow of the IEEE. Prof. AlRegib is active in the IEEE. He served on the editorial board of several transactions and served as the TPC Chair for ICIP 2020, ICIP 2024, and GlobalSIP 2014. He was area editor for the IEEE Signal Processing Magazine. In 2008, he received the ECE Outstanding Junior Faculty Member Award. In 2017, he received the 2017 Denning Faculty Award for Global Engagement. He received the 2024 ECE Distinguished Faculty Achievement Award at Georgia Tech. He and his students received the Best Paper Award in ICIP 2019 and the 2023 EURASIP Best Paper Award for Image communication Journal. In addition, one of their papers is the best paper runner-up at BigData 2024. In 2024, he co-founded the AI Makerspace at Georgia Tech, where any student and any community member can access and utilize AI regardless of their background.

image

Mohit Prabhushankar received his Ph.D. degree in electrical engineering from the Georgia Institute of Technology (Georgia Tech) in 2021. He is currently a Postdoctoral Research Fellow and Instructor of Record in the School of Electrical and Computer Engineering at Georgia Tech in the Omni Lab for Intelligent Visual Engineering and Science (OLIVES). He is working in the fields of image processing, machine learning, active learning, healthcare, and robust and explainable AI. He is the recipient of the Best Paper award at ICIP 2019, Top Viewed Special Session Paper Award at ICIP 2020, and Best Paper runner-up award at BigData’24. He is the recipient of the ECE Outstanding Graduate Teaching Award, the CSIP Research award, and of the Roger P Webb ECE GRA Excellence award in 2022 and the Research Excellence Award in 2025. He has delivered short courses and tutorials at IEEE IV’23, ICIP’23, BigData’23, WACV’24, AAAI’24, CVPR’24, ICME’24, MIPR’24., WACV’25, and AAAI’25.

8. Disclaimer

All content of this book are part of ECE 4252/6252 course at Georgia Institute of Technology. Any re-use or distribution is not permitted without pre-approved permission. All these materials belong to, created by, and copyrighted for Ghassan AlRegib and Mohit Prabhushankar, Georgia Tech, 2021-2028.

9. Contributors

Contributors to the course and slides include Dr. Ahmad Mustafa, Dr. Motaz Alfarraj, Dr. Ashraf Alattar, and Dr. Chen Zhou.

Teaching Assistants with remarkable contributions include: Kuo-Wei Lai, Shiva Mahato, Michael Zhou, and Yogita Choudhary.

10. License

These lecture notes are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.