DS807: Applied machine learning
Comment
Entry requirements
Academic preconditions
The students are expected to be familiar with the following topics:
- Regression analysis.
- Basic unsupervised methods, including principal component analysis and clustering.
- Sampling techniques such as the Bootstrap, cross-validation, and data split (train and test), and how (and when) they should be applied.
- Basic understanding of “modern” approaches to statistical learning, including decision trees.
- Application of the above techniques in a programming language.
This knowledge can be obtained through the courses DS804: Data mining and machine learning; DS805: Multivariate Statistical Analysis and, if possible, DS810: Data Driven Decision Making.
Course introduction
The aim of the course is to enable the student to apply the most commonly used methods in machine learning.
The focus of the course to:
- Give the competence to setup a complete applied machine learning analysis from beginning to end.
- Give skills to perform classification and predictions using statistical and deep learning methods and to make critical assessments of the results.
- Give knowledge and understanding of a broad range of machine leaning techniques and to access their strengths and weaknesses when applied to data of varying types.
Expected learning outcome
The learning objective of the course is that the student demonstrates the ability to:
- Develop an understanding of the fundamental concepts of machine learning, including algorithms, models and practices.
- Implement good methods and practices for effective deployment of machine leaning systems
- Acquire practical competencies and hands-on experience in applying machine learning methods in quantitative and qualitative workflows using a variety of data types.
Content
The following main topics are included in the course. Under each topic presented the main focus will be on the applications of the models and methods:
• Classical statistical learning tools and their applications (shallow learners):
- Support vector machines
- Decision trees, boosting, random forests, and gradient boosting
- Ensembling
• Deep Learning Methods and their applications:
- Fully connected neural networks
- Convolutional neural networks
- Recurrent neural networks
- Training, regularization and optimization
- Autoencoders and variational autoencoders
- GANs
- Transformers and Generative AI
Models, applications and data will be inspired by cases posted on Kaggle and Hugging face.
Literature
- Deep Learning with Python by Francois Collet (ISBN10: 9781617294433)
See itslearning for syllabus lists and additional literature references.
Examination regulations
Exam element a)
Timing
Tests
Written take-home exam
EKA
Assessment
Grading
Identification
Language
Duration
Examination aids
To be announced during the course
ECTS value
Indicative number of lessons
Teaching Method
Indicative number of teaching hours covers more than 45 hours per semester, divided into llectures and exercises.
Activities during the study phase:
- Attending lectures, including computer lab session
- Solving ad hoc assigments
- Self study of various parts of the course material.