- Docente: Claudio Sartori
- Credits: 6
- SSD: ING-INF/05
- Language: English
- Teaching Mode: Traditional lectures
- Campus: Bologna
-
Corso:
Second cycle degree programme (LM) in
Telecommunications Engineering (cod. 9205)
Also valid for Second cycle degree programme (LM) in Computer Engineering (cod. 5826)
Second cycle degree programme (LM) in Automation Engineering (cod. 8891)
Second cycle degree programme (LM) in Automation Engineering (cod. 8891)
Second cycle degree programme (LM) in Artificial Intelligence (cod. 9063)
Second cycle degree programme (LM) in Telecommunications Engineering (cod. 9205)
Learning outcomes
At the end of the course, the student has an understanding of theoretical foundations, computational properties, and use cases for some of the most popular supervised and unsupervised machine learning techniques. In particular, the student is able to address tasks such as classification, clustering, and discovery of rules by using modern machine learning methods and libraries.
Course contents
Module 1 - Machine Learning (available for 75195 Data Mining M and for 81610 Machine Learning)
- What is Machine Learning: some history and motivating examples
- Theory of learning
- Supervised vs unsupervised learning
- Classification and regression
- Model Selection, validation and presentation of results
- Regression
- Classification with linear discrimination, decision trees, Bayesian inference, Support Vector Machines, k-nearest neighbors, logistic regression, random forests, adaboost
- Ensemble learning, boosting, bagging
- Association rules and the Apriori algorithm
- Clustering/segmentation with k-means, dbscan, Expectation Maximization, hierarchical methods, kernel methods
- Analysis of case studies
- CRISP-DM methodology
Pre-requisites for Module 1 - Machine Learning
- Fundamentals of programming
- Fundamentals of calculus and linear algebra
- Fundamentals of statistics and probabilities
Module 2 - Data Mining (Available for 75195 Data Mining M, for 81610 Machine Learning and for 91262 - Data Mining, Text Mining and Big Data Analytics - Module 1)
- Architectures of systems with data mining components
- Enterprise Data Warehouse
- Data Lake
- Case studies
Readings/Bibliography
Module 1 - Machine Learning
- Introduction to machine learning / Ethem Alpaydin. - 3. ed. - Cambridge : The MIT Press, 2014. - XXII, 613 p. - link to the online version
- Scikit-Learn [https://scikit-learn.org/stable/documentation.html], or Python Data Science Handbook
Module 2 - Data Mining
- Shearer C., The CRISP-DM model: the new blueprint for data mining, J Data Warehousing (2000); 5:13—22.
Teaching methods
Most course lectures are in "traditional" classrooms and exploit the slides. Case studies are also proposed based on open-source software.
Interaction is also stimulated with the use of consultation tools, such as Kahoot!
The laboratory activity for module 1 will be an integral part of the learning process.
Assessment methods
Module 1
The Verification of knowledge is tested through multiple choice questions. The minimum to pass is to answer correctly half + 1 of the questions.
The weight of this part is 33%.
The Verification of abilities will be tested in lab with the development of a program for the execution of a Machine Learning task on an assigned data set. The quality of the solution will be evaluated on the basis of the correctness of the approach, the correctness of the solution, the quality of the coding and of the documentation. The minimum to pass is to give a sensible approach and a reasonable coding.
It is also possible, on request, to have an oral examination, with possible outcomes between -3="no answer" and +3="correct answer", to be added to the weighted sum of the above-mentioned scores.
Module 2
The assessment is mainly oriented to knowledge and consists of some multiple choice questions administered together with the Verification of Knowledge of Module 1 (the 33% weight mentioned there for the students of 75195 Data Mining M includes the verification for Module 2).
Additional details on Assessment are available in the course page on https://virtuale.unibo.it
For the students of 75195 Data Mining M the assessment of Modules 1 and 2 is a unique exam administered in the same sitting.
For the students of 81610 Machine Learning the assessment, administered in a unique exam sitting, will cover Module 1 only.
For the students of 91262 - Data Mining, Text Mining and Big Data Analytics, the assessment on the Data Mining part will be administered together with the other two modules of that exam.
Teaching tools
- Projection of slides made available before the lectures
- Kahoot! for class interaction
- https://virtuale.unibo.it for distribution of teaching materials, self-evaluation activities, forums
- Python
- Jupyter notebooks (Anaconda or Google Colab)
-
This class is supported by DataCamp, a learning platform for data science. DataCamp statement: "Learn R, Python and SQL the way you learn best through a combination of short expert videos and hands-on-the-keyboard exercises. Take over 100+ courses by expert instructors on topics such as importing data, data visualization or machine learning and learn faster through immediate and personalised feedback on every exercise.”
Office hours
See the website of Claudio Sartori
SDGs


This teaching activity contributes to the achievement of the Sustainable Development Goals of the UN 2030 Agenda.