- Docente: Matteo Amabili
- Credits: 6
- SSD: SECS-S/06
- Language: English
- Moduli: Matteo Amabili (Modulo 1) Antonio Petruccelli (Modulo 2)
- Teaching Mode: Traditional lectures (Modulo 1) Traditional lectures (Modulo 2)
- Campus: Bologna
- Corso: Second cycle degree programme (LM) in Greening Energy Market and Finance (cod. 5885)
-
from Feb 13, 2025 to Mar 14, 2025
Learning outcomes
At the end of the course students will have a good knowledge of machine learning and Artificial Intelligence. They will be able to apply these techniques to problem related to climate change and sustainability.
Course contents
- Intoduction to ML
- 1.1.What is ML: a shift from knowledge to data
- 1.2.Kind of problems
- 1.2.1.supervised versus unsupervised
- 1.2.2.regression vs classification
- 1.3.Data pipeline
- 1.4.Python Basics
- Data Preprocessing
- 2.1.Data Normalizzation
- 2.3.Categorical variables: ordinal and non-ordinal
- 2.4.Outliers
- 2.5.Feature Engineering
- 2.6.Dimensionality reduction:PCA
- 2.7.Examples in python: sklearn
- 2.1.Data Normalizzation
- Linear Regression
- 3.1.Estimating the coefficients: Least Square Method & maximum likehood
- 3.2.Performance metrics
- 3.3.Interpreting the coefficients
- 3.4.The problem of Collinearity
- 3.5.Selecting the relevant variables: Lasso/Ridge regression
- 3.6.Kernel Regression
- 3.7.Pyhton Hand-on
- Logistic Regression
- 4.1.Problem Definition
- 4.2.Estimating the coefficients: gradient descent
- 4.3.Classification Metrics:
- 4.3.1.Precision
- 4.3.2.Recall
- 4.3.3.F-beta score
- 4.3.4.Area Under tre ROC curve
- 4.4.Interpreting the coefficients
- 4.5.Generalized linear model: Poisson regression
- 4.6.Multilabel case
- 4.7.Python hands-on
- Evaluate a Model
- 5.1.Cross-validation & hyper parameter tuning
- 5.2.Bias Variance trade-off
- 5.3.Simple cross-validation
- 5.4.N-fold cross-validation
- 5.5.Python hands-on
- Tree Based Method
- 6.1.Simple Cart for regression and classification
- 6.2.Ensample methods: Random Forest
- 6.3.Boosting methods
- 6.4.Python hands-on
- Unsupervised learning
- 7.1.Problems
- 7.2.K-means
- 7.3.Density-Based Model: DBSCAN
- 7.5.Remove outliers using unsupervised methods
- 7.6.Python Hands-on
The second part of the course (held by Prof. Petruccelli) is dedicated to the machine learning application to natural risk. Here is the contents:
1. Introduction to Natural Risks and Climate Change
- Usingtrees-based methods on natural disasters datasets
2. Geospatial Data Analysis
- Managing georeferenced data using appropriate libraries
3. Machine Learning in Weather and Climate
- Using pre-trained models for short-mid-range weather predictions
4. Extreme Events Analysis
- Performing analysis on extreme events using both classical and data-driven methods
5. AI for DisasterManagement
- Using computer vision and NLP algorithms to manage natural disasters use-cases
6. AI for Urban Vulnerability
- Using neural networks to segment and detect urban buildings
Readings/Bibliography
- James, Gareth, et al. An introduction to statistical learning. Vol. 112. New York: springer, 2013.
- Hastie, Trevor, et al. The elements of statistical learning: data mining, inference, and prediction. Vol. 2. New York: springer, 2009.
- Rogers, Simon, and Mark Girolami. A first course in machine learning. Chapman and Hall/CRC, 2016.
Teaching methods
Frontal lessons
Assessment methods
The final exam will consists of a Machine learning project. During the exam, the student will have to present the developed project and discuss its main aspects as well as the underlying theory.
Teaching tools
- Slides (power point/pdf)
- Selected literature
- Jupyter Notebook and Python Code
Office hours
See the website of Matteo Amabili
See the website of Antonio Petruccelli