ADVANCED MACHINE LEARNING AND KNOWLEDGE DISCOVERY
Modulo KNOWLEDGE DISCOVERY

Academic Year 2022/2023 - Docente: CONCETTO SPAMPINATO

Risultati di apprendimento attesi

The course covers the fundamental concepts of deep learning methods and how to use them for extracting, modelling and visualizing the learned knowledge.

Topics include: neural networks with backpropagation, convolutional neural networks, recurrent neural networks, methods for representation learning, and how to use them under different learning regimes (supervised, unsupervised and reinforcement learning) and in variety of real-world applications ranging from computer vision, machine translation and medical image analysis.

The learning objectives are:

to understand and use the main methodologies and techniques for learning from data
to understand the main methodologies to design and implement neural networks for real-world applications
to understand how to extract and learn knowledge in scenarios when supervision cannot be provided
to understand and foresee the reliability of machine learning methods in operational scenarios.

Knowledge and understanding

To understand the main concepts of learning from data
To understand concepts and tools for building intelligent systems using supervision and no supervision
To understand the most important machine learning and artificial intelligence methodologies and techniques used by industries to make sense of data in order to support the decision process
To understand what are the most appropriate techniques to be used in different real-world applications

Applying knowledge and understanding

To be able to effectively understand and use the main tools for creating, loading and manipulating datasets.
To design and implement from scratch a machine learning system following application-derived constraints in terms of modelling and data
To understand proper benchmarks and baselines and analysing achieved results and their generalization in real-world applications
To be able to apply methodologies and techniques to analyse data.

Course Structure

The main teaching methods are as follows:

Lectures, to provide theoretical and methodological knowledge of the subject;
Hands-on exercises, to provide “problem solving” skills and to apply design methodology;
Laboratories, to learn and test the usage of related tools.
Paper reading and presentations to enhance understanding of the core concepts
Seminars by renowned experts (from both universities and industries) in the field to understand the current state of the art.

Should teaching be carried out in mixed mode or remotely, it may be necessary to introduce changes with respect to previous statements, in line with the programme planned and outlined in the syllabus.

Required Prerequisites

Python programming language, statistical learning basic concepts

Attendance of Lessons

Strongly recommended. Attending and actively participating in the classroom activities will contribute positively towards the overall assessment of the final exam.

Detailed Course Content

The KD module consists of two parts: the first one will be addressing the general and modern techniques based on deep learning paradigm to create KD systems from data, while the second one on how to extract, represent and visualize knowledge from data and trained models.

Part I: Methods and Architectures

Neural Networks and Backpropagation

Derivatives and Gradient Descent
Neural Network Representation, Gradient descent for Neural Networks
Forward and Back Propagation
The revolution of depth: deep learning
Optimization algorithms: Mini-batch gradient descent, Exponentially weighted average, Gradient descent with momentum, RMSprop, Adam optimization algorithm, Learning rate decay
Training aspects of deep learning: Regularization, Dropout, Normalizing inputs, Vanishing / Exploding gradients, Weight Initialization for Deep Networks

Convolutional Neural Networks

Foundations: padding, strided convolution, dilation, 2D and 3D convolution, pooling
State of the art models: AlexNet, ResNets, DenseNets, Inception
Transfer Learning and Data Augmentation

Recurrent Neural Networks

LSTM and variants
Attention mechanisms

Part II: Knowledge Discovery from Data and Models

Unsupervised Learning with Deep Networks

Representation and Feature Learning
Autoencoders and Variational Autoencoders
Generative Adversarial Networks

Explainable AI

Principles of explainability vs interpretability
Post-hoc explanatory methods (e.g., IG and CAM)
Model agnostic (e.g., SHAP)

Deep Learning Frameworks:

Overview of the most used DL frameworks
PyTorch and Jupyter Notebooks

Applications:

Computer vision
Medical Image Analysis
Machine translation

Textbook Information

Deep Learning. I. Goodfellow, Y. Bengio and A. Courville, MIT Press, 2016
Programming PyTorch for Deep Learning, I. Pointer, O'Reilly Media
Teaching materials and reading paper list provided by the instructor

Autore	Titolo	Editore	Year	ISBN
I. Goodfellow, Y. Bengio and A. Courville	Deep Learning	MIT PRESS	2016	978-0262035613
I. Pointer	Programming PyTorch for Deep Learning	O'Reilly Media	2019	1492045357

Course Planning

	Subjects	Text References
1	Neural networks: derivatives, gradient descent, back-propagation	3
2	Deep Learning: basic concepts, optimization algorithms, training procedures	1,3
3	Convolutional Neural Networks	1,3
4	Explainable AI	3
5	Recurrent Neural Networks	1,3
6	Unsupervised Learning with Deep Networks: Representation and Feature Learning	1,3
7	Autoencoders and Variational Autoencoders	1,3
8	Generative Adversarial Networks	1,3
9	Deep Learning Frameworks: PyTorch and Jupyter Notebooks	2,3

Learning Assessment

Learning Assessment Procedures

The final exam consists of the development of a project in Pytorch, addressing one of the topics discussed during classes, together with a final report (structured as a scientific paper) discussing motivation, models, datasets and results used in the project.

The exam is evaluated according to the ability to create a deep learning model from scratch for extracting and learning knowledge from data on a given real-world problem, to understand how to properly measure its performance and to motivate the devised solutions.

The grading policy for the KD module is:

70%: Final project
30%: Theory test

The vote on the knowledge discovery module will account for 50% of the total grade for the entire course.

The module also foresees intermediate assignments only for students attending the course. These assignments include: a) between 2 and 3 homeworks regarding python scripts to solve simple basic learning problems on datasets discussed during with the instructor and b) a theory test to verify the correct understanding of the presented techniques. The grading policy is this case is the following one

55%: Homeworks
35%: Theory test
10%: Class attendance

Learning assessment may also be carried out on line, should the conditions require it.

Examples of frequently asked questions and / or exercises

Examples of questions and exercises are available on the Studium platform and on the course website.

ENGLISH VERSION

Degree Course in

Data Science for Management

ADVANCED MACHINE LEARNING AND KNOWLEDGE DISCOVERY
Modulo KNOWLEDGE DISCOVERY

Risultati di apprendimento attesi

Course Structure

Required Prerequisites

Attendance of Lessons

Detailed Course Content

Textbook Information

Course Planning

Learning Assessment

Learning Assessment Procedures

Examples of frequently asked questions and / or exercises

Dipartimento di

Economia e Impresa

Dove siamo

Contatti

Degree Course in

Data Science for Management

ADVANCED MACHINE LEARNING AND KNOWLEDGE DISCOVERYModulo KNOWLEDGE DISCOVERY

Risultati di apprendimento attesi

Course Structure

Required Prerequisites

Attendance of Lessons

Detailed Course Content

Textbook Information

Course Planning

Learning Assessment

Learning Assessment Procedures

Examples of frequently asked questions and / or exercises

Dipartimento di

Economia e Impresa

ADVANCED MACHINE LEARNING AND KNOWLEDGE DISCOVERY
Modulo KNOWLEDGE DISCOVERY