ADVANCED MACHINE LEARNING AND KNOWLEDGE DISCOVERY
Modulo ADVANCED MACHINE LEARNING

Academic Year 2022/2023 - Docente: Vincenza CARCHIOLO

Risultati di apprendimento attesi

The module will focus on the implementations of various machine learning techniques and their applications in various domains. The primary tools used in the class are the Python programming language and several associated libraries.

Course Structure

Lectures, hands-on exercises, paper reading, student presentations and seminars 

Should teaching be carried out in mixed mode or remotely, it may be necessary to introduce changes with respect to previous statements, in line with the programme planned and outlined in the syllabus.

Required Prerequisites

Python programming language, Linear Algebra

Attendance of Lessons

Strongly recommended. Attending and actively participating in the classroom activities will contribute positively towards the overall assessment of the oral exam.

Detailed Course Content

Introduction to the Course

  • Introduction to Machine Learning

  •  Review of data Characteristics of Data and Preparation and Preprocessing

 

Supervised Learning

  • Classification and Prediction using K-Nearest-Neighbor

  • Classifying with Probability Theory; Naïve Bayes

  • Building Decision Trees

  • Regression models

  • Evaluating predictive models

  • Ensemble Models: Bagging and Boosting

Unsupervised Learning

  • Clustering using K-Means

  • Hierarchical Clustering

  • Association Rule discovery 

  • Principal Component Analysis and Dimensionality Reduction

  • Singular Value Decomposition

Brief note on Advance Topics 

  • Matrix Factorization

  • Support Vector Machines

  • Search and Optimization Techniques

  • Markov models; time series analysis, sequential pattern mining

Real application domains

  • Text Mining and document analysis/filtering

    • Content analysis, TFxIDF transformation, text categorization, document clustering

  • Recommender systems

    • Neighborhood methods (user- and item-based)

    • Matrix factorization

    • Marketing and finance data analysis

  • Laboraroty activity
    • Brief Python review

    • The package Numpy, Pandas , matplotlib and seaborn

    • Scikit-learn: a machine learning library for Python

      • Classification, Regression, Clustering, Dimensionality Reduction, Model Selection, Preprocessing

Textbook Information

  1. Introduction to Machine Learning, Fourth Edition, By Ethem Alpaydin, MitPress ISBN: 9780262043793. 2020

  2. Python Data Science Essentials - Third Edition by Alberto Boschetti, Luca Massaron, Packt Publishing, ISBN: 9781789537864, 2020

  3. Teaching materials and reading paper list provided by the instructor

Course Planning

 SubjectsText References
1Introduction to Machine Learning1,3
2Python review3
3Pandas, Numpy, Matplotlib2,3
4Classification and Prediction: K-Nearest-Neighbor1
5Classification and Naive Bayes1
6Decision Tree1,3
7Regression models 1
8Evaluating predictive models1
9Ensemble Models: Bagging and Boosting1
10Clustering using K-Means1
11Hierarchical Clustering1
12Association Rule discovery 1
13Dimensional reduction1
14Singular Value Decomposition1
15Advanced topic1,3
16Sckit learn3
17ML in NLP3
18ML for Reccomender System3

Learning Assessment

Learning Assessment Procedures

There will be one assignment and one final exam. The assignments will contain written questions that require some Python programming. The final exam consists  a final assignment and an oral discussion concerning all course material.  

The final assignment concerns comparative analysis on a given problem that must be presented in a final report and discussed in an oral discussion. The vote on the advanced machine learning module will account for 40% of the total grade for the entire course.

The grading policy for the AML module is:

  • 40%: Final assignments

  • 20% Intermediate assignments

  • 40%: Oral discussion

Learning assessment may also be carried out on line, should the conditions require it.

Examples of frequently asked questions and / or exercises

Examples of questions and exercises are available on the Studium platform and on the course website.

ENGLISH VERSION