Data Analytics

People

Course lecturer

Assistant

Ríssola E. A.

Assistant

Description

The course covers mining very large datasets: analysing them to produce descriptive summaries of their content, test hypotheses, and extract valuable knowledge. Unlike other data mining courses, this one deals with datasets whose large size, high update rate, and variety of content (all characteristics of Big Data) mean they cannot be mined with standard techniques. The course therefore covers topics such as similarity measures for very large datasets, data streams, link analysis, clustering, and recommender systems. It is complemented by a practical part in which, using statistical packages for Python (a language students are assumed to already know), we will learn how to analyse large datasets in practice and how to interpret, visualise, and diagnose the results and potential problems of a data analysis.

REFERENCES

Required: Jure Leskovec, Anand Rajaraman and Jeffrey David Ullman. Mining of Massive Datasets (2nd edition). Cambridge University Press, 2014.

Other books will be suggested during the course; they are not required and can be found in the university library.

Programme

Master of Science in Artificial Intelligence, Core course, Lecture, 1st year
Master of Science in Informatics, Elective course, Lecture, 1st year
Master of Science in Informatics, Elective course, Lecture, 2nd year
PhD in Informatics, Elective course, Lecture, 1st year (4.0 ECTS)
PhD in Informatics, Elective course, Lecture, 2nd year (4.0 ECTS)

Information

Semester

Spring

Academic year

2019-2020

ECTS

6.0
