
Data Analytics

Description

The course deals with mining very large datasets: analysing them to produce descriptive summaries of their content, test hypotheses, and extract valuable knowledge. Unlike other data mining courses, this one deals with datasets that, because of their large size, rapid rate of update, and variety of content (the so-called Big Data), cannot be mined with standard techniques. The course therefore covers topics such as similarity measures for very large datasets and data streams, link analysis, clustering, recommender systems, and MapReduce. It is complemented by a practical part in which, using statistical packages for Python (a language students are assumed to already know), we learn how to carry out practical analyses of large datasets and how to interpret, visualise, and diagnose the results and potential problems of a data analysis.

REFERENCES

  • Required: Jure Leskovec, Anand Rajaraman and Jeffrey David Ullman. Mining of Massive Datasets (2nd edition). Cambridge University Press, 2014.

People

Crestani F.

Course director

Chakraborty M.

Assistant

Ríssola E. A.

Assistant

Additional information

Semester
Spring
Academic year
2018-2019
ECTS
6
Language
English
Study programmes
Master of Science in Artificial Intelligence, Core course, Course, 1st year

Master of Science in Informatics, Elective course, Course, 1st year

Master of Science in Informatics, Elective course, Course, 2nd year

PhD in Computer Science, Elective course, Course, 1st year (4 ECTS)

PhD in Computer Science, Elective course, Course, 2nd year (4 ECTS)