The course helps students understand techniques for the indexing, retrieval, filtering, clustering, presentation, and evaluation of textual information held in digital archives, on the web, and in information systems. It complements the earlier course on Data Management, which deals only with structured information.
More and more information is now available in unstructured or poorly structured form. Examples include textual documents, web pages, videos, photos, music, and blogs. The goal of this course is to enable the student to understand the foundations of managing unstructured or poorly structured information.
The course consists of theoretical lectures and practical sessions. The practical sessions deal with the design, implementation, and evaluation of an information retrieval system for a small or medium-sized collection of documents.
Assessment will consist of three theoretical tests and one course project carried out during the semester. There is no final exam.