Today more and more information is becoming available in unstructured or poorly structured form. Examples of information of this type are textual documents, web pages, videos, images, sounds, blogs, etc. The goal of this course is to enable the student to understand the foundations of managing unstructured or poorly structured information. In particular, the aim is to assist students to get an understanding of some of the techniques for the indexing, retrieval, filtering, clustering, and presentation of textual and multimedia information held in digital archives and/or on the web. From this perspective the course complements what the student learned from a previous course on Database technology, where only structured information is managed. The course will be complemented by practical sessions dealing with the design, implementation, and evaluation of information retrieval systems for medium size collections of documents.
- Required: W.B. Croft, D. Metzler, and T. Strohman. Search Engines: Information Retrieval in Practice, Pearson, 2009.
- Other books will be suggested during the course, but are not required and could be found in the university library.