|
Course Catalog 2014-2015
PLA-43106 Data Mining, 5 cr |
Person responsible
Pekka Ruusuvuori, Teemu Kumpumäki
Lessons
Study type | P1 | P2 | P3 | P4 | Implementations | Lecture times and places |
|
|
|
|
|
|
Learning Outcomes
The course gives an introduction to data mining and analysis of large datasets. For example, networks and databases involve massive amounts of data, and mining of useful information from data is an increasingly common challenge. By taking the course, the student learns the basic principles and terminology of data mining, knows the commonly used algorithms, and recognizes the typical challenges of processing large datasets. The course will introduce several application areas of data mining, including the principles of web search engines, recommendation systems, and web advertising.
Content
Content | Core content | Complementary knowledge | Specialist knowledge |
1. | The concept and terminology of data mining. | Knowledge of the basic methods and algorithms. | Knowledge of the limitations of data mining. |
2. | Understanding the special principles of processing large, non-structured datasets. | Special challenges of processing large datasets: memory usage and data formats. | Mapreduce algorithm. |
3. | Basic principles of web search engines. |
Study material
Type | Name | Author | ISBN | URL | Edition, availability, ... | Examination material | Language |
Book | Mining of Massive Datasets | A. Rajaraman, J. Leskovec, J.D. Ullman | Yes | English | |||
Lecture slides | P. Ruusuvuori | Yes | English |
Prerequisite relations (Requires logging in to POP)
Correspondence of content
There is no equivalence with any other courses
More precise information per implementation
Implementation | Description | Methods of instruction | Implementation |
Documents