SGN-43006 Knowledge Mining and Big Data, 5 cr

Additional information

Lectures in English

Suitable for postgraduate studies.

Person responsible

Ari Visa

Lessons

Implementation Period Person responsible Requirements
SGN-43006 2018-01 4 Ari Visa
Assignment and final examination.

Learning Outcomes

Learning outcomes: The student can describe the difference between data and knowledge mining. The student can list and describe OLAP, association, predictive modeling, modeling, regression analysis and cluster analysis. The student can analyse the own problem and apply the lectured method on it. The student is capable to analyse the proposed solutions.

Content

Content Core content Complementary knowledge Specialist knowledge
1. Concept Description  Data preprocessing Data Generalization Summarization-Based Characterization Analyzing of Attribute Relevance   
2. Mining Association Rules  Mining Single-Dimensional Boolean Association Rules, and Multilevel Association Rules, and Multidimensional Association Rules Correlation Analysis    
3. Descriptive Models  Cluster Analysis Describing Data by Probability Distributions and Densities   Parametric models Nonparametric models 
4. Predictive Models  Regression models Stochastic models Predictive models for classification Models for structured data   

Instructions for students on how to achieve the learning outcomes

The examination is based on the final exam and an exercise work. The grading of the execise work is pass/fail.

Assessment scale:

Numerical evaluation scale (0-5)

Partial passing:

Completion parts must belong to the same implementation

Study material

Type Name Author ISBN URL Additional information Examination material
Book   "Data Mining: Concepts and Techniques"   Jiawei Han & Micheline Kamber       Morgan Kaufmann Publisher, 2000   Yes   
Book   "Principles of Data Mining"   David J. Hand, Heikki Mannila and Padhraic Smyth       MIT Press, 2000   Yes   

Additional information about prerequisites
It is adviceable to have some programming knowledge.



Correspondence of content

Course Corresponds course  Description 
SGN-43006 Knowledge Mining and Big Data, 5 cr SGN-5306 Knowledge Mining, 3 cr  

Updated by: Ketola Susanna, 09.03.2018