Skip to main content
You are browsing the curriculum of an upcoming academic year (2024–2025).
Do you want to change to the ongoing academic year?
Course unit, curriculum year 2024–2025
DATA.ML.370

Mining of Big Datasets, 5 cr

Tampere University
Teaching periods
Active in period 3 (1.1.2025–2.3.2025)
Active in period 4 (3.3.2025–31.5.2025)
Course code
DATA.ML.370
Language of instruction
English, Finnish
Academic years
2024–2025, 2025–2026, 2026–2027
Level of study
Intermediate studies
Grading scale
General scale, 0-5
Persons responsible
Responsible teacher:
Tarmo Lipping
Responsible organisation
Faculty of Information Technology and Communication Sciences 100 %
Coordinating organisation
Computing Sciences Studies 100 %
Core content
  • The concept and terminology of data mining.
  • Understanding the principles of processing large, non-structured datasets.
  • Basic methods and algorithms for the analysis of large datasets
  • Common tasks of mining large datasets such as similarity analysis, link analysis, finding frequent itemsets, clustering
  • Common applications of mining large datasets such as recommendation systems, web search, mining of social network graphs
Complementary knowledge
  • Mining data streams
  • Special challenges of processing large datasets: memory usage and data formats.
  • Deep learning methods in mining large datasets
Specialist knowledge
  • Mapreduce algorithm.
  • Locality-sensitive hashing
  • Distance measures
  • More advanced algorithms for mining large datasets
Learning outcomes
Prerequisites
Further information
Learning material
Studies that include this course
Completion option 1
The course will involve exercises and Teams discussions

Independent study

07.01.2025 30.05.2025
Active in period 3 (1.1.2025–2.3.2025)
Active in period 4 (3.3.2025–31.5.2025)