7305021
TOPICS IN APPLIED MATHEMATICS,
TOPICS IN APPLIED MATHEMATICS, 2-3 ov
Lecturer info
Professor ESKO TURUNEN
Lectures and exercises:
Lectures and exercise hours total: 28 h.
Weekly teaching / period |
|
|
|
|
|
Lectures (h): |
- |
- |
- |
- |
- |
Exercises (h): |
- |
- |
- |
- |
- |
Content of the course
GUHA - a data mining method
GUHA (General Unary Hypotheses Automaton) is a method of automatic generation of hypothesis based on empirical data, thus a method of data mining. GUHA is primary suitable for exploratory analysis of large data; the processed data form a rectangle matrix, whose rows correspond to objects belonging to the sample and each column correspond to one investigated variable. A typical data matrix processed by GUHA has hundreds or thousands of rows and tens of columns. The aim in the course is to study the theoretical basis of GUHA, a logical justification for statistical reasoning, and to analyse a 'real world' data matrix possibly introduced by the student himself.
Requirements
Examination and written exercises.
Literature
Hajek P., Havranek T.: Mechanising hypothesis formation - Mathematical foundations for a general theory, Springer Verlag 1978. (Downloadable freely from http://www.cs.cas.cz/~hajek/guhabook/index.html)
Notes
The course starts in May (Summer 2004), lectures in English.