Seoul National University
M1522.001400 Introduction to Data Mining
Spring 2016 - U Kang

News and Announcements

Course Information

Data mining refers to theories and techniques for finding useful patterns from massive amount of data. Data mining has been used in high impact applications including web analysis, fraud detection, recommendation system, cyber security, etc. This course covers important algorithms and theories for data mining. Main topics include mapreduce, finding similar items, mining frequent patterns, link analysis, data stream mining, clustering, graphs, and mining big data.

Schedule

Date Topic Notice
Mar 2 Introduction
7 Basics (Chapter 1)
9 Map-Reduce and the New Software Stack (Chapter 2)
14 Map-Reduce and the New Software Stack-2 (Chapter 2) Hw1 out (due: Mar 23)
16 Map-Reduce and the New Software Stack-2 (Chapter 2): use previous slide
21 Finding Similar Items (Chapter 3)
23 Finding Similar Items-2 (Chapter 3): use previous slide Hw2 out (due: Mar 30)
28 Mining Data Streams (Chapter 4)
30 Mining Data Streams-2 (Chapter 4)
Apr 4 Mining Data Streams-3 (Chapter 4)
6 Mining Data Streams-3 (Chapter 4): use previous slide Hw3 out (due: Apr 13)
11 Link Analysis (Chapter 5)
13 Link Analysis-2 (Chapter 5)
18 Link Analysis-3 (Chapter 5)
20 Midterm exam
25 Link Analysis-3 (Chapter 5): use previous slide Hw4 out (due: May 2)
27 Frequent Itemsets-1 (Chapter 6)
May 2 Frequent Itemsets-2 (Chapter 6) HW5 out (due: May 9)
4 Clustering-1 (Chapter 7)
9 Clustering-2 (Chapter 7) HW6 out (due: May 16)
11 Advertising on the Web (Chapter 8)
16 Advertising on the Web (Chapter 8): use previous slide HW7 out (due: May 23)
18 Recommendation System-1 (Chapter 9)
23 Recommendation System-2 (Chapter 9) HW8 out (due: May 30)
25 Graphs-1 (Chapter 10)
30 Graphs-2 (Chapter 10) HW9 out (due: June 8)
June 1 Dimensionality Reduction-1 (Chapter 11)
8 Dimensionality Reduction-2 (Chapter 11) HW10 out (due: June 15)
13 Final exam

Grading

Late policy - for all deliverables:

Textbook

The text book is
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman. (available in online)

Prerequisite

We expect you to have an undergraduate-level knowledge on the following topics: We provide some background, but the class will be fast paced.
Last modified May 24, 2016, by U Kang