Seoul National University
M1522.000900 Data Mining
Spring 2018 - U Kang

News and Announcements

Course Information

Data mining refers to theories and techniques for finding useful patterns from massive amount of data. Data mining has been used in high impact applications including web analysis, fraud detection, recommendation system, cyber security, etc. This course covers important algorithms and theories for data mining. Main topics include mapreduce, finding similar items, mining frequent patterns, link analysis, data stream mining, clustering, graphs, and mining big data.

Schedule

Date Topic Notice
Mar 5 Introduction
7 Basics (Chapter 1)
12 Map-Reduce and the New Software Stack (Chapter 2)
14 Map-Reduce and the New Software Stack-2 (Chapter 2) HW1 out (due: Mar. 28)
19 Finding Similar Items (Chapter 3)
21 Finding Similar Items-2 (Chapter 3): use the previous slide
26 Finding Similar Items-3 (Chapter 3): use the previous slide HW2 out (due: Apr. 2)
28 Mining Data Streams (Chapter 4)
Apr. 2 Mining Data Streams-2 (Chapter 4)
4 Mining Data Streams-3 (Chapter 4) HW 3 out (due: Apr. 11)
9 Link Analysis (Chapter 5)
11 Link Analysis-2 (Chapter 5)
16, 18 Midterm
23 Link Analysis-3 (Chapter 5) HW 4 out (due: May 2)
25 Frequent Itemsets (MMDS Chapter 6)
30 Frequent Itemsets-2 (MMDS Chapter 6)
May 2 Clustering (MMDS Chapter 7) HW 5 out (due: May 9)
7 Clustering (MMDS Chapter 7): use the previous slide
9 Clustering-2 (MMDS Chapter 7)
14 Advertising on the Web (MMDS Chapter 8)
16 Advertising on the Web-2 (MMDS Chapter 8) HW 6 out (due: May 23)
21 Recommendation (MMDS Chapter 9)
23 Recommendation-2 (MMDS Chapter 9) HW 7 out (due: May 30)
28 Mining Social-Network Graphs (MMDS Chapter 10)
30 Mining Social-Network Graphs-2 (MMDS Chapter 10) HW 8 out (due: June 11)
4 Dimensionality Reduction (MMDS Chapter 11)
June 6 Dimensionality Reduction-2 (MMDS Chapter 11) HW 9 out (due: June 13)
11 Conclusion
13 Final

Grading

Late policy - for all deliverables:

Textbook

The text book is
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman. (available in online)

Prerequisite

We expect you to have an undergraduate-level knowledge on the following topics: We provide some background, but the class will be fast paced.
Last modified Mar. 25, 2018, by U Kang