Seoul National University
M1522.000900 Data Mining
Spring 2017 - U Kang

News and Announcements

Course Information

Data mining refers to theories and techniques for finding useful patterns from massive amount of data. Data mining has been used in high impact applications including web analysis, fraud detection, recommendation system, cyber security, etc. This course covers important algorithms and theories for data mining. Main topics include mapreduce, finding similar items, mining frequent patterns, link analysis, data stream mining, clustering, graphs, and mining big data.


Date Topic Notice
Mar. 6 Introduction
8 Basics (Chapter 1)
13 Map-Reduce and the New Software Stack (Chapter 2)
15 Map-Reduce and the New Software Stack 2 (Chapter 2)
20 Map-Reduce and the New Software Stack 2 (Chapter 2): use the previous slide HW1 out (due: Apr. 3)
22 Finding Similar Items (Chapter 3)
27 Finding Similar Items (Chapter 3): use the previous slide HW2 out (due: Apr. 5)
29 Mining Data Streams (Chapter 4)
Apr. 3 Mining Data Streams (Chapter 4): use the previous slide
5 Mining Data Streams-2 (Chapter 4)
10 Mining Data Streams-3 (Chapter 4) HW3 out (due: Apr. 17)
12 Link Analysis (Chapter 5)
17 Link Analysis-2 (Chapter 5)
19 Midterm
24 Link Analysis-3 (Chapter 5) HW4 out (due: May 1)
26 Frequent Itemsets (Chapter 6)
May 1 Frequent Itemsets-2 (Chapter 6) HW5 out (due: May 8)
3 Clustering (Chapter 7)
8 Clustering (Chapter 7): use the previous slide
10 Clustering-2 (Chapter 7) HW6 out (due: May 17)
15 Advertising on the Web (Chapter 8)
17 Advertising on the Web-2 (Chapter 8) HW7 out (due: May 24)
22 Recommendation (Chapter 9)
24 Recommendation-2 (Chapter 9) HW8 out (due: May 31)
29 Mining Social-Network Graphs (Chapter 10)
31 Mining Social-Network Graphs-2 (Chapter 10)
June 5 Dimensionality Reduction (Chapter 11) HW9 out (due: June 12)
7 Dimensionality Reduction-2 (Chapter 11) HW 10 out (due: June 16)
12 Conclusion
14 Final


Late policy - for all deliverables:


The text book is
Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman. (available in online)


We expect you to have an undergraduate-level knowledge on the following topics: We provide some background, but the class will be fast paced.
Last modified Apr. 3, 2017, by U Kang