U Kang
U Kang
Associate Professor
Data Mining Lab
Department of Computer Science and Engineering
Seoul National University
1 Gwanak-ro, Gwanak-gu, Seoul,
Republic of Korea 08826

Email: ukang at snu.ac.kr
Office: Room 301-502

I am an associate professor in the Department of Computer Science and Engineering at Seoul National University. Here is my short biography and full Curriculum Vitae.
I lead the Data Mining Lab in Department of Computer Science and Engineering.

(Announcement)

  1. I am looking for motivated Ph.D. student, MS students, and postdocs. If interested, send me an email with your full Curriculum Vitae and transcript.
  2. I am also looking for undergraduate students who are interested in data mining, machine learning, and artificial intelligence. If interested, send me an email with your full Curriculum Vitae and transcript.
  3. Our lab holds the weekly Data Mining Seminar which is open to faculty, graduate, and undergraduate students interested in data mining. Free pizza or sandwitches are provided in the seminar!

What's New

Courses

Research Interests

Big Data Mining and Machine Learning: models, algorithms, and systems for scalable data analysis with applications on knowledge discovery, prediction, and anomaly detection.

Awards and Honors

     (My Awards)      (My Students' Awards)

Software

I lead the development of PEGASUS (Peta-Scale Graph Mining System), an award-winning open-source software for mining very large graphs in Hadoop platform.

I lead the development of BigTensor, a large scale multi-dimensional array (tensor) mining package on top of Spark and Hadoop.

I lead the development of TegViz, a distributed tera-scale graph generation and visualization tool.

Talks and Tutorials

Talks and tutorials by U Kang

Press

Articles (in Korean)

Publications

(Google Scholar, DBLP)

2018

2017

  • A Comparative Study of Matrix Factorization and Random Walk with Restart in Recommender Systems.
    Haekyu Park, Jinhong Jung, and U Kang.
    IEEE International Conference on Big Data (BigData) 2017, Boston, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Supervised Belief Propagation: Scalable Supervised Inference on Attributed Networks.
    Jaemin Yoo, Saehan Jo, and U Kang.
    IEEE International Conference on Data Mining (ICDM) 2017, New Orleans, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • BePI: Fast and Memory-Efficient Method for Billion-Scale Random Walk with Restart.
    Jinhong Jung, Namyong Park, Lee Sael, and U Kang.
    ACM International Conference on Management of Data (SIGMOD) 2017, Chicago, IL, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Fast and Scalable Distributed Boolean Tensor Factorization.
    Namyong Park, Sejoon Oh and U Kang.
    IEEE International Conference on Data Engineering (ICDE) 2017, San Diego, CA, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • A New Question Answering Approach with Conceptual Graphs.
    Kyung-Min Kim, Jinhong Jung, Jihee Ryu, Ha-Myung Park, Joseph P.Joohee, Seokwoo Jeong, U Kang, and Sung-Hyon Myaeng.
    Conférence en Recherche d’Information et Applications (CORIA) 2017, Marseille, France.
    [PDF] [BIBTEX]

  • Time-weighted Counting for Recently Frequent Pattern Mining in Data Streams
    Yongsub Lim, and U Kang.
    Knowledge and Information Systems (KAIS), vol. 53, no. 2, pp. 391-422, Sep. 12 2017.
    [PDF] [BIBTEX]

  • Fully Scalable Methods for Distributed Tensor Factorization
    Kijung Shin, Lee Sael, and U Kang.
    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 29, no. 1, pp. 100-113, Jan. 1 2017.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

2016

  • Partition Aware Connected Component Computation in Distributed Systems.
    Ha-Myung Park, Namyong Park, Sung-Hyon Myaeng, and U Kang.
    IEEE International Conference on Data Mining (ICDM) 2016, Barcelona, Spain.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Personalized Ranking in Signed Networks using Signed Random Walk with Restart.
    Jinhong Jung, Woojung Jin, Lee Sael, and U Kang.
    IEEE International Conference on Data Mining (ICDM) 2016, Barcelona, Spain.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • BIGtensor: Mining Billion-Scale Tensor Made Easy
    Namyong Park, Byungsoo Jeon, Jungwoo Lee, and U Kang.
    ACM International Conference on Information and Knowledge Management (CIKM) 2016, Indianapolis, Indiana, USA.
    (Demo Paper)
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • PIN-TRUST: Fast Trust Propagation Exploiting Positive, Implicit, and Negative Information
    Min-Hee Jang, Christos Faloutsos, Sang-Wook Kim, U Kang, and Jiwoon Ha.
    ACM International Conference on Information and Knowledge Management (CIKM) 2016, Indianapolis, Indiana, USA.
    [PDF] [BIBTEX]

  • PTE: Enumerating Trillion Triangles On Distributed System
    Ha-Myung Park, Sung-Hyon Myaeng, and U Kang.
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2016, San Francisco, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • M-Flash: Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
    Hugo Gualdron, Robson Cordeiro, Jose Rodrigeus-Jr, Duen Horng (Polo) Chau, Minsuk Kahng, and U Kang.
    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD) 2016, Riva Del Garda, Italy.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • MTP: Discovering High Quality Partitions in Real World Graphs
    Yongsub Lim, Won-Jo Lee, Ho-Jin Choi, and U Kang.
    World Wide Web Journal
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Mining Billion-Scale Tensors: Algorithm and Discoveries
    Inah Jeon, Evangelos E. Papalexakis, Christos Faloutsos, Lee Sael, and U Kang.
    VLDB Journal, vol. 25, issue 4, pp. 519-544, August 2016.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • SCouT: Scalable Coupled Matrix-Tensor Factorization - Algorithm and Discoveries
    ByungSoo Jeon, Inah Jeon, Lee Sael, and U Kang.
    IEEE International Conference on Data Engineering (ICDE) 2016, Helsinki, Finland
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Random Walk with Restart on Large Graphs Using Block Elimination
    Jinhong Jung, Kijung Shin, Lee Sael, and U Kang.
    ACM Transactions on Database Systems (TODS), vol. 41, issue 2, pp. 12:1-12:43, June 2016.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

2015

  • TeGViz: Distributed Tera-Scale Graph Generation and Visualization
    ByungSoo Jeon, Inah Jeon, and U Kang.
    IEEE International Conference on Data Mining (ICDM) 2015, Atlantic City, USA.
    (Demo Paper)
    [PDF] [BIBTEX] [HOMEPAGE (Code)]

  • MASCOT: Memory-efficient and Accurate Sampling for Counting Local Triangles in Graph Streams.
    Yongsub Lim and U Kang.
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2015, Sydney, Australia.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs.
    Kijung Shin, Jinhong Jung, Lee Sael, and U Kang.
    ACM International Conference on Management of Data (SIGMOD) 2015, Melbourne, Australia
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • HaTen2: Billion-scale Tensor Decompositions.
    Inah Jeon, Evangelos E. Papalexakis, U Kang, and Christos Faloutsos.
    31st IEEE International Conference on Data Engineering (ICDE) 2015, Seoul, Korea.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Reverse Nearest Neighbor Search with a Non-spatial Aspect.
    JengHoon Park, Chin-Wan Chung, and U Kang.
    Journal of Information Systems.
    [PDF] [BIBTEX]

  • Summarizing and understanding large graphs.
    Danai Koutra, U Kang, Jilles Vreeken, and Christos Faloutsos.
    Statistical Analysis and Data Mining, doi: 10.1002/sam.11267, 18 May 2015.
    [PDF] [BIBTEX]

  • Fast graph mining with HBase.
    Ho Lee, Bin Shao, and U Kang.
    Information Sciences, vol. 315, pp. 56-66, 10 September 2015.
    [PDF] [BIBTEX]

  • Discovering Large Subsets with High Quality Partitions in Real World Graphs.
    Yongsub Lim, Won-Jo Lee, Ho-Jin Choi, and U Kang.
    2nd International Conference on Big Data and Smart Computing (BigComp) 2015, Jeju Island, Korea.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • An Extension of the Automatic Cross-Association Method with a 3-dimensional Matrix.
    Won-Jo Lee, Chae-Gyun Lim, U Kang, and Ho-Jin Choi.
    2nd International Conference on Big Data and Smart Computing (BigComp) 2015, Jeju Island, Korea.
    [PDF] [BIBTEX]

  • Scalable Tensor Mining.
    Lee Sael, Inah Jeon, and U Kang.
    Big Data Research Journal, Feb. 2015.
    [PDF] [BIBTEX]

2014

  • Distributed Methods for High-dimensional and Large-scale Tensor Factorization.
    Kijung Shin, and U Kang.
    IEEE International Conference on Data Mining (ICDM) 2014, Shenzhen, China.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Eventera: Real-time Event Recommendation System from Massive Heterogeneous Online Media
    Dongyeop Kang, DongGyun Han, NaHea Park, Sangtae Kim, U Kang, and Soobin Lee.
    IEEE International Conference on Data Mining (ICDM) 2014, Shenzhen, China.
    (Demo Paper)
    [PDF] [BIBTEX]

  • Fast, Accurate, and Space-efficient Tracking of Time-weighted Frequent Items from Data Streams.
    Yongsub Lim, Jihoon Choi, and U Kang.
    23rd ACM International Conference on Information and Knowledge Management (CIKM) 2014, Shaghai, China
    [PDF] [BIBTEX]

  • MapReduce Triangle Enumeration With Guarantees.
    Ha-Myung Park, Franceso Silvestri, U Kang, and Rasmus Pagh.
    23rd ACM International Conference on Information and Knowledge Management (CIKM) 2014, Shaghai, China
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • Data/Feature Distributed Stochastic Coordinate Descent for Logistic Regression.
    Dongyeop Kang, Woosang Lim, Kijung Shin, Lee Sael, and U Kang.
    23rd ACM International Conference on Information and Knowledge Management (CIKM) 2014, Shaghai, China
    [PDF] [Supplementary Document] [BIBTEX]

  • MMap: Fast Billion-Scale Graph Computation on a PC via Memory Mapping.
    Zhiyuan Lin, Minsuk Kahng, Kaeser Md. Sabrin, Duen Horng (Polo) Chau, Ho Lee, and U Kang.
    IEEE International Conference on Big Data (BigData) 2014, Washington DC, USA.
    [PDF] [BIBTEX] [HOMEPAGE (Code, Data)]

  • SlashBurn: Graph Compression and Mining beyond Caveman Communities.
    Yongsub Lim, U Kang, and Christos Faloutsos.
    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 26, no. 12, pp. 3077-3089, April 2014.
    [PDF] [BIBTEX] [CODE]

  • Link Prediction Based on Generalized Cluster Information.
    Jungeun Kim, Minsoo Choy, Daehoon Kim, and U Kang.
    23rd International World Wide Web Conference (WWW) 2014, Seoul, Korea.
    (Poster paper)
    [PDF] [BIBTEX]

  • Net-Ray: Visualizing and Mining Billion-Scale Graphs.
    U Kang, Jay-Yoon Lee, Danai Koutra, and Christos Faloutsos.
    Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2014, Tainan, Taiwan.
    [PDF] [BIBTEX] [HOMEPAGE (Code)]

  • VoG: Summarizing and Understanding Large Graphs.
    Danai Koutra, U Kang, Jilles Vreeken, and Christos Faloutsos.
    SIAM International Conference on Data Mining (SDM) 2014, Philadelphia, Pennsylvania, USA.
    Best of SDM 2014.
    [PDF] [BIBTEX]

  • HEigen: Spectral Analysis for Billion-Scale Graphs.
    U Kang, Brendan Meeder, Evangelos E. Papalexakis, and Christos Faloutsos.
    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 26, no.2, pp. 350-362, Feb. 2014.
    [PDF] [BIBTEX]

  • Mining Tera-scale graphs with ‘Pegasus’: algorithms and discoveries.
    U Kang and Christos Faloutsos.
    Large Scale Data Analytics, Springer, January 2014. Editors: Aris Gkoulalas- Divanis and Abdel Labbi.
    (Book Chapter)
    [BIBTEX]

2013


2012

  • Big graph mining: algorithms and discoveries.
    U Kang and Christos Faloutsos.
    ACM SIGKDD Explorations Newsletter Volume 14 Issue 2, December 2012. pp. 29-36.
    [PDF] [BIBTEX]

  • GigaTensor: Scaling Tensor Analysis Up By 100 Times - Algorithms and Discoveries.
    U Kang, Evangelos Papalexakis, Abhay Harpale, and Christos Faloutsos.
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2012, Beijing, China.
    [PDF] [BIBTEX]

  • GBASE: An Efficient Analysis Platform for Large Graphs.
    U Kang, Hanghang Tong, Jimeng Sun, Ching-Yung Lin, and Christos Faloutsos.
    VLDB Journal, 2012.
    [PDF] [BIBTEX]

  • Mining Tera-Scale Graphs: Theory, Engineering and Discoveries.
    U Kang.
    Ph.D. Thesis, Computer Science Department, Carnegie Mellon Univeristy, May 2012.
    [PDF] [BIBTEX]

  • Large Graph Mining System for Patterns, Anomalies & Visualization.
    Leman Akoglu*, Duen Horng Chau*, U Kang*, Danai Koutra*, and Christos Faloutsos.
    Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2012, Kuala Lumpur, Malaysia.
    (Demo paper)
    [BIBTEX]

  • OPAvion: Mining and visualization in large graphs.
    Leman Akoglu*, Duen Horng Chau*, U Kang*, Danai Koutra*, and Christos Faloutsos.
    ACM SIGMOD Conference 2012, Scottsdale, AZ, USA.
    (Demo paper)
    [PDF] [BIBTEX]

  • Axiomatic Analysis of Co-occurrence Similarity Functions.
    U Kang, Mikhail Bilenko, Dengyong Zhou, and Christos Faloutsos.
    CMU Computer Science Tech Report CMU-CS-12-102, February 2012.
    [PDF] [BIBTEX]

  • Fast Random Walk Graph Kernel.
    U Kang, Hanghang Tong, and Jimeng Sun.
    SIAM International Conference on Data Mining (SDM) 2012, Anaheim, California, USA. (acceptance rate 27 %)
    [PDF] [BIBTEX]

  • PEGASUS: Mining Billion-Scale Graphs in the Cloud.
    U Kang, Duen Horng Chau, and Christos Faloutsos.
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2012, Kyoto, Japan
    (Special Session: Signal and Information Processing for "Big Data")
    [PDF] [BIBTEX]

2011

  • Beyond `Caveman Communities': Hubs and Spokes for Graph Compression and Mining.
    U Kang and Christos Faloutsos.
    IEEE International Conference on Data Mining (ICDM) 2011, Vancouver, Canada. (acceptance rate 12.2 %)
    [PDF] [BIBTEX] [CODE]

  • Unifying Guilt-by-Association Approaches: Theorems and Fast Algorithms.
    Danai Koutra, Tai-You Ke, U Kang, Duen Horng (Polo) Chau, Hsing-Kuo Kenneth Pao, and Christos Faloutsos.
    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2011, Athens, Greece. (acceptance rate 20.2 %)
    [PDF] [BIBTEX]

  • GBASE: A Scalable and General Graph Management System.
    U Kang, Hanghang Tong, Jimeng Sun, Ching-Yung Lin, and Christos Faloutsos.
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2011, San Diego, CA, USA. (acceptance rate 17.5 %)
    [PDF] [BIBTEX]

  • Clustering Very Large Multi-dimensional Datasets with MapReduce.
    Robson L. F. Cordeiro, Caetano Traina Jr., Agma J. M. Traina, Julio Lopez, U Kang, and Christos Faloutsos.
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2011, San Diego, CA, USA. (acceptance rate 17.5 %)
    [PDF] [BIBTEX]

  • Spectral Analysis for Billion-Scale Graphs: Discoveries and Implementation.
    U Kang, Brendan Meeder, and Christos Faloutsos.
    Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2011, Shenzhen, China. (acceptance rate 9.7 %)
    Best Paper Award: Best Application Paper.
    [PDF] [BIBTEX]

  • Centralities in Large Networks: Algorithms and Observations.
    U Kang, Spiros Papadimitriou, Jimeng Sun, and Hanghang Tong.
    SIAM International Conference on Data Mining (SDM) 2011, Mesa, Arizona, USA. (acceptance rate 25.1 %)
    [PDF] [BIBTEX]

  • Mining Large Graphs: Algorithms, Inference, and Discoveries.
    U Kang, Duen Horng Chau, and Christos Faloutsos.
    IEEE International Conference on Data Engineering (ICDE) 2011, Hannover, Germany. (acceptance rate 19.8 %)
    [PDF] [BIBTEX]

  • HADI: Mining Radii of Large Graphs.
    U Kang, Charalampos E. Tsourakakis, Ana Paula Appel, Christos Faloutsos, and Jure Leskovec.
    ACM Transactions on Knowledge Discovery from Data (TKDD), 2011.
    [PDF] [BIBTEX]

  • PEGASUS: Mining Peta-Scale Graphs.
    U Kang, Charalampos E. Tsourakakis, and Christos Faloutsos.
    Knowledge and Information Systems (KAIS), Springer, 2011.
    [PDF] [BIBTEX]

2010

  • Patterns on the Connected Components of Terabyte-Scale Graphs.
    U Kang, Mary McGlohon, Leman Akoglu, and Christos Faloutsos.
    IEEE International Conference on Data Mining (ICDM) 2010, Sydney, Australia. (acceptance rate 19.4 %)
    [PDF] [BIBTEX]

  • Inference of Beliefs on Billion-Scale Graphs.
    U Kang, Duen Horng "Polo" Chau, and Christos Faloutsos.
    Large-scale Data Mining: Theory and Applications (LDMTA) 2010, in conjunction with KDD 2010, Washington D.C., USA.
    [PDF] [BIBTEX]

  • Radius Plots for Mining Tera-byte Scale Graphs: Algorithms, Patterns, and Observations.
    U Kang, Charalampos E. Tsourakakis, Ana Paula Appel, Christos Faloutsos, and Jure Leskovec.
    SIAM International Conference on Data Mining (SDM) 2010, Columbus, Ohio, USA. (acceptance rate 23.4 %)
    [PDF] [BIBTEX]

2009

  • PEGASUS: A Peta-Scale Graph Mining System - Implementation and Observations.
    U Kang, Charalampos E. Tsourakakis, and Christos Faloutsos.
    IEEE International Conference on Data Mining (ICDM) 2009, Miami, Florida, USA. (acceptance rate 8.9 %)
    Best Paper Award: Best Application Paper (runner-up).
    [PDF] [BIBTEX] [PEGASUS HOMEPAGE]

  • DOULION: Counting Triangles in Massive Graphs with a Coin.
    Charalampos E. Tsourakakis, U Kang, Gary Miller, and Christos Faloutsos.
    ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2009, Paris, France. (acceptance rate 20 %)
    [PDF] [BIBTEX]

2008

  • HADI: Fast Diameter Estimation and Mining in Massive Graphs with Hadoop .
    U Kang, Charalampos Tsourakakis, Ana Paula Appel, Christos Faloutsos, and Jure Leskovec.
    CMU Machine Learning Tech Report CMU-ML-08-117, December 2008.
    [PDF] [BIBTEX]

free web stats