Data Preprocessing

  1. M. Hernandez and S. Stolfo, Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem, Data Mining and Knowledge Discovery, 1998.
  2. Presenter: Wyatt Pease
    Date: 2/10/2015

  3. Wei Cheng, Xiaoming Jin, Jian-Tao Sun, Xuemin Lin, Xiang Zhang, and Wei Wang, Searching Dimension Incomplete Databases, IEEE Transactions on Knowledge and Data Engineering, 2013.
  4. Presenter: Preston Tunnel Wilson
    Date: 2/10/2015

Association Rules

  5. Chun-Nan Hsu and Craig A. Knoblock, Discovering Robust Knowledge from Databases that Change, Data Mining and Knowledge Discovery, Volume 2, Issue 1, 1998, 69-95.
  6. Presenter: Daniel Morris
    Date: 2/12/2015

  7. Xindong Wu, Chengqi Zhang, and Shichao Zhang, Efficient Mining of Both Positive and Negative Association Rules, ACM Transactions on Information Systems, 2004.
  8. Presenter: None (will not be presented)
    Date: 2/12/2015

  9. S.D. Lee, David Cheung and Ben Kao, Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules, Data Mining and Knowledge Discovery, Volume 2, Issue 3, 1998, 233-262.
  10. Presenter: Hong Xu
    Date: 2/17/2015

  11. R. Agrawal and R. Srikant, Fast Algorithms for Mining Association Rules, Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994.
  12. Presenter: Evan Deere
    Date: 2/17/2015

  13. R. Srikant and R. Agrawal, Mining Quantitative Association Rules in Large Relational Tables, SIGMOD 1996.
  14. Presenter: John Alar
    Date: 2/19/2015

  15. D. Cheung, J. Han, V. Ng, and C.Y. Wong, Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique, ICDE, 1996.
  16. Presenter: James Simpson
    Date: 2/24/2015

  17. J. Han and Y. Fu, Mining Multiple-Level Association Rules in Large Databases, IEEE Transactions on Knowledge and Data Engineering, 1999.
  18. Presenter: William Watkins
    Date: 2/24/2015

  19. Eui-Hong (Sam) Han, George Karypis, and Vipin Kumar, Scalable Parallel Data Mining for Association Rules, IEEE Transactions on Knowledge and Data Engineering, 1999.
  20. Presenter: William Kalescky
    Date: 2/26/2015

Pattern Mining

  21. Mohammed J. Zaki, Efficiently Mining Frequent Trees in a Forest, KDD 2002.
  22. Presenter: Haley Adams
    Date: 2/26/2015

  23. Pedro Domingos and Geoff Hulten, Mining High-Speed Data Streams, KDD 2000.
  24. Presenter: Alex Hofmann
    Date: 3/3/2015

  25. X. Yan and J. Han, gSpan: Graph-Based Substructure Pattern Mining, ICDM, 2002.
  26. Presenter: Casey Means
    Date: 3/17/2015

  27. Jiawei Han, Jian Pei, and Yiwen Yin, Mining Frequent Patterns without Candidate Generation, SIGMOD, 2000.
  28. Presenter: Andrew Tackett
    Date: 3/17/2015

  29. R. Agrawal and R. Srikant, Mining Sequential Patterns, Proc. of the Int'l Conference on Data Engineering (ICDE), Taipei, Taiwan, March 1995.
  30. Presenter: Katie Wiener
    Date: 3/19/2015

  31. Jian Pei, Jiawei Han, and Runying Mao, CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets, SIGMOD, 2000.
  32. Presenter: Max Tilka
    Date: 3/24/2015

Classification

  33. Pedro Domingos, MetaCost: A General Method for Making Classifiers Cost-Sensitive, KDD, 1999.
  34. Presenter: Kristopher Baker
    Date: 3/24/2015

  35. B. Abelson, K. Varshney, and J. Sun, Targeting Direct Cash Transfers to the Extremely Poor, KDD, 2014.
  36. Presenter: Alex Wang
    Date: 3/26/2015

Clustering

  37. George Karypis, Eui-Hong (Sam) Han, and Vipin Kumar, CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling, IEEE Computer, 1999.
  38. Presenter: David Thomas
    Date: 3/26/2015

  39. T. Hastie and R. Tibshirani, Discriminant Adaptive Nearest Neighbor Classification, IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI), 1996.
  40. Presenter: Joel Michelson
    Date: 3/31/2015

  41. Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, and Jan Prins, Clustering Pair-wise Dissimilarity Data into Partially Ordered Sets, KDD, 2006.
  42. Presenter: Lucas Grim
    Date: 3/31/2015

  43. S. Arya, D. Mount, N. Netanyahu, R. Silverman, and A. Wu, An Optimal Algorithm for Approximate Nearest Neighbor Searching in Fixed Dimensions, J. ACM 45, 6 (November 1998), 891-923.
  44. Presenter: Wyatt Gale
    Date: 4/7/2015

Neural Networks

  45. Zheng Zhang, Jun Li, C.N. Manikopoulos, Jay Jorgenson, and Jose Ucles, HIDE: a Hierarchical Network Intrusion Detection System Using Statistical Preprocessing and Neural Network Classification, IEEE Workshop on Information Assurance and Security, 2001.
  46. Presenter: Sumner Magruder
    Date: 4/7/2015

Boosting/Bagging

  47. Y. Freund and R. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55(1): 119-139, 1997.
  48. Presenter: Matthew Jackoski
    Date: 4/9/2015

  49. R. Schapire and Y. Singer, Improved Boosting Algorithms Using Confidence-rated Predictions, Machine Learning, 37(3):297-336, 1999.
  50. Presenter: Khang Nguyen
    Date: 4/9/2015

Big Data

  51. S. Brin and L. Page, The Anatomy of a Large-Scale Hypertextual Web Search Engine, Proceedings of the Seventh International Conference on World Wide Web (WWW-7), 1998.
  52. Presenter: Joshua Ladd
    Date: 4/14/2015

  53. R. Kosala and H. Blockeel, Web Mining Research: A Survey, SIGKDD Explorations, Volume 2, Issue 1, June 2000.
  54. Presenter: Catherine Grace Jernigan
    Date: 4/14/2015

  55. Roberto J. Bayardo Jr., Efficiently Mining Long Patterns from Databases, SIGMOD, 1998.
  56. Presenter: Connor Jerow
    Date: 4/16/2015

Applications

  57. Tom Fawcett and Foster Provost, Data Mining for Adaptive Fraud Detection, Data Mining and Knowledge Discovery, 1997.
  58. Presenter: Corrie Moore
    Date: 4/16/2015

  59. J. Dorre, P. Gerstl, and R. Seiffert, Text Mining: Finding Nuggets in Mountains of Textual Data, KDD, 1999.
  60. Presenter: Farah Sharis
    Date: 4/21/2015

  61. J. Yang, J. McAuley, and J. Leskovec, Community Detection in Networks with Node Attributes, IEEE International Conference on Data Mining (ICDM), 2013.
  62. Presenter: Yuanshuo Li
    Date: 4/21/2015

  63. G. Simon, H. Xiong, E. Eilertson, and V. Kumar, Scan Detection: A Data Mining Approach, SIAM International Conference on Data Mining, 2006.
  64. Presenter: Morgan McCullough
    Date: 4/23/2015

  65. J. Liu, C. Aggarwal, and J. Han, On Integrating Network and Community Discovery, WSDM, 2015.
  66. Presenter: Trevor Tamura
    Date: 4/23/2015