Data Mining
ConGESE COSC-6412
Fall 2007


Semester: Fall 2007
Course: ConGESE COSC-6412
Time: Tue 1:30pm-4:30am
Location: IBM Training Center
Instructor: Aijun An
Office Hours: Tue: 4:30pm - 5:00pm (in the classroom)
Phone #: 416-736-2100 x44298
e-mail: aan@cse.yorku.ca


Welcome to the Data Mining course, COSC-6412, for Fall 2007. Materials, instructions, and notices for the course will accumulate here over the semester.


Message Board

November 9, 2007
Assignment 2 is posted. See the link below under "Assignments".
October 12, 2007
Assignment 1 is posted. See the link below under "Assignments".
October 1, 2007
The web site is set up. Welcome to the course! Lectures will start tomorrow at 1:30pm.


Description

Data mining or knowledge discovery from databases (KDD) is one of the most active areas of research in databases. It is at the intersection of database systems, statistics, AI/machine learning, and data visualization. In this course, we will introduce the concepts of data mining and present data mining algorithms and applications. Topics include association rule mining, sequential pattern mining, classification models, and clustering.


Prerequisites

  • Required: an introductory course on database systems and an introductory course on probability.
  • Preferred: basic knowledge on statistics.


Reference Books and Materials

  • Jiawei Han and Micheline Kamber, Data Mining -- Concepts and Techniques, Morgan Kaufmann, Second Edition, 2006.
  • Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Introduction to Data Mining, Addison Wesley, 2006.
  • Ian H. Witten and Eibe Frank, Data Mining -- Practical Machine Learning Tools and Techniques (Second Edition), Morgan Kaufmann, 2005.
  • Margaret H. Dunham, Data Mining -- Introductory and Advanced Topics, Prentice Hall, 2003.
  • Some conference/journal papers (More will be posted over the semester).


Grading Scheme

  • Assignments (40%)
  • Paper review and presentation (10%)
  • Course project (40%)
  • Participation (10%)


Lecture Notes


Assignments


Paper Review and Presentation


Project


Useful On-line Information