Data mining (also known as knowledge discovery) is an interdisciplinary area of computer science that aims to extract new knowledge and insights from large, complex data sets. The course introduces essential pattern recognition methodologies that leverage machine learning and rule-based techniques. Supporting tasks such as data processing, cleaning, integration, and transformation are also covered. An etymology of data mining is provided to help students compare and contrast knowledge discovery with contemporary data analytics and decision support methodologies.
Prerequisites: CS 1103, CS 2704 and (STAT 2593 or STAT 2793).