ITS665

 Data Mining

Text Mining Algorithm


Course Learning Outcomes


At the end of the course, students should be able to: 

1. Assess the methods of data mining related to real application in data science ( C5 )

2. Manipulate data mining methods based on the given tasks using data mining tools ( P4 )

3. Organise the use of digital skills in model development related to the data mining project ( P5 )


Course Description


The Data Mining course introduces the concepts and methods of data mining and shows its relationship with data science. All the steps involved in knowledge data discovery will be discussed. Topics include Introduction to Data Mining, Data Preparation and Pre-processing, Classification, Model Evaluation & Selection, Clustering, Association Analysis, and ends with Future Trends & Challenges. The algorithm for each modelling process is discussed with supporting examples using real-world datasets. These datasets are used for model building using assessable technology with easy-to-use platforms. The findings will be presented through digital tools. The knowledge and practical skills gained from this course would benefit the students for solving real problems in industry or society-related issues for various SDG-based applications.


Syllabus Content


1. Introduction to Data Mining 

Evolution of Data Mining

Data Mining and Data Science Concept

Knowledge Discovery in Databases

Problem and Challenges

Slide chapter 1 - part 1

Slide chapter 1 - part 2

Book chapter 1


2. Data Preparation for Knowledge Discovery 

Data Understanding

Basic Statistics

Data Exploration

Data Visualization

Slide chapter 2

Book chapter 2


3. Data Pre-processing 

Data Quality

Cleaning, Integration and Reduction

Transformation

Discretization

Slide chapter 3

Book chapter 3


4. Classification 

Classification Concepts

Classification Tasks

Decision Tree

Slide chapter 4

Book chapter 4


5. Model Evaluation and Selection 

Accuracy

Confusion Matrix

Comparison of models

Improving Classifier Accuracy

Slide chapter 5

Book chapter 5


6. Association 

Frequent Patterns

Market Basket Analysis

Apriori Algorithm

Association Rules

Frequent Pattern Tree

Slide chapter 6 - part 1

Slide chapter 6 - part 2

Book chapter 6


7. Clustering 

Cluster Analysis

Similarity and Dissimilarity

Clustering Algorithms

Slide chapter 7 - Part 1

Slide chapter 7 - part 2

Book chapter 7


8. Current and Future Works 

Advanced Data Mining Methods

Future Trends and Applications

Slide chapter 8

Book chapter 8