A Novel Incremental Framework for Building Classifier Using Constraint Class Association Rules

A Novel Incremental Framework for Building Classifier Using Constraint Class Association Rules

B. Subbulakshmi, C. Deisy, S. Parthasarathy
Copyright: © 2023 |Pages: 21
DOI: 10.4018/IJIRR.316125
Article PDF Download
Open access articles are freely available for download

Abstract

Associative classification (AC) performs much better than other traditional classifiers. It generates a huge number of class association rules (CARs). Since users are interested in the subset of rules, constraints are introduced in the generation of CARs. Real-world databases are record-based in which data is continuously added which demands incremental mining. Hence, constraint class association rules (CCAR) is mined from incremental data. To limit the number of rules and to remove the duplicate rules, redundant rule pruning and duplicate rule pruning techniques are applied. To improve the accuracy of the classifier, the rule selection using principality metric has been applied and the classifier is constructed with rules possessing high principality. Then, classifier is evaluated using single rule and multiple rule prediction methods and the accuracy of the proposed classifier are measured. Experimental results show that the accuracy of the proposed classifier is relatively higher when compared to other algorithms.
Article Preview
Top

1. Introduction

1.1 Associative Classification

Associative Classification (AC) (Vishwakarma et al. 2013; Phan-Luong, 2013) is an integration of association and classification methods aims to build a classifier describing the classes of the input training data set. Class Association Rules (CARs) (Mabu et al. 2011) are basically used to build a classification model for which prediction defines the relationship between the itemsets and the class labels. It is mainly used in the medical dataset. Based on the generated rules, it is possible to identify population at high risk for a particular disease.

Constructing an associative classifier involves two steps. Given the training dataset, class association rules are generated first. Then, a small subset of high- quality rules are obtained using pruning methods and an accurate classifier or model is built for the training data. However, in large or correlated data sets, rule mining may yield a huge number of classification rules. Hence, pruning techniques, in particular, support- based pruning, are exploited to reduce the complexity of the extraction task.

1.2 Motivation

A class association rule is a special case of association rule where the rule consequent contains only a class label (Nguyen et al., 2016). The basic objective of any user in mining class association rules (CARs) is to determine a complete set of CARs in such a way that it satisfies user-specified minimum support and minimum confidence thresholds. In literature there exists several methods and techniques have been proposed to manage this issue in CARs. In fact, the CAR mining has been widely used in various practical domains, namely tourism management ((Rong et al., 2012; Leung et al., 2013), social media security (Zhang et al., 2007), healthcare (Nahar et al., 2013), and education sectors (Luna et al., 2015). However, it is also observed by previous researchers that the results obtained as a result of CARs mining has redundancy.

Many real time applications are dynamic in nature such that the data is incrementally added to the database. The major problem with incremental mining is updating knowledge which was mined from the original database. Methods for mining frequent itemsets from incremental datasets have been proposed by (Cheung et al. 1996; Hong et al. 2001; Le et al. 2011, 2012 ; Vo et al. 2014). These methods reduce the execution time and memory usage compared to those obtained from rescanning the original dataset. Thus the techniques of incremental frequent itemset mining can be integrated into constraint CAR Mining. Hence, the incremental method generates all CARs satisfying the given constraint whenever the original dataset is added with new records. However, there exists the possibility of generation of duplicate and redundant CCARs. It can be addressed by applying post processing techniques such as pruning and selection of rules.

In the post-processing step, the Constraint Class Association Rules (CCARs) can be pruned, ranked and then quality rules are selected in order to build a compact and high quality classifier. The classifier is evaluated by measuring the number of correct classifications made for a given set of test instances. Finally, the prediction of test instances is made by selecting the single rule or multiple rules satisfying the antecedent of test instances.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024)
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022): 3 Released, 1 Forthcoming
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2013)
Volume 2: 4 Issues (2012)
Volume 1: 4 Issues (2011)
View Complete Journal Contents Listing