CompactLEM2: A Scalable Rough Set based Knowledge Acquisition Method that Generates Small Number of Short Rules

Document type: Conference Papers
Peer reviewed: Yes
Author(s): Yang Liu, Guohua Bai, Boqin Feng
Title: CompactLEM2: A Scalable Rough Set based Knowledge Acquisition Method that Generates Small Number of Short Rules
Conference name: Cognitive Informatics, 2008. ICCI 2008. 7th IEEE International Conference on
Year: 2008
Pagination: 215-222
ISBN: 978-1-4244-2538-9
Publisher: IEEE CS Press
City: Stanford University, CA, USA
URI/DOI: 10.1109/COGINF.2008.4639171
Organization: Blekinge Institute of Technology
Department: School of Engineering - Dept. of Interaction and System Design (Sektionen för teknik – avd. för interaktion och systemdesign)
School of Engineering, S-372 25 Ronneby
+46 455 38 50 00
http://www.tek.bth.se/
Authors e-mail: yli@bth.se
Language: English
Abstract: The complexity of knowledge plays an important role in the success of any type of knowledge acquisition algorithm operating on large-scale databases. The LERS (Learning from Examples based on Rough Sets) system is a rule-based knowledge acquisition system characterized by excellent accuracy, but it does not take the complexity of the generated rule set into account. This may cause interpretation problems for humans, and the classification knowledge may overfit the training data. In this paper, CompactLEM2 is proposed as a scalable knowledge acquisition method that extracts rule sets in an easily understood form, i.e., a small rule set with short rules, without sacrificing classification accuracy. The main advantage of CompactLEM2 is its high efficiency. It can also produce a compact rule set that fully or approximately describes the classifications of the given examples. We show theoretically and experimentally that CompactLEM2 exhibits log-linear asymptotic complexity in the number of training examples in most cases. We also present an example to illustrate the characteristics of the algorithm. Finally, the capabilities of our method are demonstrated on eleven datasets. The experimental results are encouraging: the extracted rules are short and the rule set is small, while the classification accuracy stays at the same level as that of other rule acquisition methods in the LERS system.
Subject: Computer Science\Artificial Intelligence
Business Administration\Information
Computer Science\General
Keywords: Knowledge acquisition, rough set, LERS data mining system, rule induction, classification.