Publication Date

2007-01-01

Availability

Open access

Degree Type

Thesis

Degree Name

Master of Science (MS)

Department

Electrical and Computer Engineering (Engineering)

Date of Defense

2007-05-06

First Committee Member

Dr. Miroslav Kubat - Committee Chair

Second Committee Member

Dr. Mei-Ling Shyu - Committee Member

Third Committee Member

Dr. Peter Tarjan - Committee Member

Abstract

The importance of attribute vector ambiguity has been largely overlooked by the machine learning community. A pattern recognition problem can be solved in many ways within the scope of machine learning. Neural Networks, Decision Tree Algorithms such as C4.5, Bayesian Classifiers, and Instance Based Learning are the main algorithms. All listed solutions fail to address ambiguity in the attribute vector. The research reported shows, ignoring this ambiguity leads to problems of classifier scalability and issues with instance collection and aggregation. The Algorithm presented accounts for both ambiguity of the attribute vector and class label thus solving both issues of scalability and instance collection. The research also shows that when applied to sanitized data sets, suitable for traditional instance based learning, the presented algorithm performs equally as well.

Keywords

Attribute Vector Ambiguity; Instance Based Learning

Share

COinS