Skip to main content
notice

Master Thesis Defense: Rahmah Brnawy

June 4, 2015
|


 

 

Speaker: Rahmah Brnawy

Supervisor: Dr. N. Shiri

Examining Committee: Drs. G. Grahne, S. P. Mudur,
P. Rigby (Chair)

Title:  An Efficient Technique for Clustering Data With Mixed Attribute Types

Date: Thursday, June 4, 2015

Time: 14:00


Place: EV 3.309

ABSTRACT

Clustering is a technique used to extract useful information and discover patterns from data. Existing clustering techniques have often focused on datasets with attributes that are either numeric or categorical but not both. The problem of clustering mixed numeric and categorical datasets has received increased attention recently, resulted in a number of solution methods. In this research, we study these solutions and propose two clustering algorithms. The first algorithm is Cluc+, which extends and improves an existing algorithm for clustering pure categorical data. Using Cluc+, we then develop a new algorithm, called K-mixed, for clustering data with mixed numeric and categorical attribute types. We experimentally evaluate the performance of these algorithms using real-life benchmark datasets. Our results indicate increased efficiency and accuracy of our proposed solution techniques.

 

 




Back to top

© Concordia University