Nautilus Systems, Inc. logo and menu bar Site Index Home
News Books
Button Bar Menu- Choices also at bottom of page About Nautilus Services Partners Case Studies Contact Us
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Subscribe]

Re: DM: Clustering and categorical attributes (fwd)


From: Warren Sarle
Date: Mon, 12 Jan 1998 17:42:39 -0500 (EST)
Murray Jorgensen <maj@waikato.ac.nz> wrote:
> Surely k-means clustering requires numerical attributes?

There is nothing inherently wrong with doing k-means on dummy 0|1
variables generated from categorical attributes. K-means tries to
minimize sums of squared Euclidean distances, which can be expressed
as sums of simple matching coefficients when applied to categorical
data. But whether this is a good thing to do depends on the purpose
of the analysis and the nature of the data.

-- 

Warren S. Sarle       SAS Institute Inc.   The opinions expressed here
saswss@unx.sas.com    SAS Campus Drive     are mine and not 
necessarily
(919) 677-8000        Cary, NC 27513, USA  those of SAS Institute.
* Do not send me unsolicited commercial, political, or religious 
email *



[ Home | About Nautilus | Case Studies | Partners | Contact Nautilus ]
[ Subscribe to Lists | Recommended Books ]

logo Copyright © 1998 Nautilus Systems, Inc. All Rights Reserved.
Email: nautilus-info@nautilus-systems.com
Mail converted by MHonArc 2.2.0