Nautilus Systems, Inc. logo and menu bar Site Index Home
News Books
Button Bar Menu- Choices also at bottom of page About Nautilus Services Partners Case Studies Contact Us
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Subscribe]

Re: DM: Re: problem of sample size


From: Roberto Bayardo/Almaden/IBM
Date: Wed, 30 Aug 2000 12:05:05 -0700


One useful way of analyzing minority classes is to use a rule miner which
accepts a consequent constraint (In this case, the consequent constraint
would be the minority class identifier).  This won't build a classifier for
you, but it will build a useful descriptive model (which can then be turned
into a predictive model with other techniques).

I have put the "dense-miner" algorithm on-line, which allows you to do just
this. If you can make your data-set HTTP accessible, you can try it out.

Here are a few things to note:

o This publically accessible version has several restrictions. For example,
it will only attempt data-sets of size 10 MBytes or less.

o Works only with categorical attributes (so continuous attributes must be
discretized).

Instructions are available at:

http://www.almaden.ibm.com/cs/people/bayardo/vinci/index.html

Roberto Bayardo






[ Home | About Nautilus | Case Studies | Partners | Contact Nautilus ]
[ Subscribe to Lists | Recommended Books ]

logo Copyright © 1998-2000 Nautilus Systems, Inc. All Rights Reserved.
Email: firschng@nautilus-systems.com
Mail converted by MHonArc 2.2.0