#1
05-09-11, 17:18
Ozzyb89
Registered User
Join Date: May 2011
Posts: 5
Missing dataset (k-nn)

I am comparing three classification methods: decision tree, Naïve Bayes, and k-NN. My dataset has missing values, which I was able to replace using the Weka value replace tool.

I ran two tests, one before replacing the missing values and one after. Before the replacement, k-NN correctly classified 55% of the instances; after the replacement, it correctly classified 56%.
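To put a one-point change in context, here is a rough sketch of the sampling noise on an accuracy estimate, using the binomial standard error. The test-set sizes are hypothetical (the post does not say how many instances were used), and the function name `accuracy_std_error` is made up for illustration.

```python
import math

def accuracy_std_error(accuracy: float, n_instances: int) -> float:
    """Binomial standard error of an accuracy measured on n_instances."""
    return math.sqrt(accuracy * (1.0 - accuracy) / n_instances)

# Hypothetical test-set sizes; the real size is not given in the post.
for n in (100, 300, 1000, 10000):
    se = accuracy_std_error(0.55, n)
    print(f"n={n:>5}: standard error of a 55% accuracy is about {se * 100:.1f} points")
```

If the dataset has only a few hundred instances, the standard error is several points, so a move from 55% to 56% may be nothing more than noise.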

I am wondering why this happens with k-NN but not with the other classification methods?
#2
05-10-11, 10:03
mike_bike_kite
vaguely human
Join Date: Jun 2007
Location: London
Posts: 2,519
I doubt you'll get a good response on this forum but I'll start the ball rolling:
  • If you only have a small amount of data (fewer than a few hundred instances) then small anomalies will look like large trends.
  • Is your Weka tool putting in correct replacement values?
  • Is it then correct to store these values in a database rather than just record that no value is known?
  • Assuming the three methods all use different strategies to classify items, isn't it fair to guess that they'll use your new artificial data in different ways?
  • Have you looked at your coursework notes?
__________________
Mike
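As an illustration of the point that a classifier may use replaced values in its own way, here is a minimal sketch of a 1-nearest-neighbour lookup on toy data. It makes several assumptions: the data is invented, attributes are already scaled to [0, 1], a missing attribute is counted as the maximum difference of 1.0 (one common convention, not necessarily what Weka does), and the replacement strategy is plain column-mean imputation. All function names are hypothetical.

```python
import math

def distance_missing_as_max(x, y):
    """Euclidean distance on [0, 1]-scaled attributes.
    Assumption: a missing value (None) counts as the maximum difference, 1.0."""
    total = 0.0
    for a, b in zip(x, y):
        diff = 1.0 if a is None or b is None else a - b
        total += diff * diff
    return math.sqrt(total)

def impute_column_means(rows):
    """Replace each None with the mean of the known values in its column."""
    means = []
    for j in range(len(rows[0])):
        known = [r[j] for r in rows if r[j] is not None]
        means.append(sum(known) / len(known))
    return [[means[j] if v is None else v for j, v in enumerate(r)] for r in rows]

def nearest_label(query, rows, labels):
    """Label of the single closest training row (1-NN)."""
    best = min(range(len(rows)), key=lambda i: distance_missing_as_max(query, rows[i]))
    return labels[best]

# Invented toy data: the first row has a missing second attribute.
rows = [[0.1, None], [0.6, 0.9], [0.9, 0.2]]
labels = ["a", "b", "a"]
query = [0.2, 0.8]

print("before replacement:", nearest_label(query, rows, labels))
print("after replacement: ", nearest_label(query, impute_column_means(rows), labels))
```

In this toy case the row with the missing value is pushed far away before replacement and becomes the nearest neighbour afterwards, so the prediction changes. A distance-based method like k-NN is directly exposed to whatever values are filled in, which is one plausible reason its accuracy could shift while the other methods stay the same.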