A couple requests--not a big deal as it's pretty easy to code these myself
outside of waffles, but would help with completeness:
It would be handy to have an option in waffles_transform along the lines of dropmissingvalues that would drop any column whose values are all identical.
Rebuild the list of class values for a nominal attribute from the values in that column.
On both of the above, my issue is that I'm finding that after I do a split on
an arff dataset, I have some redundant columns/attributes.
Thanks!
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
1 sounds pretty easy to implement. I don't quite understand #2. Could you
elaborate on what it does and how that differs from what you would prefer?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
Anonymous
-
2012-06-22
I'd rephrase #2 as: Remove class values for categorical attributes that are no
longer present in that column.
For instance, suppose you have file1.arff that starts out with the attribute:
@ATTRIBUTE people {Alice, Bob, Charlie}
Now I do some sort of split on the file1.arff producing file2.arff. In
file2.arff, I only have "Bob" and "Charlie" in this column. What I'd like is a
command in waffles_transform that will reset the ATTRIBUTE list to just say
@ATTRIBUTE people {Bob, Charlie}
Thanks,
SeattleDad
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
A couple requests--not a big deal as it's pretty easy to code these myself
outside of waffles, but would help with completeness:
On both of the above, my issue is that I'm finding that after I do a split on
an arff dataset, I have some redundant columns/attributes.
Thanks!
1 sounds pretty easy to implement. I don't quite understand #2. Could you
elaborate on what it does and how that differs from what you would prefer?
I'd rephrase #2 as: Remove class values for categorical attributes that are no
longer present in that column.
For instance, suppose you have file1.arff that starts out with the attribute:
@ATTRIBUTE people {Alice, Bob, Charlie}
Now I do some sort of split on the file1.arff producing file2.arff. In
file2.arff, I only have "Bob" and "Charlie" in this column. What I'd like is a
command in waffles_transform that will reset the ATTRIBUTE list to just say
@ATTRIBUTE people {Bob, Charlie}
Thanks,
SeattleDad
ok, I added both features. Here is example usage:
waffles_transform drophomogcols mydata.arff
and
waffles_transform dropunusedvalues mydata.arff