Alright friends, I am blogging my first steps towards some meaningful research
I walked into the lab today and Vineet tells me that our classifier is doing a
100% accurate classification feeding off itself. Yes, I know that does not mean
much, but this opens a whole new set of possibilities. (For those not into data
mining, take my word for it, for the experts, just bear !!)
Some of the things we plan to look at
1. Can we boost the accuracy with Intelligent Keyword selection
2. Can we actually use incremental learning algorithms to induce decision trees
3. Do we need to bias the filters in filtering out junk email towards more
Now that we have the infrastructure in place we can answer all these questions.
Thanks Vineet for getting us this far!!!