KDD 2003 – Accepted Papers
#117 Efficient Elastic Burst Detection in Data Streams
Authors: Yunyue Zhu, Dennis Shasha
#178 XRules: An Effective Structural Classifier for XML Data
Authors: Mohammed Zaki, Charu Aggarwal
#153 Proximus: A Framework for Analyzing Very High Dimensional Discrete-Attributed Datasets
Authors: Mehmet Koyuturk, Ananth Grama
#180 Fast Vertical Mining Using Diffsets
Authors: Mohammed Zaki, Karam Gouda
#264 Towards Systematic Design of Distance Functions for Data Mining Applications
Authors: Charu Aggarwal
#292 On Detecting Differences Between Groups
Authors: Geoff Webb, Shane Butler, Douglas Newlands
#358 Eliminating Noisy Information in Web Pages for Data Mining
Authors: Lan Yi, Bing Liu, Xiaoli Li
#375 Mining Concept-Drifting Data Streams using Ensemble Classifiers
Authors: Haixun Wang, Wei Fan, Philip Yu, Jiawei Han
#390 Maximizing the Spread of Influence through a Social Network
Authors: David Kempe, Jon Kleinberg, Eva Tardos
#457 Privacy-Preserving K-Means Clustering over Vertically Partitioned Data
Authors: Jaideep Vaidya, Chris Clifton
#469 To Buy or Not to Buy: Mining Airline Fare Data to Minimize Ticket Purchase Price
Authors: Oren Etzioni, Craig Knoblock, Rattapoon Tuchinda, Alexander Yates
#326 An Iterative Hypothesis-Testing Strategy for Pattern Discovery
Authors: Richard Bolton, Niall Adams

Profusion of information

My dial-up costs a lot of Indian Rupees. The phone lines are usually clogged, so I get very little bandwidth, and the phone company charges per minute too.

It sucks, but I am pretty addicted to blogging. Here is a profusion of links I picked up this morning.

Joel spoke on Craftsmanship. His articles sometimes state the obvious, especially when detailed works like “The Design of Everyday Things” have already covered the ground, but it does not hurt to be reminded why an engineer is required to think.


The iPod


Eric S. Raymond on Threads

HeisenBugs & Threads

Take your Time

Wired is running an article on Kelly’s cool things. Kelly is addicted to blogging about objects that you come to love through use or over time. He hates appreciating newness for newness’ sake.

Kelly’s Cool things

Kelly’s website



My trip looks like this
Seattle (6hrs) -> Chicago (4hrs) -> (8.5 hrs) London (2 hrs) ->(8.5 hrs) Mumbai

It was a harrowing flight; Air India wasn't the best of airlines.

I am exhausted and not up to the weather.
I am excited and disbelieving of the reality.
I am babbling and mostly working off all my energy.
I know that eventually one will come to grips with the situation and work it.
I loved eating Chana Bhatura at Udupi Palace. Now that I can enjoy the same every day, I am too ill to do so 🙂 Ironic!

Static filters work for me

With the help of static filters and over 35 mail filter rules, I have blocked 4300 spam messages since the start of the year, against 900 legitimate emails, with about 5 false positives and approximately 100 false negatives.
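For anyone curious how those counts translate into the usual filter metrics, here is a rough sketch in Python. It rests on my own reading of the numbers, which is an assumption: the 4300 blocked messages are all real spam, the 900 legitimate emails were delivered correctly, the 5 false positives are legitimate mails that got blocked, and the 100 false negatives are spam that slipped through. The function name `filter_metrics` is made up for illustration.

```python
def filter_metrics(tp, fp, fn, tn):
    """Return precision, recall and false positive rate for a spam filter.

    tp: spam correctly blocked       fp: legitimate mail wrongly blocked
    fn: spam that reached the inbox  tn: legitimate mail correctly delivered
    """
    precision = tp / (tp + fp)          # share of blocked mail that was spam
    recall = tp / (tp + fn)             # share of all spam that was blocked
    false_positive_rate = fp / (fp + tn)  # share of legitimate mail lost
    return {
        "precision": precision,
        "recall": recall,
        "false_positive_rate": false_positive_rate,
    }


# Counts taken from the post, under the assumptions stated above.
metrics = filter_metrics(tp=4300, fp=5, fn=100, tn=900)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

If the 4300 figure already includes the false positives, the numbers shift slightly, but the picture stays the same: very few good mails lost, and a small share of spam getting through.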

Life goes on

At work, does it feel weird? No, not at all, just a usual work day!

Two links
Blogging activity in RIT
Cheap Guju

For those who don’t know what a Cheap Guju is, a Cheap Guju is a ‘Desi’ or Asian who is particularly stingy and tries to stretch the Dollar, sometimes too far. It’s all a bit of humour, no offence intended. In fact I stretched the dollar too, and I shall continue to do so as long as my bank balance remains in 2 – 3 digits.

Man versus Machine

News report just in, Kasparov had an advantage over Fritz in 16 moves! Looks like Fritz has a programming problem with his opening game :)!

He probably studied the computer’s ‘thinking’ really well before coming in with a killer strategy. Some say the strategy was too forward thinking!! Kasparov is now in the business of breaking computer chess programs 🙂 Brilliant I say, every chess programmer’s worst nightmare. New Scientist explains the twin strategy that Kasparov employed very clearly!

Last week here…

I think I have grown to enjoy the place, my house, my work, this city and maybe even the weather. I wish I did not have to leave, but c’est la vie… so it must be!

Now I have to think of another name for my blog…

Progress with Systems

List of things to think about:
* With large code bases, bug fixes, however minor, require specs.
* Take time to think about what you're about to code. This is THE most important thing you can do to avoid reworking the code you wrote a month ago. Ideally, for every 4 days of coding you want to think about the problem for at least 1 day. (Credit: Murthy Gorthy)
* Take time off to assimilate the code already present, review it thoroughly and understand the mechanics.
* Check your data going across the wire: is it efficient?
* Those tiny bug fixes may call for additional design: not necessarily extensible, but at least not tightly coupled. Think sub-classes, interfaces and so on.
* Unit Test, Unit Test, Unit Test, hard!
* Is all the functionality that might be affected still OK? Run some functional tests to find out!
* Clean up unused code! Dead code sucks!
* Code reviews are necessary. Death to those who waste their peers' time!
* And also… documentation, documentation, documentation… so all future developers send up a silent cheer for you. (Credit: frequent_stopperby)
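
To make the "unit test hard" point concrete, here is a minimal sketch in Python of pinning down a small bug fix with tests. Everything in it is hypothetical and only for illustration: `normalize_whitespace` stands in for whatever minor fix you just made, and the test names are my own.

```python
import unittest


def normalize_whitespace(text):
    """Hypothetical 'minor bug fix': collapse any run of whitespace to one space."""
    return " ".join(text.split())


class NormalizeWhitespaceTest(unittest.TestCase):
    def test_collapses_internal_runs(self):
        # Tabs and newlines count as whitespace too.
        self.assertEqual(normalize_whitespace("a   b\t\nc"), "a b c")

    def test_strips_leading_and_trailing(self):
        self.assertEqual(normalize_whitespace("  a b  "), "a b")

    def test_empty_input_stays_empty(self):
        self.assertEqual(normalize_whitespace(""), "")


# Run the tests in-process so the result can be inspected afterwards.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(NormalizeWhitespaceTest)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

The edge cases (empty input, mixed whitespace) are where tiny fixes tend to break something else, so they are the ones worth writing down first.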

From the J2EE Tutorial

To reduce costs and fast-track application design and development, Java 2 Platform, Enterprise Edition (J2EE) provides a component-based approach to the design, development, assembly, and deployment of enterprise applications. The J2EE platform offers a multitiered distributed application model, reusable components, a unified security model, flexible transaction control, and Web services support through integrated data interchange on eXtensible Markup Language (XML)-based open standards and protocols.

From the Tutorial