Tuesday, February 26, 2008


IEEE SMC 2008 will be hold in Singapore, from Oct 12 to 15.

Important dates:

Submission due of special session proposals
March 2, 2008 (Sunday)
Submission due of papers (full-length papers only)
March 16, 2008 (Sunday)
Notification of acceptance/rejection
May 15, 2008 (Thursday)


Hypertable is one open source project from Google. It is for the design and implementation of a high performance, scalable, distributed storage and processing system for structured and unstructured data.

By now, the latest version is

Tuesday, February 19, 2008

Some references for MapReduce(from Wikipedia)

"MapReduce: Simplified Data Processing on Large Clusters" — paper by Jeffrey Dean and Sanjay Ghemawat; from Google Labs
"Interpreting the Data: Parallel Analysis with Sawzall" — paper by Rob Pike, Sean Dorward, Robert Griesemer, Sean Quinlan; from Google Labs
"Google's MapReduce Programming Model -- Revisited" — paper by Ralf Lammel; from Microsoft
"Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters" — paper by Hung-Chih Yang, Ali Dasdan, Ruey-Lung Hsiao, and D. Stott Parker; from Yahoo and UCLA; published in Proc. of ACM SIGMOD, pp. 1029--1040, 2007. (This paper shows how to extend MapReduce for relational data processing.)
FLuX: the Fault-tolerant, Load Balancing eXchange operator from UC Berkeley provides an integration of partitioned parallelism with process pairs. This results in a more pipelined approach than Google's MapReduce with instantaneous failover, but with additional implementation cost.

"How Google Works - Reducing Complexity" — article from Baseline magazine
"Can Your Programming Language Do This?" — article from the Joel on Software weblog
Nutch MapReduce — article about MapReduce in Nutch from Tom White's weblog
Cat MapReduce — article about MapReduce in Cat from the Cat project wiki.
"Simple Map Reduce in Ruby" - article about using SimpleMapReduce on Ruby's Rinda which uses DrbRuby
"MapReduce: A major step backwards" - column about advances in database technology compared to MapReduce.

Hadoop — open source MapReduce implementation from Apache
IBM MapReduce Tools for Eclipse — a plug-in that supports the creation of MapReduce applications within Eclipse.
QtConcurrent Open Source C++ MapReduce (non-distributed) implementation from Trolltech
Skynet Ruby Map/Reduce Framework
Retrieved from "http://en.wikipedia.org/wiki/MapReduce"

Saturday, February 16, 2008


Just create a page about myself using google page creator.

Sunday, February 03, 2008


The 2008 edition of the IEEE International Conference on Data Mining series (ICDM 2008) will be held in Pisa, Italy, on December 15 thru 19, 2008.
The International Conference on Data Mining series (ICDM) is well established as a top ranked research conference in data mining, providing a premier forum for presentation of original research results, as well as exchange and dissemination of innovative, practical development experiences.
The conference covers all aspects of data mining, including algorithms, software and systems, and applications. In addition, ICDM draws researchers and application developers from a wide range of data mining related areas such as statistics, machine learning, pattern recognition, databases and data warehousing, data visualization, knowledge-based systems, and high performance computing. By promoting novel, high quality research findings, and innovative solutions to challenging data mining problems, the conference seeks to continuously advance the state-of-the-art in data mining. Besides the technical program, the conference will feature workshops, tutorials, panels and, new for this year, the ICDM data mining contest.

[Important Dates]
July 7, 2008 Deadline for paper submission
September 15, 2008 Notification to authors
October 7, 2008 Deadline for camera-ready copies
December 15 – 19, 2008 Conference


The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) will be located in Antwerp, Belgium, from September 15th to 19th, 2008. This event builds upon a very successful series of 18 ECML and 11 PKDD conferences, which have been jointly organized for the past seven years. It has become the major European scientific event in these fields and in 2008 it will comprise presentations of contributed papers and invited speakers, a wide program of workshops and tutorials, a discovery challenge, a panel, a demo track and an industrial track.

Important DatesWorkshop Proposals due March 31stTutorial Proposals due March 31stPaper Submission due April 25thAuthor Notification on June 13thCamera Ready Papers due June 22nd

Let's go to FLAIRS 2008

Accepted as full paper, cheer!