Thursday, November 30, 2006


May 28 - 30, 2007 Montreal, Québec,
Canada Full paper submission due: January 15th 2007

AI'07, the twentieth Canadian Conference on Artificial Intelligence, invites papers that present original work in all areas of Artificial Intelligence, either theoretical or applied such as

Natural Language Agent and Multi-Agent System Machine Learning
User Modeling Search AI applications Constraint Satisfaction
Smart Graphics Knowledge Representation E-Commerce Planning Information Processing Automated Reasoning Bioinformatics Neural Nets Web Applications Reasoning under Uncertainty Education Data Mining Games Robotics Case-based reasoning

Papers will be reviewed by the Program Committee members and judged according to their originality, technical merit and clarity of presentation. Each accepted paper will be allocated a maximum of 12 pages in the proceedings. All accepted papers for which one of the authors will have registered and present at the conference will be published in the conference proceedings as Lecture Notes in Artificial Intelligence - Springer. An award will be given for the best paper of the conference. Another prize will be given for the best paper for which the main author is a student. Papers submitted to AI'2007 must not have been accepted for publication elsewhere or be under review for another conference.

Authors are invited to submit electronically, by January 15th 2007, full papers in PDF, Postscript or MS-Word RTF. All papers must be written in English (only). Papers of up to 12 pages in length must be formatted according to Springer LNCS style. The use of the LaTeX2e style file available from Springer ( is strongly encouraged.

Important dates: -------------------
Full paper submission due: January 15th, 2007
Notification of acceptance: February 26th, 2007
Final paper due : March 15th, 2007

Monday, November 27, 2006

The XML Decade

I found this article to celebrate XML decade on IBM site, and share with you the first paragraph here,
"XML is approaching 10 years old. How closely depends on how you're counting. The W3C Recommendation Extensible Markup Language (XML) 1.0 was published on 10 February 1998. Work on XML started around 1996, however, rooted in almost thirty years of SGML. The design principles for XML, which guided its development were published on 25 August 1996. The first working draft, published on 14 November 1996 defined documents very similar to the majority of XML you might see today. Many of the changes between that first draft and the final recommendation were in more obscure areas of the standard. The basic idea of labeled, balanced, hierarchical tags and clearly defined text encoding were well in place in 1996, and so IBM Systems Journal accounts 2006 the year of XML's decade. Regardless of whether you agree with their counting, it is a volume well worth a thorough read by all XML professionals as it combines an interesting retrospective of XML with some useful articles discussing specific techniques and development, providing a glimpse into the future of the technology, and thus our profession. In this article I offer some comment and expansion on the treatment in IBM Systems Journal, focusing on the keynote article "Technical context and cultural consequences of XML" and one of the other contained papers, "Emerging patterns in the use of XML for information modeling in vertical industries". The latter paper is concerned with a common theme of Thinking XML--the development and adoption of industry-specific XML vocabularies. "

For the whole article, please refer to

Borrow one logo for my usage

Sunday, November 26, 2006

About Weka

Weka is a famous open-source collection of machine learning algorithms for data mining tasks. It originated from Waikato university in NewZealand.

Data mining related job positions is a good resource to find exciting data mining related jobs provided by known or emerging companies.

KDNUGGETS.COM collection of data mining softwares contains almost all the existing software package related to data mining, text mining, web mining, and predictive analysis.

Good links for agile modeling - collects many articles specifically related to iterative and agile methods, plus links; - articles on agile modeling; - has specificalized for years in object technology; - Brad Appleton maintains a large collection of links on software engineering, including iterative methods; - The Chinese font page links to an English version, with a search engine referencing iterative and agile articles.

Thursday, November 23, 2006

COLT 2007

The Twentieth Annual Conference on Learning Theory (COLT 2007) will take place on 13-15 June, 2007, in San Diego, California, as part of the 2007 Federated Computing Research Conference (FCRC) .

Electronic submission of papers
January 16, 2007 (5:59pm PST)
Elec. submission of two-page open problems
February 15, 2007
Notification of acceptance or rejection
March 12, 2007
Final submission of all papers
March 23, 2007
Conference dates
June 13-15, 2007

Interested to higher your ranking in the search result

Pls. visit for any interesting information though it is not encouraged.

about PMML standard

For those engaged in data mining and predictive application development, you are recommended to follow the PMML standard from It is supported by mainstream companies in this field, such as SPSS, SAS, IBM, Microsoft, etc. The newest version is 3.1, but as I know, v3.2 is to born soon.

Wednesday, November 22, 2006


The Industrial/Government Applications Track of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2007) will highlight challenges, lessons, concerns, and research issues arising out of deploying applications of KDD technology. The focus would be on promoting the exchange of ideas between researchers and practitioners of data mining.
The KDD-2007 Industrial/Government Applications (I/G) Track seeks to:
provide a forum for exchanging ideas between KDD practitioners, researchers, companies, and government organizations; and
help commercial and government organizations highlight successful KDD applications,
raise interesting (research) challenges and other concerns more specific to industry and government -- customer privacy issues, analysis of data not generally available in academia, issues of scale that arise more heavily in a corporate setting, etc.
The I/G Applications Track solicits papers describing attempts to deploy KDD solutions relevant to commercial or government challenges. The primary emphasis is on papers that advance our understanding of practical, applied, or pragmatic issues and perhaps highlight new research challenges in real KDD applications. Applications can be in any field including scientific, engineering, commercial, governmental, social, or political. The I/G Applications Track will consist of competitively-selected contributed papers - presented in oral and poster form - as well as invited talks. The full conference will also feature keynote presentations, workshops, tutorials, research track papers, and the KDD Cup competition. We envision submissions along four sub-areas:
Emerging applications, technology, and issues;
Deployed KDD case studies;
Product and experience descriptions; and
Pragmatic issues and research considerations in fielding real applications.
Emerging application, technology, and issue papers discuss prototype applications, tools for focused domains or tasks, useful techniques or methods, useful system architectures, scalability enablers, tool evaluations, or integration of KDD and other technologies. Case studies describe deployed projects with measurable benefits that include KDD technology. Such papers need to demonstrate the importance and impact of the work clearly. Product submissions clearly describe KDD technology embedded in commercial products (without otherwise being a product advertisement). Pragmatic issues and considerations include important practical and research considerations, approaches, and architectures that enable successful applications. Submitters are encouraged (but not required) to select one (or more) of these sub-areas for their papers.In their submission, authors are required to explain why the application is important, the specific need for KDD technology to solve the problem (including why other methods perhaps not based on data mining may fall short), and any innovations or lessons learned in the solution.
For submission details and organizers, see
Important Dates:
Abstracts due: 23 February, 2007 Paper submissions due: 28 February, 2007 (9 pages)

A good example for LaTex explanation

For anyone who want to publish papers frequently, you have to learn LaTex that is great invention by CS scientist Knuth who is the author of "The Art of Programming" too.

About SPSS

SPSS has become a leader in predictive analytics technologies through a combination of commitment to innovation and dedication to customers. You will find SPSS customers in virtually every industry, including telecommunications, banking, finance, insurance, healthcare, manufacturing, retail, consumer packaged goods, higher education, government, and market research.
Customers use SPSS predictive analytics software to anticipate change, manage both daily operations and special initiatives more effectively, and realize positive, measurable benefits. By incorporating predictive analytics into their daily operations, they become Predictive Enterprises—able to direct and automate decisions to meet business goals and achieve measurable competitive advantage.
Today, SPSS is shipping great tools and solutions to customers all over the world, such as SPSS for statistical analysis, Clementine for data mining, and Dimension for survey.

Microsoft XBN Introduction

XBN is the XML-based Bayesian Network storage format, originating from its ascendant BNIF which can be traced back to UAI'96, like XMLBIF. It will replace Microsoft's old DSC format that is used by MSBN tool. For more detail, pls. refer to

KDD 2007

SIGKDD 2007 will be hold in San Jose, California, USA.

Overview of Clementine

Clementine is an integrated data mining workbench. It provides a wide array of data mining techniques, along with pre-built vertical solutions, in an integrated and comprehensive manner, with a special focus on visualisation and ease-of-use. The latest release (Clementine 11.0) incorporates many enhancements, which fall into a number of areas. Productivity is probably the most important of these. One of the key concerns within the business community about analytic activity is the time that is taken. In Clementine 11.0, there are many enhancements that are targeted specifically at enhancing the productivity of the analyst. In addition, data mining is no longer a back-room activity - its results are now widely deployed throughout the enterprise and are leveraged by many users – this new release includes a number of significant features designed to assist in this enterprise-wide deployment.


Google的搜索机制是:几个分布的Crawler(自动搜索软件)同时工作——在网上“爬行”,URL服务器负责向Crawler提供URL的列表。Crawler所找到的网页被送到存储服务器中。存储服务器于是把这些网页压缩后存入一个知识库(repository)中。每个网页都有一个关联ID——doc ID,当一个新的URL从一个网页中解析出来时,就被分配一个doc ID。索引库和排序器负责建立索引,索引库从知识库中读取记录,将文档解压并进行解析。每个文档就转换成一组词的出现状况,称为hits。hits记录了词、词在文档中的位置、字体大小、大小写等。索引库把这些hit又分成一组“barrels”,产生经过部分排序后的索引。索引库同时分析网页中所有的链接,并将重要信息存在Anchors文档中,该这个文档包含了足够信息,可以用来判断一个链接被链入或链出的结点信息。 URL分解器(URL Resolver)阅读Anchors文档,并把相对的URL转换成绝对的URLs,并生成doc ID,它进一步为Anchor文本编制索引,并与Anchor所指向的doc ID建立关联。同时,它还产生由doc ID对(pairs of doc ID)所形成的数据库。这个链接数据库(Links)用于计算所有文档的页面等级(Pagerank)。 排序器会读取barrels,并根据词的ID号(word ID)列表来生成倒排挡。一个名DumpLexicon的程序则把上面的列表和由索引库产生的一个新的词表结合起来产生另一个新的词表供搜索器(Searcher)使用。这个搜索器就是利用一个Web服务器,并使用由DumpLexicon所生成的词表,并利用上述倒排挡以及页面等级来回答用户的提问。 从Google的体系结构、搜索原理中可以看到,其关键是:利用URL分解器获得Links信息,并且运用一定的算法得出页面等级的信息,这正是网络结构挖掘技术。

Tuesday, November 21, 2006


XMLBIF - XML-based BayesNets Interchange Format. It is a XML format to store a Bayes Network aiming at a commonly-agreed standard in this community. It is originally defined by Fabio Cozman and members of the GeNie project, notably Marek Druzdzel and Daniel Garcia. Its newest version is 0.3, and implemented in the JavaBayes and GeNie systems. Possibly, Netica and Hugin will support this format too.

The goal of the current XMLBIF format is to represent directed acyclic graph (DAG) that can be associated to conditional probability measures for discrete variables, with the possibility that decision and utility variables be present in the graph. Bayesian Network is one of the most famous DAG widely known today.

I will introduce the detail of XMLBIF in the coming days, and welcome any suggestion and idea exchange with you, my friend.

CS Conference Ranking

AREA: Artificial Intelligence and Related Subjects
Rank 1:
AAAI: American Association for AI National Conference
CVPR: IEEE Conf on Comp Vision and Pattern Recognition
IJCAI: Intl Joint Conf on AI
ICCV: Intl Conf on Computer Vision
ICML: Intl Conf on Machine Learning
KDD: Knowledge Discovery and Data Mining
KR: Intl Conf on Principles of KR & Reasoning
NIPS: Neural Information Processing Systems
UAI: Conference on Uncertainty in AI
ICAA: International Conference on Autonomous Agents
ACL: Annual Meeting of the ACL (Association of Computational Linguistics)
Rank 2:
NAACL: North American Chapter of the ACL
AID: Intl Conf on AI in Design
AI-ED: World Conference on AI in Education
CAIP: Inttl Conf on Comp. Analysis of Images and Patterns
CSSAC: Cognitive Science Society Annual Conference
ECCV: European Conference on Computer Vision
EAI: European Conf on AI
EML: European Conf on Machine Learning
GP: Genetic Programming Conference
IAAI: Innovative Applications in AI
ICIP: Intl Conf on Image Processing
ICNN/IJCNN: Intl (Joint) Conference on Neural Networks
ICPR: Intl Conf on Pattern Recognition
ICDAR: International Conference on Document Analysis and Recognition
ICTAI: IEEE conference on Tools with AI
AMAI: Artificial Intelligence and Maths
DAS: International Workshop on Document Analysis Systems
WACV: IEEE Workshop on Apps of Computer Vision
COLING: International Conference on Computational Liguistics
EMNLP: Empirical Methods in Natural Language Processing
EACL: Annual Meeting of European Association Computational Lingustics
CoNLL: Conference on Natural Language Learning
Rank 3:
PRICAI: Pacific Rim Intl Conf on AI
AAI: Australian National Conf on AI
ACCV: Asian Conference on Computer Vision
AI*IA: Congress of the Italian Assoc for AI
ANNIE: Artificial Neural Networks in Engineering
ANZIIS: Australian/NZ Conf on Intelligent Inf. Systems
CAIA: Conf on AI for Applications
CAAI: Canadian Artificial Intelligence Conference
ASADM: Chicago ASA Data Mining Conf: A Hard Look at DM
EPIA: Portuguese Conference on Artificial Intelligence
FCKAML: French Conf on Know. Acquisition & Machine Learning
ICANN: International Conf on Artificial Neural Networks
ICCB: International Conference on Case-Based Reasoning
ICGA: International Conference on Genetic Algorithms
ICONIP: Intl Conf on Neural Information Processing
IEA/AIE: Intl Conf on Ind. & Eng. Apps of AI & Expert Sys
ICMS: International Conference on Multiagent Systems
ICPS: International conference on Planning Systems
IWANN: Intl Work-Conf on Art & Natural Neural Networks
PACES: Pacific Asian Conference on Expert Systems
SCAI: Scandinavian Conference on Artifical Intelligence
SPICIS: Singapore Intl Conf on Intelligent System
PAKDD: Pacific-Asia Conf on Know. Discovery & Data Mining
SMC: IEEE Intl Conf on Systems, Man and Cybernetics
PAKDDM: Practical App of Knowledge Discovery & Data Mining
WCNN: The World Congress on Neural Networks
WCES: World Congress on Expert Systems
INBS: IEEE Intl Symp on Intell. in Neural \& Bio Systems
ASC: Intl Conf on AI and Soft Computing
PACLIC: Pacific Asia Conference on Language, Information and Computation
ICCC: International Conference on Chinese Computing
ICADL: International Conference on Asian Digital Libraries
RANLP: Recent Advances in Natural Language Processing
NLPRS: Natural Language Pacific Rim Symposium
ICRA: IEEE Intl Conf on Robotics and Automation
NNSP: Neural Networks for Signal Processing
ICASSP: IEEE Intl Conf on Acoustics, Speech and SP
GCCCE: Global Chinese Conference on Computers in Education
ICAI: Intl Conf on Artificial Intelligence
AEN: IASTED Intl Conf on AI, Exp Sys & Neural Networks
WMSCI: World Multiconfs on Sys, Cybernetics & Informatics
LREC: Language Resources and Evaluation Conference

AREA: Hardware and Architecture

Rank 1:
ASPLOS: Architectural Support for Prog Lang and OS
ISCA: ACM/IEEE Symp on Computer Architecture
ICCAD: Intl Conf on Computer-Aided Design
DAC: Design Automation Conf
MICRO: Intl Symp on Microarchitecture
HPCA: IEEE Symp on High-Perf Comp Architecture
Rank 2:
FCCM: IEEE Symposium on Field Programmable Custom Computing Machines
SUPER: ACM/IEEE Supercomputing Conference
ICS: Intl Conf on Supercomputing
ISSCC: IEEE Intl Solid-State Circuits Conf
HCS: Hot Chips Symp
VLSI: IEEE Symp VLSI Circuits
ISSS: International Symposium on System Synthesis
DATE: IEEE/ACM Design, Automation & Test in Europe Conference
Rank 3:
ICA3PP: Algs and Archs for Parall Proc
EuroMICRO: New Frontiers of Information Technology
ACS: Australian Supercomputing Conf
Advanced Research in VLSI
International Symposium on System Synthesis
International Symposium on Computer Design
International Symposium on Circuits and Systems
Asia Pacific Design Automation Conference
International Symposium on Physical Design
International Conference on VLSI Design

AREA: Applications

Rank 1:
I3DG: ACM-SIGRAPH Interactive 3D Graphics
ACM-MM: ACM Multimedia Conference
DCC: Data Compression Conf
SIGMETRICS: ACM Conf on Meas. & Modelling of Comp Sys
SIGIR: ACM SIGIR Conf on Information Retrieval
PECCS: IFIP Intl Conf on Perf Eval of Comp \& Comm Sys
WWW: World-Wide Web Conference
Rank 2:
EUROGRAPH: European Graphics Conference
CGI: Computer Graphics International
CANIM: Computer Animation
PG: Pacific Graphics
NOSSDAV: Network and OS Support for Digital A/V
PADS: ACM/IEEE/SCS Workshop on Parallel \& Dist Simulation
WSC: Winter Simulation Conference
ASS: IEEE Annual Simulation Symposium
MASCOTS: Symp Model Analysis \& Sim of Comp \& Telecom Sys
PT: Perf Tools - Intl Conf on Model Tech \& Tools for CPE
NetStore: Network Storage Symposium
MMCN: ACM/SPIE Multimedia Computing and Networking
JCDL: Joint Conference on Digital Libraries
Rank 3:
ACM-HPC: ACM Hypertext Conf
MMM: Multimedia Modelling
ICME: Intl Conf on MMedia & Expo
DSS: Distributed Simulation Symposium
SCSC: Summer Computer Simulation Conference
WCSS: World Congress on Systems Simulation
ESS: European Simulation Symposium
ESM: European Simulation Multiconference
HPCN: High-Performance Computing and Networking
Geometry Modeling and Processing
DS-RT: Distributed Simulation and Real-time Applications
IEEE Intl Wshop on Dist Int Simul and Real-Time Applications
ECIR: European Colloquium on Information Retrieval
DVAT: IS\&T/SPIE Conf on Dig Video Compression Alg \& Tech
MME: IEEE Intl Conf. on Multimedia in Education
ICMSO: Intl Conf on Modelling, Simulation and Optimisation
ICMS: IASTED Intl Conf on Modelling and Simulation

AREA: System Technology (Including networking and security)

Rank 1:
SIGCOMM: ACM Conf on Comm Architectures, Protocols & Apps
SPAA: Symp on Parallel Algms and Architecture
PODC: ACM Symp on Principles of Distributed Computing
PPoPP: Principles and Practice of Parallel Programming
MassPar: Symp on Frontiers of Massively Parallel Proc
RTSS: Real Time Systems Symp
SOSP: ACM SIGOPS Symp on OS Principles
OSDI: Usenix Symp on OS Design and Implementation
CCS: ACM Conf on Comp and Communications Security
S&P (Oakland): IEEE Symposium on Security and Privacy
MOBICOM: ACM Intl Conf on Mobile Computing and Networking
MOBIHOC: ACM International Symposium on Mobile Ad Hoc Networking and Computing
ICNP: Intl Conf on Network Protocols
OPENARCH: IEEE Conf on Open Arch and Network Prog
PACT: Intl Conf on Parallel Arch and Compil Tech
INFOCOM: Annual Joint Conf IEEE Comp & Comm Soc
Rank 2:
USENIX Symp on Internet Tech and Sys
CC: Compiler Construction
IPDPS: Intl Parallel and Dist Processing Symp
MOBISYS: International Conference on Mobile Systems, Applications, and Services
SenSys: ACM Conference on Embedded Networked Sensor Systems
ICPP: Intl Conf on Parallel Processing
ICDCS: IEEE Intl Conf on Distributed Comp Systems
SRDS: Symp on Reliable Distributed Systems
MPPOI: Massively Par Proc Using Opt Interconns
ASAP: Intl Conf on Apps for Specific Array Processors
Euro-Par: European Conf. on Parallel Computing
Usenix Security Symposium
NDSS: ISOC Network and Distributed System Security Symposium
ESORICS: European Symposium on Research in Computer Security
RAID: International Symposium on Recent Advances in Intrusion Detection
DSN: The International Conference on Dependable Systems and Networks
ACSAC: Annual Computer Security Applications Conference
WCW: Web Caching Workshop
LCN: IEEE Annual Conference on Local Computer Networks
IPCCC: IEEE Intl Phoenix Conf on Comp & Communications
CCC: Cluster Computing Conference
ICC: Intl Conf on Comm
WCNC: IEEE Wireless Communications and Networking Conference
IPSN: International Conference on Information Processing in Sensor Networks
IPTPS: Annual International Workshop on Peer-To-Peer Systems
CSFW: IEEE Computer Security Foundations Workshop
Rank 3:
MPCS: Intl. Conf. on Massively Parallel Computing Systems
GLOBECOM: Global Comm
IMC: Internet Measurement Conference
IC3N: Intl Conf on Comp Comm and Networks
ICCC: Intl Conf on Comp Communication
NOMS: IEEE Network Operations and Management Symp
CONPAR: Intl Conf on Vector and Parallel Processing
VAPP: Vector and Parallel Processing
ICPADS: Intl Conf. on Parallel and Distributed Systems
Public Key Cryptosystems
Fast Software Encryption
SecureComm: Int. Conf on Security and Privacy for Emerging Areas in Communication Networks
AsiaCCS: ACM Symposium on Information, Computer and Communications Security
ACNS: International Conference on Applied Cryptography and Network Security
Annual Workshop on Selected Areas in Cryptography
Australasia Conference on Information Security and Privacy
Int. Conf on Inofrm and Comm. Security
Financial Cryptography
Workshop on Information Hiding
Smart Card Research and Advanced Application Conference
ICON: Intl Conf on Networks
IMSA: Intl Conf on Internet and MMedia Sys
NCC: Nat Conf Comm
IN: IEEE Intell Network Workshop
Softcomm: Conf on Software in Tcomms and Comp Networks
INET: Internet Society Conf
Workshop on Security and Privacy in E-commerce
EEE: IEEE Conference on e-Technology, e-Commerce and e-Service (Suggested by Roy Grønmo. Thanks)

PARCO: Parallel Computing
SE: Intl Conf on Systems Engineering

AREA: Programming Languages and Software Engineering

Rank 1:
POPL: ACM-SIGACT Symp on Principles of Prog Langs
PLDI: ACM-SIGPLAN Symp on Prog Lang Design & Impl
OOPSLA: OO Prog Systems, Langs and Applications
ICFP: Intl Conf on Function Programming
JICSLP/ICLP/ILPS: (Joint) Intl Conf/Symp on Logic Prog
ICSE: Intl Conf on Software Engineering
FSE: ACM Conference on the Foundations of Software Engineering (inc: ESEC-FSE when held jointly)
FM/FME: Formal Methods, World Congress/Europe
CAV: Computer Aided Verification
Rank 2:
CP: Intl Conf on Principles & Practice of Constraint Prog
TACAS: Tools and Algos for the Const and An of Systems
ESOP: European Conf on Programming
ICCL: IEEE Intl Conf on Computer Languages
PEPM: Symp on Partial Evalutation and Prog Manipulation
SAS: Static Analysis Symposium
RTA: Rewriting Techniques and Applications
ESEC: European Software Engineering Conf
IWSSD: Intl Workshop on S/W Spec & Design
CAiSE: Intl Conf on Advanced Info System Engineering
ITC: IEEE Intl Test Conf
IWCASE: Intl Workshop on Cumpter-Aided Software Eng
SSR: ACM SIGSOFT Working Conf on Software Reusability
SEKE: Intl Conf on S/E and Knowledge Engineering
ICSR: IEEE Intl Conf on Software Reuse
ASE: Automated Software Engineering Conference
PADL: Practical Aspects of Declarative Languages
ISRE: Requirements Engineering
ICECCS: IEEE Intl Conf on Eng. of Complex Computer Systems
IEEE Intl Conf on Formal Engineering Methods
Intl Conf on Integrated Formal Methods
FOSSACS: Foundations of Software Science and Comp Struct
Rank 3:
FASE: Fund Appr to Soft Eng
APSEC: Asia-Pacific S/E Conf
PAP/PACT: Practical Aspects of PROLOG/Constraint Tech
ALP: Intl Conf on Algebraic and Logic Programming
PLILP: Prog, Lang Implentation & Logic Programming
LOPSTR: Intl Workshop on Logic Prog Synthesis & Transf
ICCC: Intl Conf on Compiler Construction
COMPSAC: Intl. Computer S/W and Applications Conf
CSM: Conf on Software Maintenance
TAPSOFT: Intl Joint Conf on Theory & Pract of S/W Dev
WCRE: SIGSOFT Working Conf on Reverse Engineering
AQSDT: Symp on Assessment of Quality S/W Dev Tools
IFIP Intl Conf on Open Distributed Processing
Intl Conf of Z Users
IFIP Joint Int'l Conference on Formal Description Techniques and Protocol Specification, Testing, And Verificati
PSI (Ershov conference)
UML: International Conference on the Unified Modeling Language
EDOC: IEEE Conference on Enterprise Computing (Suggested by Roy Grønmo.)
Australian Software Engineering Conference
IEEE Int. W'shop on Object-oriented Real-time Dependable Sys. (WORDS)
IEEE International Symposium on High Assurance Systems Engineering
The Northern Formal Methods Workshops
Formal Methods Pacific
Int. Workshop on Formal Methods for Industrial Critical Systems
JFPLC - International French Speaking Conference on Logic and Constraint Programming
L&L - Workshop on Logic and Learning
SFP - Scottish Functional Programming Workshop
HASKELL - Haskell Workshop
LCCS - International Workshop on Logic and Complexity in Computer Science
VLFM - Visual Languages and Formal Methods
NASA LaRC Formal Methods Workshop
(1) FATES - A Satellite workshop on Formal Approaches to Testing of Software
(1) Workshop On Java For High-Performance Computing
(1) DSLSE - Domain-Specific Languages for Software Engineering
(1) FTJP - Workshop on Formal Techniques for Java Programs
(*) WFLP - International Workshop on Functional and (Constraint) Logic Programming
(*) FOOL - International Workshop on Foundations of Object-Oriented L anguages
(*) SREIS - Symposium on Requirements Engineering for Information Sec urity
(*) HLPP - International workshop on High-level parallel programming and applications
(*) INAP - International Conference on Applications of Prolog
(*) MPOOL - Workshop on Multiparadigm Programming with OO Languages
(*) PADO - Symposium on Programs as Data Objects
(*) TOOLS: Int'l Conf Technology of Object-Oriented Languages and Systems
(*) Australasian Conference on Parallel And Real-Time Systems

AREA: Algorithms and Theory

Rank 1:
STOC: ACM Symp on Theory of Computing
FOCS: IEEE Symp on Foundations of Computer Science
COLT: Computational Learning Theory
LICS: IEEE Symp on Logic in Computer Science
SCG: ACM Symp on Computational Geometry
SODA: ACM/SIAM Symp on Discrete Algorithms
SPAA: ACM Symp on Parallel Algorithms and Architectures
PODC: ACM Symp on Principles of Distributed Computing
ISSAC: Intl. Symp on Symbolic and Algebraic Computation
CRYPTO: Advances in Cryptology
EUROCRYPT: European Conf on Cryptography
Rank 2:
CONCUR: International Conference on Concurrency Theory
ICALP: Intl Colloquium on Automata, Languages and Prog
STACS: Symp on Theoretical Aspects of Computer Science
CC: IEEE Symp on Computational Complexity
WADS: Workshop on Algorithms and Data Structures
MFCS: Mathematical Foundations of Computer Science
SWAT: Scandinavian Workshop on Algorithm Theory
ESA: European Symp on Algorithms
IPCO: MPS Conf on integer programming & comb optimization
LFCS: Logical Foundations of Computer Science
ALT: Algorithmic Learning Theory
EUROCOLT: European Conf on Learning Theory
WDAG: Workshop on Distributed Algorithms
ISTCS: Israel Symp on Theory of Computing and Systems
ISAAC: Intl Symp on Algorithms and Computation
FST&TCS: Foundations of S/W Tech & Theoretical CS
LATIN: Intl Symp on Latin American Theoretical Informatics
RECOMB: Annual Intl Conf on Comp Molecular Biology
CADE: Conf on Automated Deduction
IEEEIT: IEEE Symposium on Information Theory
Rank 3:
MEGA: Methods Effectives en Geometrie Algebrique
ASIAN: Asian Computing Science Conf
CCCG: Canadian Conf on Computational Geometry
FCT: Fundamentals of Computation Theory
WG: Workshop on Graph Theory
CIAC: Italian Conf on Algorithms and Complexity
ICCI: Advances in Computing and Information
AWTI: Argentine Workshop on Theoretical Informatics
CATS: The Australian Theory Symp
COCOON: Annual Intl Computing and Combinatorics Conf
UMC: Unconventional Models of Computation
MCU: Universal Machines and Computations
GD: Graph Drawing
SIROCCO: Structural Info & Communication Complexity
ALEX: Algorithms and Experiments
ALG: ENGG Workshop on Algorithm Engineering
LPMA: Intl Workshop on Logic Programming and Multi-Agents
EWLR: European Workshop on Learning Robots
CITB: Complexity & info-theoretic approaches to biology
FTP: Intl Workshop on First-Order Theorem Proving (FTP)
CSL: Annual Conf on Computer Science Logic (CSL)
AAAAECC: Conf On Applied Algebra, Algebraic Algms & ECC
DMTCS: Intl Conf on Disc Math and TCS
Information Theory Workshop

AREA: Data Bases

Rank 1:
SIGMOD: ACM SIGMOD Conf on Management of Data
PODS: ACM SIGMOD Conf on Principles of DB Systems
VLDB: Very Large Data Bases
ICDE: Intl Conf on Data Engineering
ICDT: Intl Conf on Database Theory
Rank 2:
SSD: Intl Symp on Large Spatial Databases
DEXA: Database and Expert System Applications
FODO: Intl Conf on Foundation on Data Organization
EDBT: Extending DB Technology
DOOD: Deductive and Object-Oriented Databases
DASFAA: Database Systems for Advanced Applications
CIKM: Intl. Conf on Information and Knowledge Management
SSDBM: Intl Conf on Scientific and Statistical DB Mgmt
CoopIS - Conference on Cooperative Information Systems
ER - Intl Conf on Conceptual Modeling (ER)
Rank 3:
COMAD: Intl Conf on Management of Data
BNCOD: British National Conference on Databases
ADC: Australasian Database Conference
ADBIS: Symposium on Advances in DB and Information Systems
DaWaK - Data Warehousing and Knowledge Discovery
RIDE Workshop
IFIP-DS: IFIP-DS Conference
IFIP-DBSEC - IFIP Workshop on Database Security
NGDB: Intl Symp on Next Generation DB Systems and Apps
ADTI: Intl Symp on Advanced DB Technologies and Integration
FEWFDB: Far East Workshop on Future DB Systems
MDM - Int. Conf. on Mobile Data Access/Management (MDA/MDM)
ICDM - IEEE International Conference on Data Mining
VDB - Visual Database Systems
IDEAS - International Database Engineering and Application Symposium
ARTDB - Active and Real-Time Database Systems
CODAS: Intl Symp on Cooperative DB Systems for Adv Apps
DBPL - Workshop on Database Programming Languages
EFIS/EFDBS - Engineering Federated Information (Database) Systems
KRDB - Knowledge Representation Meets Databases
NDB - National Database Conference (China)
NLDB - Applications of Natural Language to Data Bases
KDDMBD - Knowledge Discovery and Data Mining in Biological Databases Meeting
FQAS - Flexible Query-Answering Systems
IDC(W) - International Database Conference (HK CS)
RTDB - Workshop on Real-Time Databases
SBBD: Brazilian Symposium on Databases
WebDB - International Workshop on the Web and Databases
WAIM: Interational Conference on Web Age Information Management
(1) DASWIS - Data Semantics in Web Information Systems
(1) DMDW - Design and Management of Data Warehouses
(1) DOLAP - International Workshop on Data Warehousing and OLAP
(1) DMKD - Workshop on Research Issues in Data Mining and Knowledge Discovery
(1) KDEX - Knowledge and Data Engineering Exchange Workshop
(1) NRDM - Workshop on Network-Related Data Management
(1) MobiDE - Workshop on Data Engineering for Wireless and Mobile Access
(1) MDDS - Mobility in Databases and Distributed Systems
(1) MEWS - Mining for Enhanced Web Search
(1) TAKMA - Theory and Applications of Knowledge MAnagement
(1) WIDM: International Workshop on Web Information and Data Management
(1) W2GIS - International Workshop on Web and Wireless Geographical Information Systems
* CDB - Constraint Databases and Applications
* DTVE - Workshop on Database Technology for Virtual Enterprises
* IWDOM - International Workshop on Distributed Object Management
* IW-MMDBMS - Int. Workshop on Multi-Media Data Base Management Systems
* OODBS - Workshop on Object-Oriented Database Systems
* PDIS: Parallel and Distributed Information Systems

--- The original source is :

Sunday, November 05, 2006

Useful patter for data mining software development(I)

STRATEGY is a good candidate since it encourages HAS-A relationships. A data mining algorithm will HAS various parts, including setting, input, output, variable list ... Besides, it enables us to swith dynamically among specific implementation choices. For example, we can choose equal-interval binning, or we can use equal-frequency binning, or dynamic benning method. I find cons and pros about applying STRTEGY online, sharing with you here:

Benefits in using Strategy Pattern
- A family of algorithms can be defined as a class hierarchy and can be used interchangeably to alter application behavior without changing its architecture.
- By encapsulating the algorithm separately, new algorithms complying with the same interface can be easily introduced.
- The application can switch strategies at run-time.
- Strategy enables the clients to choose the required algorithm, without using a "switch" statement or a series of "if-else" statements.
- Data structures used for implementing the algorithm is completely encapsulated in Strategy classes. Therefore, the implementation of an algorithm can be changed without affecting the Context class.
- Strategy Pattern can be used instead of sub-classing the Context class. Inheritance hardwires the behavior with the Context and the behavior cannot be changed dynamically.
- The same Strategy object can be strategically shared between different Context objects. However, the shared Strategy object should not maintain states across invocations.
Drawbacks in using Strategy Pattern
- The application must be aware of all the strategies to select the right one for the right situation.
- Strategy and Context classes may be tightly coupled. The Context must supply the relevant data to the Strategy for implementing the algorithm and sometimes, all the data passed by the Context may not be relevant to all the Concrete Strategies.
- Context and the Strategy classes normally communicate through the interface specified by the abstract Strategy base class. Strategy base class must expose interface for all the required behaviors, which some concrete Strategy classes might not implement.
- In most cases, the application configures the Context with the required Strategy object. Therefore, the application needs to create and maintain two objects in place of one.
Since, the Strategy object is created by the application in most cases; the Context has no control on lifetime of the Strategy object. However, the Context can make a local copy of the Strategy object. But, this increases the memory requirement and has a sure performance impact.


It is to be hold in Beijing next year. For anyone who is interested with data management and data mining related research, you can visit and submit something. SIGMOD 2007 welcome both research and industry track papers and demos. Link to SIGMOD'07