ICDM '05 Accepted Papers


Out of 630 paper submissions, the Program Committee selected 69 regular papers and 72 short papers for presentation at the conference.

ICDM '05 Regular Papers (69)

220 "Training Support Vector Machines using Gilbert's Algorithm" Shawn Martin
229 "Online Hierarchical Clustering in a Data Warehouse Environment" Elke Achtert, Christian Böhm, Hans-Peter Kriegel, and Peer Kröger
234 "A Generic Framework for Efficient Subspace Clustering of High-Dimensional Data" Hans-Peter Kriegel, Peer Kröger, Matthias Renz, and Sebastian Wurst
245 "Approximate Inverse Frequent Itemset Mining: Privacy, Complexity, and Approximation" Yongge Wang and Xintao Wu
274 "Finding the Most Unusual Time Series Subsequence: Algorithms and Applications" Eamonn Keogh, Jessica Lin, and Ada Fu
275 "Making Subsequence Time Series Clustering Meaningful" Jason Chen
276 "Finding Representative Set from Massive Data" Feng Pan, Wei Wang, Anthony K. H. Tung, and Jiong Yang
305 "Supervised Tensor Learning" Dacheng Tao, Xuelong Li, Weiming Hu, Stephen Maybank, and Xindong Wu
309 "Combining Multiple Clustering by Soft Correspondence" Bo Long, Zhongfei (Mark) Zhang, and Philip S. Yu
310 "eMailSift: Email Classification Based on Structure and Content" Manu Aery and Sharma Chakravarthy
314 "A Random Walk through Human Associations" Raz Tamir
316 "A Border-Based Approach for Hiding Sensitive Frequent Itemsets" Xingzhi Sun and Philip S. Yu
326 "Alternate Representation of Distance Matrices for Characterization of Protein Structure" Keith Marsolo and Srinivasan Parthasarathy
332 "CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns" Carson Kai-Sang Leung, Quamrul I. Khan, and Tariqul Hoque
334 "Summarization - Compressing Data into an Informative Representation" Varun Chandola and Vipin Kumar
337 "Learning Instance Greedily Cloning Naive Bayes for Ranking" Liangxiao Jiang and Harry Zhang
348 "Efficient Query Filtering for Streaming Time Series" Li Wei, Eamonn Keogh, Helga Van Herle, and Agenor Mafra-Neto
362 "Mining Frequent Spatio-temporal Sequential Patterns" Huiping Cao, Nikos Mamoulis, and David W. Cheung
367 "Template-Based Privacy Preservation in Classification Problems" Ke Wang, Benjamin C. M. Fung, and Philip S. Yu
376 "A Bernoulli Relational Model for Nonlinear Embedding" Gang Wang and Frederick Lochovsky
411 "Bifold Constraint-Based Mining by Simultaneous Monotone and Anti-Monotone Checking" Mohammad El-Hajj, Osmar Zaiane, and Paul Nalos
418 "SVM Feature Selection for Classification of SPECT Images of Alzheimer's Disease using Spatial Information" Glenn Fung and Jonathan Stoeckel
421 "Sharing Classifiers among Ensembles from Related Problem Domains" Yi Zhang, Nick Street, and Samuel Burer
425 "A Visual Data Mining Framework for Convenient Identification of Useful Knowledge" Kaidi Zhao, Bing Liu, Thomas Tirpak, and Weimin Xiao
430 "Balancing Exploration and Exploitation: A New Algorithm for Active Machine Learning" Thomas Osugi, Deng Kun, and Stephen Scott
439 "AMIOT: Induced Ordered Tree Mining in Tree-structured Databases" Shohei Hido and Hiroyuki Kawano
453 "Parameter-Free Spatial Data Mining Using MDL" Spiros Papadimitriou, Aristides Gionis, Panayiotis Tsaparas, Heikki Mannila, and Christos Faloutsos
455 "Neighborhood Formation and Anomaly Detection in Bipartite Graph" Jimeng Sun, Huiming Qu, Deepayan Chakrabarti, and Christos Faloutsos
457 "Efficient Text Classification by Weighted Proximal SVM" Dong Zhuang, Benyu Zhang, Qiang Yang, and Zheng Chen
469 "Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints" Xiaonan Ji, James Bailey, and Guozhu Dong
479 "Learning Functional Dependency Networks based on Genetic Programming" Wing Ho Shum, Kwong Sak Leung, and Man Leung Wong
493 "Usage-based PageRank for Web Personalization" Magdalini Eirinaki and Michalis Vazirgiannis
511 "A New Algorithm for Finding Minimal Sample Uniques for Use in Statistical Disclosure Assessment" Anna Manning and David Haglin
542 "Kernel-Density-Based Clustering of Time Series Subsequences Using a Continuous Random-Walk Noise Model" Anne Denton
547 "Handling Generalized Cost Functions in the Partitioning Optimization Problem Through Sequential Binary Programming" Alan Abrahams, Adrian Becker, Daniel Fleder, and Ian MacMillan
548 "Shortest-Path Kernels on Graphs" Karsten Borgwardt and Hans-Peter Kriegel
552 "Effective and Efficient Distributed Model-based Clustering" Hans-Peter Kriegel, Peer Kröger, Alexey Pryakhin, and Matthias Schubert
553 "Effective Estimation of Posterior Probabilities: Explaining the Accuracy of Randomized Decision Tree Approaches" Wei Fan, Ed Greengrass, Joe McClosky, Philip Yu, and Kevin Drummey
557 "Leveraging Relational Autocorrelation with Latent Group Models" Jennifer Neville and David Jensen
559 "Compound Classification Models for Recommender Systems" Lars Schmidt-Thieme
571 "On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams" Peng Wang, Haixun Wang, Xiaochen Wu, Wei Wang, and Baile Shi
578 "An Empirical Bayes Approach to Detect Anomalies in Dynamic Multidimensional Arrays" Deepak Agarwal
579 "Stability of Feature Selection Algorithms" Alexandros Kalousis, Julien Prados, and Melanie Hilario
585 "Using Information-theoretic Measures to Assess Association Rule Interestingness" Julien Blanchard, Fabrice Guillet, Regis Gras, and Henri Briand
594 "Orthogonal Neighborhood Preserving Projections" Effrosyni Kokiopoulou and Yousef Saad
596 "An Algorithm for In-Core Frequent Itemset Mining on Streaming Data" Ruoming Jin and Gagan Agrawal
627 "A Thorough Experimental Study of Datasets for Frequent Itemsets" Frédéric Flouvat, Fabien De Marchi, and Jean-Marc Petit
638 "Hierarchy-Regularized Latent Semantic Indexing" Yi Huang, Kai Yu, Matthias Schubert, Shipeng Yu, and Hans-Peter Kriegel
646 "Discriminatively Trained Markov Model for Sequence Classification" Oksana Yakhnenko, Adrian Silvescu, and Vasant Honavar
647 "Discovering Frequent Arrangements of Temporal Intervals" Panagiotis Papapetrou, George Kollios, Stan Sclaroff, and Dimitrios Gunopulos
662 "A Heterogeneous Field Matching Method for Record Linkage" Steven Minton, Claude Nanjo, Craig Knoblock, Martin Michalowski, and Matthew Michelson
672 "Higher-Order Web Link Analysis Using Multilinear Algebra" Tamara Kolda, Brett Bader, and Joseph Kenny
697 "The Parameterized Complexity of Enumerating All, All Maximal, and All Closed Frequent Itemsets" Matthew Hamilton, Rhonda Chaytor, and Todd Wareham
710 "Improving Automatic Query Classification via Semi-supervised Learning" Steven Beitzel, Eric Jensen, David Lewis, Abdur Chowdhury, Aleksander Kolcz, and Ophir Frieder
731 "Extracting Frequent Subsequences from a Single Long Data Sequence: A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm" Koji Iwanuma, Ryuichi Ishihara, Yo Takano, and Hidetomo Nabeshima
742 "Modeling Multiple Time Series for Anomaly Detection" Philip Chan and Matthew Mahoney
745 "Mining Patterns of Change in Remote Sensing Image Databases" Marcelino Pereira S. Silva, Gilberto Câmara, Ricardo Cartaxo M. Souza, Dalton M. Valeriano, and Maria Isabel S. Escada
746 "X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs when the Number of Mixtures is Unknown" Zoltán Szamonek and Csaba Szepesvári
753 "Integrating Hidden Markov Models and Spectral Analysis for Sensory Time Series Clustering" Jie Yin and Qiang Yang
769 "Ranking-Based Evaluation of Regression Models" Saharon Rosset, Claudia Perlich, and Bianca Zadrozny
771 "ViVo: Visual Vocabulary Construction for Mining Biomedical Images" Arnab Bhattacharya, Vebjorn Ljosa, Jia-Yu Pan, Mark Verardo, Hyunjeong Yang, Christos Faloutsos, and Ambuj K. Singh
786 "Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping" Mikhail Bilenko, Sugato Basu, and Mehran Sahami
794 "A Framework of Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values" Hung-Leng Chen, Kun-Ta Chuang, and Ming-Syan Chen
800 "Finding Maximal Frequent Itemsets over Online Data Streams Adaptively" Daesu Lee and Wonsuk Lee
806 "Multi-Stage Classification" Ted Senator
813 "Classifier Fusion Using Shared Sampling Distribution For Boosting" Costin Barbu, Raja Iqbal, and Jing Peng
817 "Discriminant Analysis: A Unified Approach" Peng Zhang, Jing Peng, and Norbert Riedel
819 "Generalizing the Notion of Confidence" Michael Steinbach and Vipin Kumar
831 "WARP: Time Warping for Periodicity Detection" Mohamed Elfeky, Walid Aref, and Ahmed Elmagarmid

ICDM '05 Short Papers (72)

208 "Gradual Model Generator for Single-pass Clustering" Ismo Kärkkäinen and Pasi Fränti
237 "Efficiently Mining Frequent Closed Partial Orders" Jian Pei, Jian Liu, Haixun Wang, Ke Wang, Philip S. Yu, and Jianyong Wang
243 "Face Recognition Using Landmark-based Bidimensional Regression" Jiazheng Shi, Ashok Samal, and David Marx
246 "Speculative Markov Blanket Discovery for Optimal Feature Selection" Sandeep Yaramakala and Dimitris Margaritis
250 "A Cooperative Data Mining Approach and Its Application to Early Diabetes Detection" Jie Gao, Joerg Denzinger, and Robert C. James
270 "An Improved Categorization of Classifier's Sensitivity on Sample Selection Bias" Wei Fan, Ian Davidson, Bianca Zadrozny, and Philip S. Yu
283 "Making Logistic Regression A Core Data Mining Tool" Paul Komarek and Andrew Moore
294 "Supervised Ordering - An Empirical Survey" Toshihiro Kamishima, Hideto Kazawa, and Shotaro Akaho
302 "Partial Elastic Matching of Time Series" Longin Jan Latecki, Vasileios Megalooikonomou, Qiang Wang, Rolf Lakaemper, Chotirat Ann Ratanamahatana, and Eamonn Keogh
335 "Text Representation and Dimension Reduction: from Vector to Tensor" Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Fengshan Bai, and Qiansheng Cheng
340 "Obtaining Best Parameter Values for Accurate Classification" Frans Coenen and Paul Leng
345 "Hierarchical Density-Based Clustering of Uncertain Data" Hans-Peter Kriegel and Martin Pfeifle
350 "Bagging with Adaptive Costs" Yi Zhang and Nick Street
351 "Spatial Clustering Of Chimpanzee Locations For Neighborhood Identification" Sandeep Mane, Carson Murray, Shashi Shekhar, Jaideep Srivastava, and Anne Pusey
356 "Sequential Pattern Mining in Multiple Data Streams" Gong Chen, Xindong Wu, and Xingquan Zhu
357 "Bit Reduction Support Vector Machine" Lawrence Hall, Tong Luo, Dmitry Goldgof, and Andrew Remsen
365 "CloseMiner: Discovering Frequent Closed Itemsets using Frequent Closed Tidsets" Gourakishwar Ningthoujam, Ranbir Sanasam, and Anjana Kakoti
420 "A Levelwise Search Algorithm for Interesting Subspace Clusters" Haiyun Bian and Raj Bhatnagar
423 "On Feature Selection through Clustering" Richard Butterworth, Gregory Piatetsky-Shapiro, and Dan Simovici
426 "Focused Community Discovery" Kirsten Hildrum and Philip Yu
432 "An Optimal Linear Time Algorithm for Quasi-Monotonic Segmentation" Daniel Lemire, Martin Brooks, and Yuhong Yan
434 "Mining Patterns That Respond to Actions" Yuelong Jiang, Ke Wang, Alexander Tuzhilin, and Ada Wai-Chee Fu
435 "Feature Selection for Building Cost-Effective Data Stream Classifiers" Like Gao and X. Sean Wang
449 "Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience" Abraham Bagherjeiran, Christoph Eick, Chun-Sheng Chen, and Ricardo Vilalta
466 "On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis" Kiyoung Yang and Cyrus Shahabi
478 "Text Classification with Evolving Label-sets" Shantanu Godbole, Ganesh Ramakrishnan, and Sunita Sarawagi
483 "On the Complexity of Rule Discovery from Distributed Data" Martin Scholz
494 "Privacy-Preserving Frequent Pattern Mining Across Private Databases" Ada Wai-Chee Fu, Raymond Chi-Wing Wong, and Ke Wang
501 "A Computational Framework for Taxonomic Research: Diagnosing Body Shape within Fish Species Complexes" Yixin Chen, Henry Bart, Shuqing Huang, and Huimin Chen
508 "Suppressing Data Sets to Prevent Discovery of Association Rules" Ayca Azgin Hintoglu, Ali Inan, Yucel Saygin, and Mehmet Keskinoz
518 "Efficient Mining of High Branching Factor Attribute Trees" Alexandre Termier, Marie-Christine Rousset, Michele Sebag, Kouzou Ohara, Takashi Washio, and Hiroshi Motoda
521 "Mining Quantitative Frequent Itemsets Using Adaptive Density-based Subspace Clustering" Takashi Washio, Yuki Mitsunaga, and Hiroshi Motoda
533 "CTC - Correlating Tree Patterns for Classification" Albrecht Zimmermann and Bjoern Bringmann
535 "Mining Ontological Knowledge from Domain-Specific Text Documents" Xing Jiang and Ah-Hwee Tan
550 "Anomaly Intrusion Detection using Multi-Objective Genetic Fuzzy System and Agent-based Evolutionary Computation Framework" Chi-Ho Tsang, Sam Kwong, and Hanli Wang
562 "Blocking Anonymity Threats Raised by Frequent Itemset Mining" Maurizio Atzori, Francesco Bonchi, Fosca Giannotti, and Dino Pedreschi
568 "A Framework for Semi-supervised Learning based on Subjective and Objective Clustering Criteria" Maria Halkidi, Dimitrios Gunopulos, Nitin Kumar, Michalis Vazirgiannis, and Carlotta Domeniconi
573 "Categorization and Keyword Identification of Unlabeled Documents" Ning Kang, Carlotta Domeniconi, and Daniel Barbara
576 "Mining Approximate Frequent Itemset from Noisy Data" Jinze Liu, Susan Paulsen, Wei Wang, Andrew Nobel, and Jan Prins
580 "Segment-Based Injection Attacks against Collaborative Recommender Systems" Robin Burke, Bamshad Mobasher, Runa Bhaumik, and Chad Williams
584 "A Rule Evaluation Support Method with Learning Models Based on" Hidenao Abe, Shusaku Tsumoto, Miho Ohsaki, and Takahira Yamaguchi
587 "Predicting Software Escalations with Maximum ROI" Charles X. Ling, Shengli Sheng, Tilmann Bruckhaus, and Nazim H. Madhavji
590 "A Graph-Based Ranking Algorithm for Geo-Referencing Documents" Bruno Martins and Mário Silva
607 "Optimizing Constraint-Based Mining by Automatically Relaxing Constraints" Arnaud Soulet and Bruno Crémilleux
608 "Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis" Tsuyoshi Ide
615 "Average Number of Frequent (Closed) Patterns in Bernoulli and Markovian Databases" Loick Lhote, Francois Rioult, and Arnaud Soulet
618 "Process Diagnosis via Electrical-Wafer-Sorting Maps Classification" Federico Di Palma, Giuseppe De Nicolao, and Guido Miraglia
645 "Merging Interface Schemas on the Deep Web via Clustering Aggregation" Wensheng Wu, AnHai Doan, and Clement Yu
650 "Instability of Classifiers on Categorical Data" Arno Siebes, Muhammad Subianto, and Ad Feelders
657 "Economical Active Feature-value Acquisition through Expected Utility Estimation" Prem Melville, Maytal Saar-Tsechansky, Foster Provost, and Raymond Mooney
664 "Pruning Social Networks Using Structural Properties and Descriptive Attributes" Lisa Singh, Lise Getoor, and Louis Licamele
667 "Bias Analysis in Text Classification for Highly Skewed Data" Lei Tang and Huan Liu
679 "Fast Frequent String Mining Using Suffix Arrays" Johannes Fischer, Volker Heun, and Stefan Kramer
681 "Semi-supervised Mixture of Kernels via LPBoost Methods" Jinbo Bi, Glenn Fung, Murat Dundar, and Bharat Rao
682 "A Preference Model for Structured Supervised Learning Tasks" Fabio Aiolli
696 "Dynamic Ensemble Re-Construction for Better Ranking" Jin Huang and Charles X. Ling
704 "A Join-less Approach for Co-location Pattern Mining: A Summary of Results" Jin Soung Yoo, Shashi Shekhar, and Mete Celik
714 "Mining Chains of Relations" Aristides Gionis, Foto Afrati, Gautam Das, Heikki Mannila, Taneli Mielikainen, and Panayiotis Tsaparas
724 "Learning through Changes: An Empirical Study of Dynamic Behaviors of Probability Estimation Trees" Kun Zhang, Zujia Xu, Jing Peng, and Bill Buckles
740 "Example-Based Robust Outlier Detection in High Dimensional Datasets" Cui Zhu, Hiroyuki Kitagawa, and Christos Faloutsos
765 "CLUMP: A Scalable and Robust Framework for Structure Discovery" Kunal Punera and Joydeep Ghosh
767 "Parallel Algorithms for Distance-Based and Density-Based Outliers" Elio Lozano and Edgar Acuna
777 "FS3: A Random Walk based Free-Form Spatial Scan Statistic for Anomalous Window Detection" Vandana Janeja and Vijayalakshmi Atluri
785 "Hot Item Mining and Summarization from Multiple Auction Web Sites" Tak-Lam Wong and Wai Lam
789 "A Scalable Collaborative Filtering Framework based on Co-clustering" Thomas George and Srujana Merugu
791 "Triple Jump Acceleration for the EM Algorithm" Han-Shen Huang, Chun-Nan Hsu, and Bou-Ho Yang
797 "Visualizing Global Manifold Based on Distributed Local Data Abstraction" Xiaofeng Zhang and William K. Cheung
804 "On Learning Asymmetric Dissimilarity Measures" Krishna Kummamuru, Raghu Krishnapuram, and Rakesh Agrawal
805 "Semi-Supervised Clustering with Metric Learning using Relative Comparisons" Nimit Kumar, Krishna Kummamuru, and Deepa Paranjpe
812 "CLUGO: A Clustering Algorithm for Automated Functional Annotations Based on Gene Ontology" In-Yee Lee, Jan-Ming Ho, and Ming-Syan Chen
821 "A Random Rotation Perturbation Approach to Privacy Preserving Data Classification" Keke Chen and Ling Liu
823 "Automatically Mining Result Records from Search Engine Response Pages" Dheerendranath Mundluru