| 220 "Training Support Vector Machines using Gilbert's Algorithm" Shawn Martin |
| 229 "Online Hierarchical Clustering in a Data Warehouse Environment" Elke Achtert, Christian Böhm, Hans-Peter Kriegel, and Peer Kröger |
| 234 "A Generic Framework for Efficient Subspace Clustering of High-Dimensional Data" Hans-Peter Kriegel, Peer Kröger, Matthias Renz, and Sebastian Wurst |
| 245 "Approximate Inverse Frequent Itemset Mining: Privacy, Complexity, and Approximation" Yongge Wang and Xintao Wu |
| 274 "Finding the Most Unusual Time Series Subsequence: Algorithms and Applications" Eamonn Keogh, Jessica Lin, and Ada Fu |
| 275 "Making Subsequence Time Series Clustering Meaningful" Jason Chen |
| 276 "Finding Representative Set from Massive Data" Feng Pan, Wei Wang, Anthony K. H. Tung, and Jiong Yang |
| 305 "Supervised Tensor Learning" Dacheng Tao, Xuelong Li, Weiming Hu, Stephen Maybank, and Xindong Wu |
| 309 "Combining Multiple Clustering by Soft Correspondence" Bo Long, Zhongfei (Mark) Zhang, and Philip S. Yu |
| 310 "eMailSift: Email Classification Based on Structure and Content" Manu Aery and Sharma Chakravarthy |
| 314 "A Random Walk through Human Associations" Raz Tamir |
| 316 "A Border-Based Approach for Hiding Sensitive Frequent Itemsets" Xingzhi Sun and Philip S. Yu |
| 326 "Alternate Representation of Distance Matrices for Characterization of Protein Structure" Keith Marsolo and Srinivasan Parthasarathy |
| 332 "CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns" Carson Kai-Sang Leung, Quamrul I. Khan, and Tariqul Hoque |
| 334 "Summarization - Compressing Data into an Informative Representation" Varun Chandola and Vipin Kumar |
| 337 "Learning Instance Greedily Cloning Naive Bayes for Ranking" Liangxiao Jiang and Harry Zhang |
| 348 "Efficient Query Filtering for Streaming Time Series" Li Wei, Eamonn Keogh, Helga Van Herle, and Agenor Mafra-Neto |
| 362 "Mining Frequent Spatio-temporal Sequential Patterns" Huiping Cao, Nikos Mamoulis, and David W. Cheung |
| 367 "Template-Based Privacy Preservation in Classification Problems" Ke Wang, Benjamin C. M. Fung, and Philip S. Yu |
| 376 "A Bernoulli Relational Model for Nonlinear Embedding" Gang Wang and Frederick Lochovsky |
| 411 "Bifold Constraint-Based Mining by Simultaneous Monotone and Anti-Monotone Checking" Mohammad El-Hajj, Osmar Zaiane, and Paul Nalos |
| 418 "SVM Feature Selection for Classification of SPECT Images of Alzheimer's Disease using Spatial Information" Glenn Fung and Jonathan Stoeckel |
| 421 "Sharing Classifiers among Ensembles from Related Problem Domains" Yi Zhang, Nick Street, and Samuel Burer |
| 425 "A Visual Data Mining Framework for Convenient Identification of Useful Knowledge" Kaidi Zhao, Bing Liu, Thomas Tirpak, and Weimin Xiao |
| 430 "Balancing Exploration and Exploitation: A New Algorithm for Active Machine Learning" Thomas Osugi, Kun Deng, and Stephen Scott |
| 439 "AMIOT: Induced Ordered Tree Mining in Tree-structured Databases" Shohei Hido and Hiroyuki Kawano |
| 453 "Parameter-Free Spatial Data Mining Using MDL" Spiros Papadimitriou, Aristides Gionis, Panayiotis Tsaparas, Heikki Mannila, and Christos Faloutsos |
| 455 "Neighborhood Formation and Anomaly Detection in Bipartite Graph" Jimeng Sun, Huiming Qu, Deepayan Chakrabarti, and Christos Faloutsos |
| 457 "Efficient Text Classification by Weighted Proximal SVM" Dong Zhuang, Benyu Zhang, Qiang Yang, and Zheng Chen |
| 469 "Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints" Xiaonan Ji, James Bailey, and Guozhu Dong |
| 479 "Learning Functional Dependency Networks based on Genetic Programming" Wing Ho Shum, Kwong Sak Leung, and Man Leung Wong |
| 493 "Usage-based PageRank for Web Personalization" Magdalini Eirinaki and Michalis Vazirgiannis |
| 511 "A new algorithm for finding Minimal Sample Uniques for use in Statistical Disclosure Assessment" Anna Manning and David Haglin |
| 542 "Kernel-Density-Based Clustering of Time Series Subsequences Using a Continuous Random-Walk Noise Model" Anne Denton |
| 547 "Handling Generalized Cost Functions in the Partitioning Optimization Problem Through Sequential Binary Programming" Alan Abrahams, Adrian Becker, Daniel Fleder, and Ian MacMillan |
| 548 "Shortest-path kernels on graphs" Karsten Borgwardt and Hans-Peter Kriegel |
| 552 "Effective and Efficient Distributed Model-based Clustering" Hans-Peter Kriegel, Peer Kröger, Alexey Pryakhin, and Matthias Schubert |
| 553 "Effective Estimation of Posterior Probabilities: Explaining the Accuracy of Randomized Decision Tree Approaches" Wei Fan, Ed Greengrass, Joe McClosky, Philip Yu, and Kevin Drummey |
| 557 "Leveraging Relational Autocorrelation with Latent Group Models" Jennifer Neville and David Jensen |
| 559 "Compound Classification Models for Recommender Systems" Lars Schmidt-Thieme |
| 571 "On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams" Peng Wang, Haixun Wang, Xiaochen Wu, Wei Wang, and Baile Shi |
| 578 "An Empirical Bayes Approach to Detect Anomalies in Dynamic Multidimensional Arrays" Deepak Agarwal |
| 579 "Stability of Feature Selection Algorithms" Alexandros Kalousis, Julien Prados, and Melanie Hilario |
| 585 "Using Information-theoretic Measures to Assess Association Rule Interestingness" Julien Blanchard, Fabrice Guillet, Regis Gras, and Henri Briand |
| 594 "Orthogonal Neighborhood Preserving Projections" Effrosyni Kokiopoulou and Yousef Saad |
| 596 "An Algorithm for In-Core Frequent Itemset Mining on Streaming Data" Ruoming Jin and Gagan Agrawal |
| 627 "A thorough experimental study of datasets for frequent itemsets" Frédéric Flouvat, Fabien De Marchi, and Jean-Marc Petit |
| 638 "Hierarchy-Regularized Latent Semantic Indexing" Yi Huang, Kai Yu, Matthias Schubert, Shipeng Yu, and Hans-Peter Kriegel |
| 646 "Discriminatively Trained Markov Model for Sequence Classification" Oksana Yakhnenko, Adrian Silvescu, and Vasant Honavar |
| 647 "Discovering Frequent Arrangements of Temporal Intervals" Panagiotis Papapetrou, George Kollios, Stan Sclaroff, and Dimitrios Gunopulos |
| 662 "A Heterogeneous Field Matching Method for Record Linkage" Steven Minton, Claude Nanjo, Craig Knoblock, Martin Michalowski, and Matthew Michelson |
| 672 "Higher-Order Web Link Analysis Using Multilinear Algebra" Tamara Kolda, Brett Bader, and Joseph Kenny |
| 697 "The Parameterized Complexity of Enumerating All, All Maximal, and All Closed Frequent Itemsets" Matthew Hamilton, Rhonda Chaytor, and Todd Wareham |
| 710 "Improving Automatic Query Classification via Semi-supervised Learning" Steven Beitzel, Eric Jensen, David Lewis, Abdur Chowdhury, Aleksander Kolcz, and Ophir Frieder |
| 731 "Extracting Frequent Subsequences from a Single Long Data Sequence: A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm" Koji Iwanuma, Ryuichi Ishihara, Yo Takano, and Hidetomo Nabeshima |
| 742 "Modeling Multiple Time Series for Anomaly Detection" Philip Chan and Matthew Mahoney |
| 745 "Mining Patterns of Change in Remote Sensing Image Databases" Marcelino Pereira S. Silva, Gilberto Câmara, Ricardo Cartaxo M. Souza, Dalton M. Valeriano, and Maria Isabel S. Escada |
| 746 "X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs when the Number of Mixtures is Unknown" Zoltán Szamonek and Csaba Szepesvári |
| 753 "Integrating Hidden Markov Models and Spectral Analysis for Sensory Time Series Clustering" Jie Yin and Qiang Yang |
| 769 "Ranking-Based Evaluation of Regression Models" Saharon Rosset, Claudia Perlich, and Bianca Zadrozny |
| 771 "ViVo: Visual Vocabulary Construction for Mining Biomedical Images" Arnab Bhattacharya, Vebjorn Ljosa, Jia-Yu Pan, Mark Verardo, Hyunjeong Yang, Christos Faloutsos, and Ambuj K. Singh |
| 786 "Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping" Mikhail Bilenko, Sugato Basu, and Mehran Sahami |
| 794 "A Framework of Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values" Hung-Leng Chen, Kun-Ta Chuang, and Ming-Syan Chen |
| 800 "Finding Maximal Frequent Itemsets over Online Data Streams Adaptively" Daesu Lee and Wonsuk Lee |
| 806 "Multi-Stage Classification" Ted Senator |
| 813 "Classifier Fusion Using Shared Sampling Distribution For Boosting" Costin Barbu, Raja Iqbal, and Jing Peng |
| 817 "Discriminant Analysis: A Unified Approach" Peng Zhang, Jing Peng, and Norbert Riedel |
| 819 "Generalizing the Notion of Confidence" Michael Steinbach and Vipin Kumar |
| 831 "WARP: Time Warping for Periodicity Detection" Mohamed Elfeky, Walid Aref, and Ahmed Elmagarmid |
| 208 "Gradual Model Generator for Single-pass Clustering" Ismo Kärkkäinen and Pasi Fränti |
| 237 "Efficiently Mining Frequent Closed Partial Orders" Jian Pei, Jian Liu, Haixun Wang, Ke Wang, Philip S. Yu, and Jianyong Wang |
| 243 "Face Recognition Using Landmark-based Bidimensional Regression" Jiazheng Shi, Ashok Samal, and David Marx |
| 246 "Speculative Markov Blanket Discovery for Optimal Feature Selection" Sandeep Yaramakala and Dimitris Margaritis |
| 250 "A Cooperative Data Mining Approach and Its Application to Early Diabetes Detection" Jie Gao, Joerg Denzinger, and Robert C. James |
| 270 "An Improved Categorization of Classifier's Sensitivity on Sample Selection Bias" Wei Fan, Ian Davidson, Bianca Zadrozny, and Philip S. Yu |
| 283 "Making Logistic Regression A Core Data Mining Tool" Paul Komarek and Andrew Moore |
| 294 "Supervised Ordering --- An Empirical Survey" Toshihiro Kamishima, Hideto Kazawa, and Shotaro Akaho |
| 302 "Partial Elastic Matching of Time Series" Longin Jan Latecki, Vasileios Megalooikonomou, Qiang Wang, Rolf Lakaemper, Chotirat Ann Ratanamahatana, and Eamonn Keogh |
| 335 "Text Representation and Dimension Reduction: from Vector to Tensor" Ning Liu, Jun Yan, Benyu Zhang, Zheng Chen, Fengshan Bai, and Qiansheng Cheng |
| 340 "Obtaining Best Parameter Values for Accurate Classification" Frans Coenen and Paul Leng |
| 345 "Hierarchical Density-Based Clustering of Uncertain Data" Hans-Peter Kriegel and Martin Pfeifle |
| 350 "Bagging with Adaptive Costs" Yi Zhang and Nick Street |
| 351 "Spatial Clustering Of Chimpanzee Locations For Neighborhood Identification" Sandeep Mane, Carson Murray, Shashi Shekhar, Jaideep Srivastava, and Anne Pusey |
| 356 "Sequential Pattern Mining in Multiple Data Streams" Gong Chen, Xindong Wu, and Xingquan Zhu |
| 357 "Bit Reduction Support Vector Machine" Lawrence Hall, Tong Luo, Dmitry Goldgof, and Andrew Remsen |
| 365 "CloseMiner: Discovering Frequent Closed Itemsets using Frequent Closed Tidsets" Gourakishwar Ningthoujam, Ranbir Sanasam, and Anjana Kakoti |
| 420 "A Levelwise Search Algorithm for Interesting Subspace Clusters" Haiyun Bian and Raj Bhatnagar |
| 423 "On Feature Selection through Clustering" Richard Butterworth, Gregory Piatetsky-Shapiro, and Dan Simovici |
| 426 "Focused Community Discovery" Kirsten Hildrum and Philip Yu |
| 432 "An Optimal Linear Time Algorithm for Quasi-Monotonic Segmentation" Daniel Lemire, Martin Brooks, and Yuhong Yan |
| 434 "Mining Patterns That Respond to Actions" Yuelong Jiang, Ke Wang, Alexander Tuzhilin, and Ada Wai-Chee Fu |
| 435 "Feature Selection for Building Cost-Effective Data Stream Classifiers" Like Gao and X. Sean Wang |
| 449 "Adaptive Clustering: Obtaining Better Clusters Using Feedback and Past Experience" Abraham Bagherjeiran, Christoph Eick, Chun-Sheng Chen, and Ricardo Vilalta |
| 466 "On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis" Kiyoung Yang and Cyrus Shahabi |
| 478 "Text Classification with Evolving Label-sets" Shantanu Godbole, Ganesh Ramakrishnan, and Sunita Sarawagi |
| 483 "On the Complexity of Rule Discovery from Distributed Data" Martin Scholz |
| 494 "Privacy-Preserving Frequent Pattern Mining Across Private Databases" Ada Wai-Chee Fu, Raymond Chi-Wing Wong, and Ke Wang |
| 501 "A Computational Framework for Taxonomic Research: Diagnosing Body Shape within Fish Species Complexes" Yixin Chen, Henry Bart, Shuqing Huang, and Huimin Chen |
| 508 "Suppressing Data Sets to Prevent Discovery of Association Rules" Ayca Azgin Hintoglu, Ali Inan, Yucel Saygin, and Mehmet Keskinoz |
| 518 "Efficient mining of high branching factor attribute trees" Alexandre Termier, Marie-Christine Rousset, Michele Sebag, Kouzou Ohara, Takashi Washio, and Hiroshi Motoda |
| 521 "Mining Quantitative Frequent Itemsets Using Adaptive Density-based Subspace Clustering" Takashi Washio, Yuki Mitsunaga, and Hiroshi Motoda |
| 533 "CTC - Correlating Tree Patterns for Classification" Albrecht Zimmermann and Bjoern Bringmann |
| 535 "Mining Ontological Knowledge from Domain-Specific Text Documents" Xing Jiang and Ah-Hwee Tan |
| 550 "Anomaly Intrusion Detection using Multi-Objective Genetic Fuzzy System and Agent-based Evolutionary Computation Framework" Chi-Ho Tsang, Sam Kwong, and Hanli Wang |
| 562 "Blocking Anonymity Threats Raised by Frequent Itemset Mining" Maurizio Atzori, Francesco Bonchi, Fosca Giannotti, and Dino Pedreschi |
| 568 "A Framework for Semi-supervised Learning based on Subjective and Objective Clustering Criteria" Maria Halkidi, Dimitrios Gunopulos, Nitin Kumar, Michalis Vazirgiannis, and Carlotta Domeniconi |
| 573 "Categorization and Keyword Identification of Unlabeled Documents" Ning Kang, Carlotta Domeniconi, and Daniel Barbara |
| 576 "Mining Approximate Frequent Itemset from Noisy Data" Jinze Liu, Susan Paulsen, Wei Wang, Andrew Nobel, and Jan Prins |
| 580 "Segment-Based Injection Attacks against Collaborative Recommender Systems" Robin Burke, Bamshad Mobasher, Runa Bhaumik, and Chad Williams |
| 584 "A Rule Evaluation Support Method with Learning Models Based on" Hidenao Abe, Shusaku Tsumoto, Miho Ohsaki, and Takahira Yamaguchi |
| 587 "Predicting Software Escalations with Maximum ROI" Charles X. Ling, Shengli Sheng, Tilmann Bruckhaus, and Nazim H. Madhavji |
| 590 "A Graph-Based Ranking Algorithm for Geo-Referencing Documents" Bruno Martins and Mário Silva |
| 607 "Optimizing Constraint-Based Mining by Automatically Relaxing Constraints" Arnaud Soulet and Bruno Crémilleux |
| 608 "Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis" Tsuyoshi Ide |
| 615 "Average Number of Frequent (Closed) Patterns in Bernoulli and Markovian Databases" Loick Lhote, Francois Rioult, and Arnaud Soulet |
| 618 "Process Diagnosis via Electrical-Wafer-Sorting Maps Classification" Federico Di Palma, Giuseppe De Nicolao, and Guido Miraglia |
| 645 "Merging Interface Schemas on the Deep Web via Clustering Aggregation" Wensheng Wu, AnHai Doan, and Clement Yu |
| 650 "Instability of Classifiers on Categorical Data" Arno Siebes, Muhammad Subianto, and Ad Feelders |
| 657 "Economical Active Feature-value Acquisition through Expected Utility Estimation" Prem Melville, Maytal Saar-Tsechansky, Foster Provost, and Raymond Mooney |
| 664 "Pruning Social Networks Using Structural Properties and Descriptive Attributes" Lisa Singh, Lise Getoor, and Louis Licamele |
| 667 "Bias Analysis in Text Classification for Highly Skewed Data" Lei Tang and Huan Liu |
| 679 "Fast Frequent String Mining Using Suffix Arrays" Johannes Fischer, Volker Heun, and Stefan Kramer |
| 681 "Semi-supervised Mixture of Kernels via LPBoost Methods" Jinbo Bi, Glenn Fung, Murat Dundar, and Bharat Rao |
| 682 "A Preference Model for Structured Supervised Learning Tasks" Fabio Aiolli |
| 696 "Dynamic Ensemble Re-Construction for Better Ranking" Jin Huang and Charles X. Ling |
| 704 "A Join-less Approach for Co-location Pattern Mining: A Summary of Results" Jin Soung Yoo, Shashi Shekhar, and Mete Celik |
| 714 "Mining chains of relations" Aristides Gionis, Foto Afrati, Gautam Das, Heikki Mannila, Taneli Mielikainen, and Panayiotis Tsaparas |
| 724 "Learning through Changes: An Empirical Study of Dynamic Behaviors of Probability Estimation Trees" Kun Zhang, Zujia Xu, Jing Peng, and Bill Buckles |
| 740 "Example-Based Robust Outlier Detection in High Dimensional Datasets" Cui Zhu, Hiroyuki Kitagawa, and Christos Faloutsos |
| 765 "CLUMP: A Scalable and Robust Framework for Structure Discovery" Kunal Punera and Joydeep Ghosh |
| 767 "Parallel algorithms for distance-based and density-based outliers" Elio Lozano and Edgar Acuna |
| 777 "FS3: A Random Walk based Free-Form Spatial Scan Statistic for Anomalous Window Detection" Vandana Janeja and Vijayalakshmi Atluri |
| 785 "Hot Item Mining and Summarization from Multiple Auction Web Sites" Tak-Lam Wong and Wai Lam |
| 789 "A Scalable Collaborative Filtering Framework based on Co-clustering" Thomas George and Srujana Merugu |
| 791 "Triple Jump Acceleration for the EM Algorithm" Han-Shen Huang, Chun-Nan Hsu, and Bou-Ho Yang |
| 797 "Visualizing Global Manifold Based on Distributed Local Data Abstraction" Xiaofeng Zhang and William K. Cheung |
| 804 "On Learning Asymmetric Dissimilarity Measures" Krishna Kummamuru, Raghu Krishnapuram, and Rakesh Agrawal |
| 805 "Semi-Supervised Clustering with Metric Learning using Relative Comparisons" Nimit Kumar, Krishna Kummamuru, and Deepa Paranjpe |
| 812 "CLUGO: A Clustering Algorithm for Automated Functional Annotations Based on Gene Ontology" In-Yee Lee, Jan-Ming Ho, and Ming-Syan Chen |
| 821 "A Random Rotation Perturbation Approach to Privacy Preserving Data Classification" Keke Chen and Ling Liu |
| 823 "Automatically Mining Result Records from Search Engine Response Pages" Dheerendranath Mundluru |