


default search action
12th KDD 2006: Philadelphia, PA, USA
- Tina Eliassi-Rad, Lyle H. Ungar, Mark Craven, Dimitrios Gunopulos:

Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, August 20-23, 2006. ACM 2006, ISBN 1-59593-339-5
Conference invited talks
- John A. Stankovic:

Self-Organizing wireless sensor networks in action. 1 - Andrew W. Moore:

New cached-sufficient statistics algorithms for quickly answering statistical questions. 2 - Rakesh Agrawal:

Next frontier. 3
Research track papers
- Elke Achtert, Christian Böhm, Hans-Peter Kriegel, Peer Kröger, Arthur Zimek

:
Deriving quantitative models for correlation clusters. 4-13 - Alekh Agarwal, Soumen Chakrabarti, Sunny Aggarwal:

Learning to rank networked entities. 14-23 - Deepak Agarwal, Andrew McGregor, Jeff M. Phillips, Suresh Venkatasubramanian, Zhengyuan Zhu

:
Spatial scan statistics: approximations and performance study. 24-33 - Aris Anagnostopoulos

, Michail Vlachos
, Marios Hadjieleftheriou, Eamonn J. Keogh, Philip S. Yu:
Global distance-based segmentation of trajectories. 34-43 - Lars Backstrom, Daniel P. Huttenlocher, Jon M. Kleinberg, Xiangyang Lan:

Group formation in large social networks: membership, growth, and evolution. 44-54 - Daniel Barbará, Carlotta Domeniconi, James P. Rogers:

Detecting outliers using transduction and statistical testing. 55-64 - Christian Böhm, Christos Faloutsos

, Jia-Yu Pan, Claudia Plant
:
Robust information-theoretic clustering. 65-75 - Justin Brickell, Vitaly Shmatikov:

Efficient anonymity-preserving data collection. 76-85 - Gregory Buehrer, Srinivasan Parthasarathy

, Amol Ghoting:
Out-of-core frequent pattern mining on a commodity PC. 86-95 - Toon Calders, Bart Goethals

, Szymon Jaroszewicz
:
Mining rank-correlated sets of numerical attributes. 96-105 - Jin Chen, Wynne Hsu, Mong-Li Lee, See-Kiong Ng:

NeMoFinder: dissecting genome-wide protein-protein interactions with meso-scale network motifs. 106-115 - Jason V. Davis, Inderjit S. Dhillon:

Estimating the global pagerank of web communities. 116-125 - Chris H. Q. Ding, Tao Li, Wei Peng, Haesun Park:

Orthogonal nonnegative matrix t-factorizations for clustering. 126-135 - Wei Fan, Joe McCloskey, Philip S. Yu:

A general framework for accurate and fast regression by data summarization in random decision trees. 136-146 - Wei Fan, Ian Davidson:

Reverse testing: an efficient framework to select amongst classifiers under sample selection bias. 147-156 - George Forman:

Quantifying trends accurately despite classifier error and class imbalance. 157-166 - Aristides Gionis, Heikki Mannila, Taneli Mielikäinen, Panayiotis Tsaparas

:
Assessing data mining results via swap randomization. 167-176 - Kosuke Hashimoto, Kiyoko F. Aoki-Kinoshita

, Nobuhisa Ueda, Minoru Kanehisa, Hiroshi Mamitsuka
:
A new efficient probabilistic model for mining labeled ordered trees. 177-186 - Steven C. H. Hoi, Michael R. Lyu, Edward Y. Chang:

Learning the unified kernel machines for classification. 187-196 - Tamás Horváth, Jan Ramon, Stefan Wrobel:

Frequent subgraph mining in outerplanar graphs. 197-206 - Alexander Ihler

, Jon Hutchins, Padhraic Smyth
:
Adaptive event detection with time-varying poisson processes. 207-216 - Thorsten Joachims:

Training linear SVMs in linear time. 217-226 - Yiping Ke

, James Cheng, Wilfred Ng
:
Mining quantitative correlated patterns using an information-theoretic approach. 227-236 - Arno J. Knobbe, Eric K. Y. Ho:

Maximally informative k-itemsets and their efficient discovery. 237-244 - Yehuda Koren, Stephen C. North, Chris Volinsky:

Measuring and extracting proximity in networks. 245-255 - Ravi Kumar, Kunal Punera, Andrew Tomkins:

Hierarchical topic segmentation of websites. 257-266 - Longin Jan Latecki

, Marc Sobel, Rolf Lakämper:
New EM derived from Kullback-Leibler divergence. 267-276 - Kristen LeFevre, David J. DeWitt, Raghu Ramakrishnan:

Workload-aware anonymization. 277-286 - Ping Li, Trevor Hastie, Kenneth Ward Church:

Very sparse random projections. 287-296 - Bing Liu, Kaidi Zhao, Jeffrey Benkler, Weimin Xiao:

Rule interestingness analysis using OLAP operations. 297-306 - Elsa Loekito, James Bailey:

Fast mining of high dimensional expressive contrast patterns using zero-suppressed binary decision diagrams. 307-316 - Bo Long, Xiaoyun Wu, Zhongfei (Mark) Zhang, Philip S. Yu:

Unsupervised learning on k-partite graphs. 317-326 - Michael W. Mahoney, Mauro Maggioni, Petros Drineas

:
Tensor-CUR decompositions for tensor-based data. 327-336 - Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, ChengXiang Zhai:

Generating semantic annotations for frequent patterns with context analysis. 337-346 - Taneli Mielikäinen, Evimaria Terzi, Panayiotis Tsaparas

:
Aggregating time partitions. 347-356 - Matthew J. Rattigan, Marc E. Maier, David D. Jensen:

Using structure indices for efficient approximation of network properties. 357-366 - Rómer Rosales, Glenn Fung:

Learning sparse metrics via linear programming. 367-373 - Jimeng Sun, Dacheng Tao, Christos Faloutsos

:
Beyond streams and graphs: dynamic tensor analysis. 374-383 - Lei Tang, Jianping Zhang, Huan Liu:

Acclimatizing taxonomic semantics for hierarchical content classification from semantics to data-driven taxonomy. 384-393 - Yufei Tao

, Xiaokui Xiao, Shuigeng Zhou:
Mining distance-based outliers from large databases in any metric space. 394-403 - Hanghang Tong, Christos Faloutsos

:
Center-piece subgraphs: problem definition and fast solutions. 404-413 - Ke Wang, Benjamin C. M. Fung:

Anonymizing sequential releases. 414-423 - Xuerui Wang, Andrew McCallum:

Topics over time: a non-Markov continuous-time model of topical trends. 424-433 - Geoffrey I. Webb:

Discovering significant rules. 434-443 - Dong Xin, Hong Cheng, Xifeng Yan, Jiawei Han:

Extracting redundancy-aware top-k patterns. 444-453 - Jieping Ye, Tie Wang:

Regularized discriminant analysis for high dimensional, low sample size data. 454-463 - Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Kriegel, Mingrui Wu:

Supervised probabilistic principal component analysis. 464-473 - Dell Zhang, Wee Sun Lee:

Extracting key-substring-group features for text classification. 474-483 - Qiankun Zhao, Tie-Yan Liu, Sourav S. Bhowmick, Wei-Ying Ma

:
Event detection from evolution of click-through data. 484-493 - Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma

:
Simultaneous record detection and attribute labeling in web data extraction. 494-503
Research track posters
- Naoki Abe, Bianca Zadrozny, John Langford:

Outlier detection by active learning. 504-509 - Charu C. Aggarwal, Jian Pei

, Bo Zhang
:
On privacy preservation against adversarial data mining. 510-516 - Bavani Arunasalam, Sanjay Chawla:

CCCS: a top-down associative classifier for imbalanced class distribution. 517-522 - Tanya Y. Berger-Wolf

, Jared Saia:
A framework for analysis of dynamic social networks. 523-528 - Indrajit Bhattacharya, Lise Getoor, Louis Licamele:

Query-time entity resolution. 529-534 - Cristian Bucila, Rich Caruana, Alexandru Niculescu-Mizil:

Model compression. 535-541 - Robin D. Burke

, Bamshad Mobasher
, Chad Williams
, Runa Bhaumik:
Classification features for attack detection in collaborative recommender systems. 542-547 - Vitor R. Carvalho, William W. Cohen:

Single-pass online learning: performance, voting schemes and online feature selection. 548-553 - Deepayan Chakrabarti

, Ravi Kumar, Andrew Tomkins:
Evolutionary clustering. 554-560 - Aristides Gionis, Heikki Mannila, Kai Puolamäki, Antti Ukkonen:

Algorithms for discovering bucket orders from data. 561-566 - Hongyu Guo, Herna L. Viktor:

Mining relational data through correlation-based multiple view validation. 567-573 - Tomoharu Iwata, Kazumi Saito, Takeshi Yamada:

Recommendation method for extending subscription periods. 574-579 - Wolfgang Jank, Galit Shmueli, Shanshan Wang:

Dynamic, real-time forecasting of online auctions via functional models. 580-585 - Szymon Jaroszewicz

:
Polynomial association rules with applications to logistic regression. 586-591 - Nan Jiang, Le Gruenwald:

CFI-Stream: mining closed frequent itemsets in data streams. 592-597 - Arnd Christian König, Eric Brill:

Reducing the human overhead in text categorization. 598-603 - Deept Kumar, Naren Ramakrishnan

, Richard F. Helm, Malcolm Potts:
Algorithms for storytelling. 604-610 - Ravi Kumar, Jasmine Novak, Andrew Tomkins:

Structure and evolution of online social networks. 611-617 - Sven Laur, Helger Lipmaa, Taneli Mielikäinen:

Cryptographically private support vector machines. 618-624 - Hady Wirawan Lauw

, Ee-Peng Lim
, Ke Wang:
Bias and controversy: beyond the statistical deviation. 625-630 - Jure Leskovec, Christos Faloutsos

:
Sampling from large graphs. 631-636 - Jinze Liu, Qi Zhang, Wei Wang

, Leonard McMillan
, Jan F. Prins:
Clustering pair-wise dissimilarity data into partially ordered sets. 637-642 - Dharmesh M. Maniyar, Ian T. Nabney

:
Visual data mining using principled projection algorithms and information visualization techniques. 643-648 - Qiaozhu Mei, ChengXiang Zhai:

A mixture model for contextual text mining. 649-655 - Srujana Merugu, Saharon Rosset, Claudia Perlich:

A new multi-view regression approach with an application to customer wallet estimation. 656-661 - Riadh Ben Messaoud, Omar Boussaid, Sabine Loudcher Rabaséda:

Efficient multidimensional data representations based on multiple correspondence analysis. 662-667 - Fabian Mörchen:

Algorithms for time series knowledge mining. 668-673 - J. Saketha Nath, Chiranjib Bhattacharyya, M. Narasimha Murty:

Clustering based large margin classification: a scalable approach using SOCP formulation. 674-679 - David Newman, Chaitanya Chemudugunta, Padhraic Smyth

:
Statistical entity-topic models. 680-686 - Noam Palatin, Arie Leizarowitz, Assaf Schuster, Ran Wolff:

Mining for misconfigured machines in grid systems. 687-692 - Jia-Yu Pan, André G. R. Balan, Eric P. Xing, Agma J. M. Traina, Christos Faloutsos

:
Automatic mining of fruit fly embryo images. 693-698 - Seung-Taek Park, David M. Pennock, Omid Madani, Nathan Good, Dennis DeCoste:

Naïve filterbots for robust cold-start recommendations. 699-705 - Myra Spiliopoulou, Irene Ntoutsi, Yannis Theodoridis, René Schult:

MONIC: modeling and monitoring cluster transitions. 706-711 - Fabian M. Suchanek, Georgiana Ifrim, Gerhard Weikum:

Combining linguistic and statistical analysis to extract relations from web documents. 712-717 - Bin Tan, Xuehua Shen, ChengXiang Zhai:

Mining long-term search history to improve search accuracy. 718-723 - Ivor W. Tsang

, András Kocsor, James T. Kwok:
Efficient kernel feature extraction for massive data sets. 724-729 - Chao Wang, Srinivasan Parthasarathy

:
Summarizing itemset patterns using probabilistic models. 730-735 - Haixun Wang, Jian Yin, Jian Pei

, Philip S. Yu, Jeffrey Xu Yu:
Suppressing model overfitting in mining concept-drifting data streams. 736-741 - Steve Wedig, Omid Madani:

A large-scale analysis of query logs for assessing personalization opportunities. 742-747 - Li Wei, Eamonn J. Keogh:

Semi-supervised time series classification. 748-753 - Raymond Chi-Wing Wong, Jiuyong Li

, Ada Wai-Chee Fu, Ke Wang:
(alpha, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing. 754-759 - Gang Wu, Edward Y. Chang, Yen-Kuang Chen

, Christopher J. Hughes
:
Incremental approximate matrix factorization for speeding up support vector machines. 760-766 - Mingxi Wu, Chris Jermaine:

Outlier detection by sampling with accuracy guarantees. 767-772 - Dong Xin, Xuehua Shen, Qiaozhu Mei, Jiawei Han:

Discovering interesting patterns through user's interactive feedback. 773-778 - Hui Xiong, Junjie Wu, Jian Chen:

K-means clustering versus validation measures: a data distribution perspective. 779-784 - Jian Xu, Wei Wang, Jian Pei

, Xiaoyuan Wang, Baile Shi, Ada Wai-Chee Fu:
Utility-based anonymization using local recoding. 785-790 - Illhoi Yoo, Xiaohua Hu, Il-Yeol Song:

Integration of semantic-based bipartite graph representation and mutual refinement strategy for biomedical literature clustering. 791-796 - Zhiping Zeng, Jianyong Wang, Lizhu Zhou, George Karypis

:
Coherent closed quasi-clique discovery from large dense graph databases. 797-802 - Minghua Zhang, Wynne Hsu, Mong-Li Lee:

Mining progressive confident rules. 803-808 - Sheng Zhang, Amit Chakrabarti

, James Ford, Fillia Makedon:
Attack detection in time series for recommender systems. 809-814 - Shichao Zhang, Feng Chen, Xindong Wu, Chengqi Zhang

:
Identifying bridging rules between conceptual clusters. 815-820 - Tong Zhang, Alexandrin Popescul, Byron Dom:

Linear prediction models with graph regularization for web-page categorization. 821-826 - Lizhuang Zhao, Mohammed J. Zaki, Naren Ramakrishnan:

BLOSOM: a framework for mining arbitrary boolean expressions. 827-832
Industrial and government applications track invited talks
- Jeff Jonas:

Introducing perpetual analytics. 833 - William Kahn:

Capital One's statistical problems: our top ten list. 834 - Andrew McCallum:

Information extraction, data mining and joint inference. 835 - Michael Cavaretta:

Data mining challenges in the automotive domain. 836
Industrial and government applications track papers
- Jinbo Bi, Senthil Periaswamy, Kazunori Okada

, Toshiro Kubota, Glenn Fung, Marcos Salganicoff, R. Bharat Rao:
Computer aided detection via asymmetric cascade of sparse hyperplane classifiers. 837-844 - Rebecca Castaño, Dominic Mazzoni, Nghia Tang, Ronald Greeley, Thomas Doggett, Benjamin Cichy, Steve A. Chien, Ashley Davies:

Onboard classifiers for science event detection on a remote sensing spacecraft. 845-851 - George Forman, Evan Kirshenbaum, Jaap Suermondt:

Pragmatic text mining: minimizing human effort to quantify many issues in call logs. 852-861 - Seth Hettich, Michael J. Pazzani:

Mining for proposal reviewers: lessons learned at the national science foundation. 862-871 - Chao Liu, Chen Chen, Jiawei Han, Philip S. Yu:

GPLAG: detection of software plagiarism by program dependence graph analysis. 872-881 - Fabian Mörchen, Ingo Mierswa, Alfred Ultsch:

Understandable models Of music collections based on exhaustive feature generation with temporal statistics. 882-891 - Kaidi Zhao, Bing Liu, Jeffrey Benkler, Weimin Xiao:

Opportunity map: identifying causes of failure - a deployed data mining system. 892-901
Industrial and government applications track posters
- Eugene Agichtein, Zijian Zheng:

Identifying "best bet" web search results by mining past user behavior. 902-908 - Rich Caruana, Mohamed Farid Elhawary, Art Munson, Mirek Riedewald, Daria Sorokina, Daniel Fink, Wesley M. Hochachka, Steve Kelling:

Mining citizen science data to predict orevalence of wild bird species. 909-915 - Julien Etienne, Bernd Wachmann, Lei Zhang:

A component-based framework for knowledge discovery in bioinformatics. 916-921 - Byron J. Gao, Obi L. Griffith, Martin Ester, Steven J. M. Jones:

Discovering significant OPSM subspace clusters in massive gene expression data. 922-928 - Charles X. Ling, Victor S. Sheng, Tilmann F. W. Bruckhaus, Nazim H. Madhavji:

Maximum profit mining and its application in software development. 929-934 - Ingo Mierswa, Michael Wurst, Ralf Klinkenberg, Martin Scholz, Timm Euler:

YALE: rapid prototyping for complex data mining tasks. 935-940 - Sankar Virdhagriswaran, Gordon Dakin:

Camouflaged fraud detection in domains with complex relationships. 941-947 - Lian Yan, Patrick Baldasare:

Beyond classification and ranking: constrained optimization of the ROI. 948-953
Panel
- Gregory Piatetsky-Shapiro, Robert Grossman, Chabane Djeraba, Ronen Feldman, Lise Getoor, Mohammed Javeed Zaki:

Is there a grand challenge or X-prize for data mining? 954-956

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














