DMIN'11

The 2011 International Conference on Data Mining


ISBN #: 1-60132-168-6

EDITOR:
Robert Stahlbock

ASSOCIATE EDITORS:
Mahmoud Abou-Nasr, Hamid R. Arabnia, Nikolaos Kourentzes, Philippe Lenca, Wolfram-M. Lippe, Gary M. Weiss

Foreword

SESSION: REAL-WORLD DATA MINING APPLICATIONS, CHALLENGES, AND PERSPECTIVES

Three Different Paradigms for Interactive Data Clustering

Terje Kristensen, Vemund Jakobsen

Noise-Tolerant Active Learning

Tsutomu Osoda, Satoru Miyano

Breast Cancer Risk Score: A Data Mining Approach to Improve Readability

Emilien Gauthier, Laurent Brisson, Philippe Lenca, Stephane Ragusa

Soft-sensors for Real-time Monitoring and Control of a Black Liquor Concentration Process

Mouloud Amazouz, Radu Platon

Ant Colony Optimization with Ants' Individual Memories

Hiroki Inoue, Yasuhiko Kato

A Prediction Model for Recognition of Bad Credit Customers in Saman Bank Using Neural Networks

Masoud Yaghini, Toktam Zhiyan, Mehdi Fallahi

A New Approach for Handling Numeric Ranges for Graph-Based Knowledge Discovery

Oscar Romero, Lawrence Holder, Jesus Gonzalez

A Framework for Detecting Vulnerable, Cascaded Fuzzy Cycles in the Carbon Chain

James Buckley, Jennifer Seitzer

Reliability Analysis of Markov Blanket Learning Algorithms (1996-2010)

Shunkai Fu, Michel Desmarais, Weibin Chen

Sobek: a Text Mining Tool for Educational Applications

Eliseo Reategui, Miriam Klemann, Daniel Epstein, Alexandre Lorenzatti

A Novel Soft Computing Hybrid for Data Imputation

Ankaiah Narravula, Ravi Vadlamani

Bankruptcy Prediction in Banks by Principal Component Analysis Threshold Accepting Trained Wavelet Neural Network Hybrid

Vasu Madireddi, Ravi Vadlamani

A Real Application on Non-technical Losses Detection: The MIDAS Project

Juan I. Guerrero, Carlos Leon, Felix Biscarri, Inigo Monedero, Jesus Biscarri, Rocio Millan

Capstone Project: Event Monitoring and Alerting System

Wook-Sung Yoo, Geetha Rajgopalan

Constrained Nonnegative Matrix Factorization for Data Privacy

Nirmal Thapa, Lian Liu, Pengpeng Lin, Jie Wang, Jun Zhang

An Optimization Framework for Process Discovery Algorithms

Ton Weijters

Facial Nerve Stream Trajectory Data Modelling and Visualization

Jalel Akaichi, Hanen Bouali, Zeineb Dhouioui

Domain Specific Services for Continuous Diagnoses in the Context of Ambient Assisted Living – AAL

Bjoern-Helge Busch, Ralph Welge

Position of Gateway Drugs in the Spectrum of Adolescent Drug-Use Initiation in Indiana - Relevance of Market Basket Analysis of Data Mining in Detection of Common Substance-Use Initiation Sequences

Ahmed YoussefAgha, Wasantha Jayawardene

SESSION: SEGMENTATION, CLUSTERING, ASSOCIATION

Clustering Approach Based On Von Neumann Topology Artificial Bee Colony Algorithm

Wenping Zou, Yunlong Zhu, Hanning Chen, Tao Ku

Hierarchical Random Graphs for Networks with Weighted Edges and Multiple Edge Attributes

David Allen, Tsai-Ching Lu, Dave Huber, Hankyu Moon

A Two-Stage Algorithm for Data Clustering

Abdolreza Hatamlou, Salwani Abdullah

A Binary Based Approach for Generating Association Rules

Mohamed El Hadi Benelhadj, Khedija Arour, Mahmoud Boufaida, Yahya Slimani

Casino Fraud Data Mining

Robert Woodley, Warren Noll, Kevin Shallenberger

A Case Study on Clustering and Mining Business Processes from a University

Pedro Esposito, Marco Vaz, Jano Souza, Luciano Terres

Centrality Preservation in Anonymized Social Networks

Traian Marius Truta, Alina Campan, Ashley Gasmi, Nicholas Cooper, Andrew Elstun

Design of Customer Behavior Analysis Model in Automobile Marketing

Yuanyuan Mao, Lan Huang, Guishen Wang, Shuxue Zou

An Approach to Selecting Proper Dimensions for Noisy Data

Yong Shi, Jerry Meisner

Adaptive Neuro Fuzzy Networks based on Quantum Subtractive Clustering

Ali Mousavi, Mehrdad Jalali, Mahdi Yaghoubi

A Clustering Approach to Unsupervised Attack Detection in Collaborative Recommender Systems

Runa Bhaumik, Bamshad Mobasher, Robin Burke

Stable Clustering of Temporal Gene Expression Data

Gaolin Zheng, Guowang Mu, Chung-Hao Chen, Xinyu Huang

Heuristic Approaches for Embedded Processor System Size Reduction

Rahul Dixit, Harpreet Singh

Probabilistic Vector Machines

Henri Luchian, Andrei Sucila

Mining Association Rules from Responded Questionnaire of Sanitary Education Guidance

Yo-Ping Huang, Zheng-Hong Deng, Shan-Shan Wang

A New Term Weighting Scheme For Document Clustering

Keerthiram Murugesan, Jun Zhang

A New Approach to Present Prototypes in Clustering of Time Series

Saeed R. Aghabozorgi, Teh Ying Wah, Amineh Amini, Mahmud R. Saybani

SESSION: REGRESSION, CLASSIFICATION

Threshold Value Based Traffic Congestion Identification Method

Zhanquan Sun, Weidong Gu, Jinqiao Feng, Xiaomin Zhu

Comparison of Single Image Processing and Bilateral Image Feature Subtraction in Breast Cancer Detection

Aijuan Dong, Sinanovic Senad

Simple R-Tree for Temporal Searches

Paul Te Braak, Richi Nayak

On Sample Selection Bias in Large-Scale Online Stream Mining: a Model Indexing Approach

Xiong Deng, Moustafa Ghanem, Yike Guo

HIOPGA: A New Hybrid Metaheuristic Algorithm to Train Feedforward Neural Networks for Prediction

Masoud Yaghini, Mohammad M. Khoshraftar, Mehdi Fallahi

Incremental Classification Based on Association Rules Algorithm (ICBA)

Sararak Tanarat, Worapoj Kreesuradej

Constrained Multi-Label Classification: A Semidefinite Programming Approach

Hui Wu, Guangzhi Qu, Hui Zhang, Craig Hartrick

An Empirical Study of Class Noise Impacts on Supervised Learning Algorithms and Measures

Victor Sheng, Rahul Tada, Abhinav Atla

A Novel Protein Secondary Structure Intelligent Prediction System

Bingru Yang

Bankruptcy Prediction with Missing Data

Qi Yu, Yoan Miche, Amaury Lendasse, Eric Severin

An EM-based Multi-Step Piecewise Surface Regression Learning Algorithm

Juan Luo, Alexander Brodsky

SESSION: EXPLORATIVE DATA MINING, DATA PREPROCESSING, FEATURE SELECTION

A Computerized Feature Reduction Using Principal Component Analysis for Accident Duration Forecasting on Freeway

Ying Lee

Example Labeling Difficulty within Repeated Labeling

Victor Sheng

Mining Frequent Item Sets Efficiently by Using Compression Techniques

Selim Mimaroglu, Cagri Cubukcu, Emin Aksehirli, Ertunc Erdil

Modeling Functional Outliers for High Frequency Time Series Forecasting with Neural Networks: An Empirical Evaluation for Electricity Load Data

Nikolaos Kourentzes

Feature Selection with Hybrid Mutual Information and Genetic Algorithm

Vahid Chahkandi, Mehrdad Jalali, Mahsa Mirshahi, Ali Hosseini

Finding Perfect-Predictor Feature Sets for Supervised Classification Using Genetic Algorithms

Alexander Liu, Cheryl Martin

A Novel Knowledge-discovering Approach from Massive Data

Heni Bouhamed, Ahmed Rebai, Thierry Lecroq, Maher Jaoua

Improved Interpretability of the Unified Distance Matrix with Connected Components

Lutz Hamel, Chris W. Brown

On Selecting the Number of Bins for a Histogram

Sai Venu Gopal Lolla, Lawrence L. Hoberock

SESSION: WEB AND TEXT MINING

A Secure Knowledge Discovery Framework for Clinical Informatics

Yueh-Hsun Shih, Chung-Yueh Lien, Chi-Hsien Chen, Chia-Hung Hsiao, Woei-Chyn Chu

Pattern-based Aggregation of Named Entity Extractors

Tracy Lemmond, Paul Kidwell, Kofi Boakye, Nathan Perry, Joseph Guensche, John Nitao, William Hanley, Ryan Prenger, Ron Glaser

Sentiment Detection with Character n-Grams

Tino Hartmann, Sebastian Klenk, Andre Burkovski, Gunther Heidemann

Objective Words Can Improve Sentiment Classification for Word of Mouth

Chihli Hung, Hao-Kai Lin, Chih-Fong Tsai

Privacy-Preserving Profiling

Thomas Barnard, Adam Prugel-Bennett

SESSION: FEATURE SELECTION + CLUSTERING METHODS + TESTING APPLICATIONS

Perspective of Feature Selection Techniques in Bioinformatics

Satish Kumar, Mohammad Khalid Siddiqui

Modularity and Spectral Co-Clustering for Categorical Data

Lazhar Labiod, Mohamed Nadif

Statistical Procedure For Simultaneous Testing of Many Hypothesis

Gurpreet Bawa