DMIN'16
The 2016 International Conference on Data Mining
Foreword
REAL-WORLD DATA MINING APPLICATIONS, CHALLENGES, AND PERSPECTIVES
Clustering and Prediction of Solar Radiation Daily Patterns
Giuseppe Nunnari, Silvia Nunnari
Merging Event Logs for Process Mining with Hybrid Artificial Immune Algorithm
Yang Xu, Qi Lin, Martin Q. Zhao
Forecasting Movements in Oil Spot Prices using Data Mining Methods
M. E. Malliaris, A. G. Malliaris
Inter-correlative Histogram Feature and Dimension Reduction for Content Based Multimedia Retrieval
Vinoda Reddy Baradi, P. Suresh Varma Penshulal, A. Govardhan Aluseri
Detecting Mass Emergency Events on Social Media: One Classification Problem or Many?
Viktor Pekar, Jane Binner, Hossein Najafi
An Application of Data Mining in Energy Industry
Jongsawas Chongwatpol
Zonification of Heavy Traffic in Mexico City
Ruben-F Estrada-S, Alejandro Molina, Adriana Perez-Espinosa, Araceli Reyes-C, Jose Luis Quiroz-F, Emilio Bravo-G
Song Genre Classification via Lyric Text Mining
Anthony Canicatti
Using Machine Learning Algorithms to Improve the Prediction Accuracy in Disease Identification: An Empirical Example
Yong Cai, Dong Dai, Shaowen Hua
Random Under-Sampling Ensemble Methods for Highly Imbalanced Rare Disease Classification
Dong Dai, Shaowen Hua

DATA SCIENCE AND DATA SERVICES
DL4MD: A Deep Learning Framework for Intelligent Malware Detection
William Hardy, Lingwei Chen, Shifu Hou, Yanfang Ye, Xin Li
Skill Identification Using Time Series Data Mining
Toshiyuki Maeda, Masumi Yajima
Internet of Things Technologies to Rationalize the Data Acquisition in Industrial Asset Management
Sini-Kaisu Kinnunen, Antti Yla-Kujala, Salla Marttonen-Arola, Timo Karri, David Baglee
Efficient Algorithms for Mining DNA Sequences
Guojun Mao
Approximation Algorithms for D-hop Dominating Set Problem
Alina Campan, Traian Marius Truta, Matthew Beckerich
Self-Organizing Map Convergence
Robert Tatoian, Lutz Hamel
A Probabilistic Logic-based Approach for Subjective Interestingness Analysis
Jose Carlos Ferreira da Rocha, Alaine Margarete Guimaraes, Valter Luis Estevam Jr
A Hybrid Weighted Nearest Neighbor Approach to Mine Imbalanced Data
Harshita Patel, G.S. Thakur
Best Practices in Measurements for Asset Characterization in Complex Engineering Systems
Giulio D'Emilia, Diego Pascual Galar, Antonella Gaspari
Strategies for Distributed Curation of Social Media Data for Safety and Pharmacovigilance
Tim A. Casperson, Jeffery L. Painter, Juergen Dietrich
Gene Selection from Microarray Data for Age-related Macular Degeneration by Data Mining
Yuhan Hao, Gary Weiss
SEGMENTATION, CLUSTERING, ASSOCIATION + WEB / TEXT / MULTIMEDIA MINING + SOFTWARE
String Vector Based AHC as Approach to Word Clustering
Taeho Jo
Integrating Sequential Pattern Mining Techniques and Support Vector Machines for Sequence Classification
Chieh-Yuan Tsai, Yu-Yu Yao
Common Sense Knowledge, Ontology and Text Mining for Implicit Requirements
Onyeka Emebo, Aparna Varde, Olawande Daramola
KDPMEL: A Knowledge Discovery Process Modeling and Enacting Language
Hesham Mansour
Detecting Change in News Feeds Using a Context Based Graph
Lenin Mookiah, William Eberle, Maitrayi Mondal
PCSE-KDD: A Process-Centered Support Environment for the Knowledge Discovery Processes
Hesham Mansour
Robust Speaker Recognition in the Presence of Speech Coding Distortion for Remote Access Applications
Robert W. Mudrowsky, Ravi P. Ramachandran, Umashanger Thayasivam, Sachin S. Shetty

REGRESSION AND CLASSIFICATION
A Cross-Validation Method for Linear Regression Model Selection
Jingwei Xiong, Junfeng Shang
Convolutional Neural Net and Bearing Fault Analysis
Dean Lee, Vincent Siu, Rick Cruz, Charles Yetman
Bayesian Learning of Clique Tree Structure
Cetin Savkli, J. Ryan Carr, Philip Graff, Lauren Kennell
Mixtures of Polynomials for Regression Problems
J. Carlos Luengo, Rafael Rumi
Predictive Modeling for Student Retention at St. Cloud State University
Hasith Dissanayake, David Robinson, Omar Al-Azzam
The Generalized Shortest Path Kernel for Classifying Cluster Graphs
Linus Hermansson
Combining Ensembles
Ulf Johansson, Henrik Linusson, Tuve Lofstrom, Cecilia Sonstrod
Named Entity Recognition in Affiliations of Biomedical Articles Using Statistics and HMM Classifiers
Jongwoo Kim, George Thoma
