PDPTA'11

The 2011 International Conference on Parallel and Distributed Processing Techniques and Applications


PDPTA'11 Vol I -  ISBN #:  1-60132-193-7
PDPTA'11 Vol II - ISBN #:  1-60132-194-5
PDPTA'11 Set -    ISBN #:  1-60132-195-3

EDITOR: Hamid R. Arabnia

ASSOCIATE EDITORS: 
Minoru Ito, Kazuki Joe, Hiroaki Nishikawa, Hiroshi Ishii, Fernando G. Tinetti, 
Ashu M. G. Solo, George A. Gravvanis

Foreword

SESSION: TOOLS AND MODELS FOR PARALLELIZATION AND INFRASTRUCTURE + POWER AWARE COMPUTING AND POWER EFFICIENCY

High Performance I/O and Data Management

William Dai

Generation of Correct Parallel Programs Guided by Rewriting Rules

Hidekatsu Koike, Kiyoshi Akama

AutoSCOPE: Automatic Suggestions for Code Optimizations using PerfExpert

Olalekan Sopeju, Martin Burtscher, Ashay Rane, James Browne

MapReduce with Deltas

Ralf Lammel, David Saile

Towards Utilizing Remote GPUs for CUDA Program Execution

Xiaonan Ji, Spencer Davis, Erikson Hardesty, Xu Liang, Sabuj Saha, Hai Jiang

Power Saving Mechanism for Multi-cluster Resource Manager with Dynamic Loading Prediction Scheduling Algorithm

Chang-Hsing Wu, Yi-Lun Pan

Developing an Intelligent Layer for Automatic Parallel Detection Implemented on Different High Performance Computing Platform

Mohamed Ahamed Mead, Hesham ElDeeb, Salwa Nassar

Go2ADLB: An Interface for Using ADLB Within Go

Ralph Butler, Chrisila Pettey, Brian Manifold

Evaluation Iterative Solver for pCDR on GPU Accelerator

Chih-Wei Hsieh, Sheng-Hsiu Kuo, Chau-Yi Chou

MOWIC: Modern Web-based Interface Toolkit for Cluster

Daniel Cleland, Chi Shen

A Hybrid Software Framework for the GPU Acceleration of Multi-Threaded Monte Carlo Applications

Joo Hong Lee, Mark Jones, Paul Plassmann

Framework Construction of Energy Efficiency System of Data Center

Haiping Qu, Xiuwen Wang, Lu Xu


SESSION: COMMUNICATION SYSTEMS + INTERCONNECTION NETWORKS


A Performance Metric for Message Forwarding Schemes of Massively Multiplayer Peer-to-Peer Based Networked Virtual Environments

James Mathias, Daniel Watson

A New Property of Interconnection Networks

Yuan-Kang Shih, Jimmy J. M. Tan, Lih-Hsing Hsu

Audrey: The Model and Implementation of a Hybrid P2P Framework for Massive Virtual Environments

James Mathias, Daniel Watson

Cycle Embedding in Folded Hypercubes

Y-Chuang Chen, Lieh-Yu Lin

A Cluster-Based Quantitative Reliability Model

Eduardo Canete, Manuel Diaz, Luis Llopis, Bartolome Rubio

Fault-tolerant Routing Algorithms Based on Approximate Routable Probabilities for Hypercube Networks

Thuy Duong Dinh, Keiichi Kaneko

The Hyper-Panconnectedness of the Crossed Cube

Hon-Chan Chen, Tzu-Liang Kung, Lih-Hsing Hsu

Modification and Evaluation of Software-Based Communications Unit of a LSC on Chip

Akiko Narita, Naoya Kato, Kenji Ichijo, Yoshio Yoshioka

A Protocol for Realtime Switched Communication in FPGA Clusters

Richard Anderson, Yoginder Dandass

Communicator Sensitive Static Analysis of MPI Collective Communication

Zhaofei Wang

SESSION: SIMULATION + NUMERICAL METHODS + PDE AND MATHEMATICAL PHYSICS AND ENGINEERING

GPU Acceleration of Solving Parabolic Partial Differential Equations Using Difference Equations

David Foster

Lock Graph: A Tree-Based Locking Method for Parallel Collision Handling with Diverse Particle Populations

Mark Lewis, Cameron Swords

Multi-agent System Simulation in Scala: An Evaluation of Actors for Parallel Simulation

Aaron Todd, Amara Keller, Mark Lewis, Martin Kelly

Asynchronous Communication for Finite-Difference Simulations on GPU Clusters using CUDA and MPI

Daniel Playne, Ken Hawick

An Efficient Computational Approach for Solving a Class of Nonlinear Integral Equations

Khosrow Maleknejad, Parvin Torabi

Enumerating Order 7 de Bruijn Sequences

Gregory Mayhew

SESSION: GRID AND CLOUD COMPUTING

FTProfiler: A New Profiling Tool for GridFTP Servers

Huong Luu, Rajkumar Kettimuthu, Marianne Winslett

A SLA-based Framework with Support for Meta-scheduling in Advance for Grids

Javier Conejero, Blanca Caminero, Carmen Carrion

CORS - A Cost Optimized Resource Reservation Scheme for Grid

Rifat Shahriyar, Md. Mostofa Akbar, M. Sohel Rahman, Md. Faizul Bari, Shampa Shahriyar

Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers

Wei Chen, Samuel Hargrove, Heh Miao, Liang Hong

The Analysis for Virtualization Performance in Cluster and Cloud Computing

Ying-Chuan Chen, Shuen-Tai Wang, Hsi-Ya Chang, Te-Ming Chen, Chin-Hung Li

Market Basket Analysis Algorithm with Map/Reduce of Cloud Computing

Jongwook Woo, Yuhang Xu

SESSION: PARALLEL ALGORITHMS AND APPLICATIONS

Graph Generation on GPUs using Dynamic Memory Allocation

Arno Leist, Ken Hawick

Hierarchical Parallelization of Molecular Fragment Analysis on Multicore Cluster

Liu Peng, Bhupesh Bansal, Ashish Sharma, Rajiv Kalia, Aiichiro Nakano, Priya Vashishta

Accelerating the Hough Transform with CUDA on Graphics Processing Units

Su Chen, Hai Jiang

Fast Dot Correlation in Optical Metrology on GPGPUs

Ralf Seidler, Andreas Schafer, Dietmar Fey

Evaluation of HPC Architectures for BRAMS Numerical Weather Model

Eugenio Sper de Almeida, Michael Bauer, Alvaro Luiz Fazenda

An Updated Self-stabilizing Algorithm to Maximal 2-packing and a Linear Variation under Synchronous Daemon

Zhengnan Shi

Using OpenCL for Implementing Simple Parallel Graph Algorithms

Michael J. Dinneen, Masoud Khosravani, Andrew Probert

Design of a Mutual Situation Awareness Control Protocol Between Smart Homes by Using Location Transition Model

Mengqiao Zhang, Junbo Wang, Zixue Cheng, Yongping Chen, Lei Jing

A Massively Parallel Algorithm for Polyline Simplification Using an Associative Computing Model

Huy Tran, Michael Scherger

ViFramework: A Framework for Networked Video Streaming Components

Bram Kersten, Kris Van Rens, Rudolf Mak

Computing the Configuration Space Using Arrays with Reconfigurable Optical Buses

John Jenq

Design and Optimization of Hybrid MD5-Blowfish Encryption on GPUs

Zhu Wang, Josh Graham, Noura Ajam, Hai Jiang

Multi-GPU Load Balancing for In-situ Visualization

Robert Hagan, Yong Cao

Designing a Parallel Collaborative SAT Solver

Pascal Vander-Swalmen, Gilles Dequen, Michael Krajecki

On Using a Graphics Processing Unit to Solve The Closest Substring Problem

Jon Calhoun, Josh Graham, Hai Jiang

Achieving High Throughput Sequencing with Graphics Processing Units

Su Chen, Chaochao Zhang, Feng Shen, Ling Bai, Hai Jiang, Damir Herman

Optimization of a Single Seam Removal Using a GPU

Rok Cesnovar, Patricio Bulic, Tomaz Dobravec

An Experiment in Parallelizing the Fast Fourier Transform

Timothy O'Neil, Ameen Mirza, Dale Mugler

Parallel Processing of Geospatial Time-series Data

Monte Lunacek, Peter Graf, Wesley Jones

A Parallel GPU Version of the Traveling Salesman Problem

Molly A. O'Neil, Dan Tamir, Martin Burtscher

Genetic Algorithm based on Number of Children and Height Task for Multiprocessor Task Scheduling

Marjan Abdeyazdan, Vahid Arjmand, Amir Masoud Rahmani, Hamid Raeis Ghanavati

A Parallel Algorithm based on Simulated Annealing for Land use Zoning Plans

Marcos Suarez, Ines Sante, Francisco F. Rivera, Rafael Crecente, Marcos Boullon, Juan Porta, Jorge Parapar, Ramon Doallo

Shared Memory, Message Passing, and Hybrid Merge Sorts for Standalone and Clustered SMPs

Atanas Radenski

Rapid Performance of a Generalized Distance Calculation

Scott Fisackerly, Eric Chu, David Foster

GPU Cluster with MATLAB

Alberto Guillen, Maribel Garcia-Arenas, Luis-Javier Herrera, Hector Pomares, Ignacio Rojas

A Parallel Domain Decomposition Algorithm for Solving the Equation of Nitric Oxide Diffusion in the Nervous System

Jianxin Wang, Heng Wu, Yu Zhuang

Accelerating the Computation and Verification of Molecular Collision Models: A Case Study in Legacy Code Parallelization

Kurt O'Hearn, Christian Trefftz, George McBane, Gregory Wolffe

A Safety-strengthened Election Protocol Based on an Unreliable Failure Detector in Distributed Systems

Yong-Hwan Cho, Seon-Hyong Lee, Yeong-Mok Kim, Sung-Hoon Park

Genetic Ensemble (G-Ensemble) for Meteorological Prediction Enhancement

Hisham Ihshaish, Ana Cortes, Miquel A. Senar

Study of Mobile Collaborative Information System using Distributed Database Architecture

Mahmoud Abaza, Duane Cato

SESSION: ULTRA LOW POWER DATA-DRIVEN NETWORKING SYSTEM AND ITS REALIZATION

Intermediate Achievement of Ultra-Low-Power Data-Driven Networking System: ULP-DDNS

Hiroaki Nishikawa, Kazuhiro Aoki, Hiroshi Ishii, Makoto Iwata

Chip Multiprocessor Platform for Ultra-Low-Power Data-Driven Networking System - ULP-DDNS

Shuji Sannomiya, Ryotaro Kuroda, Kazuhiro Aoki, Kei Miyagi, Makoto Iwata, Hiroaki Nishikawa

Multi-Grain Power Control Scheme in Ultra-Low-Power Data-Driven Chip Multiprocessor - ULP-DDCMP

Yukikuni Nishida, Shuji Sannomiya, Hiroaki Nishikawa

Self-Timed Power-Aware Pipeline Chip and Its Evaluation

Kei Miyagi, Shuji Sannomiya, Makoto Iwata, Hiroaki Nishikawa

Study on Applying Ultra-Low-Power Data-Driven Processor to Wireless Base Station

Hideki Yamauchi, Hiroaki Nishikawa

Broadcast Voice Streaming by Load-aware Flooding over Ad Hoc Network achieving Reduction of Traffic and Power Consumption

Keisuke Utsu, Hiroaki Nishikawa, Hiroshi Ishii

Proposal on Battery-aware Counter-based Flooding over Ad Hoc Networks

Keisuke Utsu, Hiroshi Sano, Turganzhan Kassymov, Hiroaki Nishikawa, Hiroshi Ishii

SESSION: SYSTEMS SOFTWARE + OS + THREADS + PROGRAMMING MODELS + ARCHITECTURE ISSUES

Model Checking Task Sets with Preemption Thresholds

Mitchell Neilsen

Analysis of False Cache Line Sharing Effects on Multicore CPUs

Suntorn Sae-eung, Robert Chun

A RISC-Based Moving Tiny Threads Architecture

Ville Leppanen, Jari-Matti Makela, Martti Forsell

Parallel RISC Architecture. A Functional Approach Based on Backus's FP language

Mihaela Malita, Gheorghe Stefan

Mobile Process Resumption in Java Without Bytecode Rewriting

Matthew Sowders, Jan B. Pedersen

Supporting Ordered Multiprefix Operations in Emulated Shared Memory CMPs

Martti Forsell, Jussi Roivainen

Efficient Virtual Machine Scheduling-policy for Virtualized Heterogeneous Multicore Systems

Ibrahim Takouna, Wesam Dawoud, Christoph Meinel

Prototyping a Library of Algorithmic Skeletons with Bulk Synchronous Parallel ML

Noman Javed, Frederic Loulergue, Julien Tesson, Wadoud Bousdira

A Parallel Architecture Using HDF for Storing DICOM Medical Images on Distributed File Systems

Tiago Soares, Douglas de Macedo, Michael Bauer, Mario Dantas

Dogleg Channel Routing with Parallel Mixed Integer Linear Programming Solvers

I-Lun Tseng, Yung-Wei Kao, Cheng-Yuan Chang, Adam Postula

Thick Control Flows: Introduction and Prospects

Ville Leppanen, Martti Forsell, Jari-Matti Makela

Dynamic Workflow Composition and Execution

Binh Minh Nguyen, Viet D. Tran, Ladislav Hluchy

Predicting CPU Availability of a Multi-core Processor Executing Concurrent Java Threads

Khondker Hasan, Nicolas Grounds, John Antonio

SESSION: EVALUATION METHODS AND PERFORMANCE ANALYSIS

Examining Anomalous Network Performance with Confidence

Bradley Settlemyer, Stephen Hodson, Jeffery Kuehn, Stephen Poole

Methodology to Predict the Performance Behavior of Shared-Memory Parallel Applications on Multicore Systems

John Corredor, Juan Carlos Moure, Dolores Rexachs, Daniel Franco, Emilio Luque

Effects of GPU and CPU Loads on Performance of CUDA Applications

Maksim Bobrov, Roy Melton, Stanislaw Radziszowski, Marcin Lukowiak

Implementation and Evaluation of Program Development Middleware for Cell Broadband Engine Clusters

Toshiaki Kamata, Masahiro Yamada, Akihiro Shitara, Yuri Nishikawa, Masato Yoshimi, Hideharu Amano

Performance Analysis and Evaluation of LANL’s PaScalBB I/O nodes using Quad-Data-Rate Infiniband and Multiple 10-Gigabit Ethernets Bonding

Hsing-bung Chen, Alfred Torrez, Parks Fields, Juan C. Franco, Daniel Illescas, Rocio Perez-Medina, Jharrod LaFon, Ben Haynes, John Herrera

A Set of Microbenchmarks for Measuring OpenMP Task Overheads

James LaGrone, Ayodunni Aribuki, Barbara Chapman

SESSION: FAULT-TOLERANT SYSTEMS + FAULT DETECTION METHODS AND TOOLS

Relentless Computing: Enabling Fault-Tolerant, Numerically Intensive Computation in Distributed Environments

Lucas A. Wilson, John A. Lockman III

On the Calculation of the Checkpoint Interval in Run-Time for Parallel Applications

Leonardo Fialho, Dolores Rexachs, Emilio Luque

Defining the Checkpoint Interval for Uncoordinated Checkpointing Protocols

Leonardo Fialho, Dolores Rexachs, Emilio Luque

Byzantine-Tolerant Grouping Fault Detection Protocol under High Churn Networks

Huawei Lu, Shuyu Chen, Xiaoqin Zhang, Guanghui Chang

SESSION: PARALLEL COMPUTING IN CLUSTERS: OPTIMIZATION AND PARALLELIZATION OF SEQUENTIAL APPLICATIONS

Scalability Analysis of a Parallel Dynamic Data Driven Genetic Algorithm for Forest Fire Spread Prediction

Monica Malen Denham, Ana Cortes, Tomas Margalef

Combining Scalability and Efficiency for SPMD Applications on Multicore Clusters

Ronal Muresano, Dolores Rexachs, Emilio Luque

A Methodology to Calculate a Program's Robustness against Transient Faults

Joao Gramacho, Dolores Rexachs, Emilio Luque

Update and Restructure Legacy Code for (or Before) Parallel Processing

Fernando G. Tinetti, Mariano Mendez, Mónica A. Lopez, Juan C. Labraga, Pedro G. Cajaraville

Broadcast and Partial Computing Algorithms for Cholesky Factorization on a Cluster of Multicore Computers

Fernando G. Tinetti, Gustavo Wolfmann

Parallel Smith-Waterman Algorithm for DNA Sequences Comparison on Different Cluster Architectures.

Enzo Rucci, Armando E. De Giusti, Franco Chichizola

Parallel Optimal and Suboptimal Heuristic Search on Multicore Clusters. Performance Analysis.

Victoria Sanz, Marcelo Naiouf, Armando E. De Giusti

Parallel Algorithms on Clusters of Multicores: Comparing Message Passing vs Hybrid Programming.

Fabiana Leibovich, Laura De Giusti, Marcelo Naiouf

Distributed Search on Large NoSQL Databases

Fernando G. Tinetti, Francisco Paez, Luis I. Aita, Demian Barry

SESSION: WORKSHOP ON MATHEMATICAL MODELING AND PROBLEM SOLVING, MPS

An Attribute Graph Grammar for UML Package Diagrams and its Applications

Takaaki Goto, Tetsuro Nishino, Kensei Tsuchida

Classification of Idiopathic Interstitial Pneumonia CT Images using Convolutional-net with Sparse Feature Extractors

Taiju Inagaki, Hayaru Shouno, Shoji Kido

Efficient and Approximate Simulation Algorithm of Kinetic Folding of an RNA Molecule

Takumi Tanigawa, Satoshi Kobayashi

DNA Logic Circuits with a DNA Polymerase and a Nicking Enzyme

Ryo Hirose, Satoshi Kobayashi, Ken Komiya

An Improved Shift Strategy for the Modified Discrete Lotka-Volterra with Shift Algorithm

Masami Takata, Takumi Yamashita, Akira Ajisaka, Kinji Kimura, Yoshimasa Nakamura

Evaluation of the SVM Based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books

Manami Fukuo, Yurie Enomoto, Naoko Yoshii, Masami Takata, Tsukasa Kimesawa, Kazuki Joe

Optimization of the Particle-based Volume Rendering for GPUs with Hiding Data Transfer Latency

Kyoko Nakao, Erika Matsui, Naoko Yoshii, Masami Takata, Kazuki Joe

A Real-time Analysis Environment for a Wireless BMI Device Enobio

Yu Ishikawa, Sanae Teramae, Naoko Yoshii, Masami Takata, Kazuki Joe

Distributed PACS using Network Shared File System

Tomoyuki Hiroyasu, Yoshiyuki Minamitani, Masato Yoshimi, Mitsunori Miki

A Framework for Genetic Algorithms in Parallel Environments

Tomoyuki Hiroyasu, Ryosuke Yamanaka, Masato Yoshimi, Mitsunori Miki

An Intelligent Lighting System to Realize Individual Lighting Environments Based on Estimated Daylight Distribution

Mitsunori Miki, Takuro Yoshii, Tomoyuki Hiroyasu, Masato Yoshimi, Hiroyuki Yonemoto

Event Detection using Archived Smart House Sensor Data Obtained Using Symbolic Aggregate Approximation

Ayaka Onishi, Chiemi Watanabe

Semi-ShuffledBF: Performance Improvement of a Privacy-Preserving Query Method for a DaaS Model Using a Bloom Filter

Shizuka Kaneko, Chiemi Watanabe, Toshiyuki Amagasa

Implementation and Performance Evaluation of New Inverse Iteration Algorithm with Householder Transformation in Terms of the Compact WY Representation

Hiroyuki Ishigami, Kinji Kimura, Yoshimasa Nakamura

Resultant-factorization Technique for Obtaining Solutions to Ordinary Differential Equations

Kinji Kimura, Hiroshi Yoshida

Hierarchical Visualization of Similarities between Probabilistic Distributions for Profiling

Akira Ito, Tomohiro Yoshikawa, Takeshi Furuhashi

Construction of a Mathematical Model and Quantitative Assessments of Impression in Western Painting

Sachi Urano

Abstraction of DNA Graph Structures for Efficient Enumeration and Simulation

Ibuki Kawamata, Fumiaki Tanaka, Masami Hagiya

A Heuristic Line Balancing Algorithm Accounting for Component Mounting Order

Hiroshige Tozaki, Hidenori Ohta, Mario Nakamori

SESSION: NOVEL APPLICATIONS AND ALGORITHMS + CUDA + GPU + GPGPU + MULTI-CORE + CLUSTER COMPUTING + I/O SYSTEMS + TOOLS

Scalable Data-Privatization Threading for Hybrid MPI/OpenMP Parallelization of Molecular Dynamics

Manaschai Kunaseth, David Richards, James Glosli, Rajiv Kalia, Aiichiro Nakano, Priya Vashishta

Efficient Data Access for Open Modeling Interface (OpenMI) Components

Tom Bulatewicz, Daniel Andresen

Leveraging Parallelism with CUDA and OpenCL

Song Park, Dale Shires, James Ross, David Richie

Distributed Parallel D8 Up-Slope Area Calculation in Digital Elevation Models

Richard Barnes, Clarence Lehman, David Mulla

Selecting the Best Tridiagonal System Solver Projected on Multi-Core CPU and GPU Platforms

Pablo Quesada-Barriuso, Julian Lamas-Rodriguez, Dora B. Heras, Montserrat Boo, Francisco Arguello

Parallel Merge Sort Implementation Using OpenMP

Jaeyoung Park, Kyoung-Gun Lee, Jong Tae Kim

Low-synchronisation Work Stealing under Parallel Data-List Processing in Multicores

Jorge Buenabad-Chavez, Miguel A. Castro-Garcia, Jose L. Quiroz-Fabian, Daniel M. Yellin, Graciela Roman-Alonso, Edgar F. Hernandez-Ventura

Analysis of GPGPU Platforms Efficiency in General-Purpose Computations

Pavel Kartashev, Vladislav Nazaruk

Study of Performance Issues on a SMP-NUMA System Using the Roofline Model

Juan Angel Lorenzo, Juan Carlos Pichel, Tomás F. Pena, Marcos Suarez, Francisco F. Rivera

Performance Modeling of Intel and Portland Compilers Using Westmere-Based Infiniband HPC Cluster

Muhammed Al-Mulhem, Raed Al-Shaikh

Predictive and Distributed Routing Balancing for HPC Clusters

Carlos Núñez Castillo, Diego Lugones, Daniel Franco, Emilio Luque

A Parallel Algorithm for the Verification of Covering Arrays

Himer Avila-George, Jose Torres-Jimenez, Vicente Hernandez, Nelson Rangel-Valdez

Methodology for Performance Evaluation of the Input/Output System

Sandra Mendez, Dolores Rexachs, Emilio Luque

Computational Aspects of Silicate Networks

Paul Manuel, Indra Rajasingh, Albert William, Antony Kishore

Improving Distributed Processing in the COPAR System

Stephen Hartley, Joel Crichlow, Michael Hosein

A Novel Cloud Computing Data Fragmentation Service Design for Distributed Systems

Ismail Hababeh

Parallelizing Tompa's Exact Algorithm for Finding Short Motifs in DNA

Christopher T. Mitchell, Jonathan Grochowski, Julian H. Dale, Nicolas B. Wilson, Jens Mache

RNS: Remote Node Selection for HPC Clusters

Seyedeh Leili Mirtaheri, Ehsan Mousavi Khaneghah, Siavash Ghiasvand, Mohammad Norouzi Arab, Ashkan Shirpour, Mohsen Sharifi

Multimerge

Fernando Couto, Fabio Couto

A Study of Memory Access Patterns in Irregular Parallel Codes Using Hardware Counter-Based Tools

Oscar G. Lorenzo, Juan Angel Lorenzo, Jose Carlos Cabaleiro, Dora B. Heras, Marcos Suarez, Juan Carlos Pichel

VLSI Parallel Sorter Architecture for Streaming Data

Dongjae Song, Kyoung Kun Lee, Soongyu Kwon, Jong Tae Kim