Posts
Oct, 24
Research for Chinese Spam Filtering Based on GPU
Spam has become an increasingly serious problem with the widespread use of e-mail. Content-based spam filtering is a mainstream technique for combating spam. However, the efficiency of the spam filtering algorithm becomes a bottleneck when it is used to train on a large number of mail samples or […]
Oct, 24
Dawn of GPU Era-Potentials of Chaos Theory
Chaos theory currently has tremendous potential in the computer science domain. Its true potential can be realized with the help of high-performance computing aids such as GPUs, which have recently become available. The main purpose is to develop a high-performance experimental laboratory in academic institutions, for […]
Oct, 24
Large-scale Monte Carlo simulation of two-dimensional classical XY model using multiple GPUs
We study the two-dimensional classical XY model by the large-scale Monte Carlo simulation of the Swendsen-Wang multi-cluster algorithm using multiple GPUs on the open science supercomputer TSUBAME 2.0. Simulating systems up to the linear system size L=65536, we investigate the Kosterlitz-Thouless (KT) transition. Using the generalized version of the probability-changing cluster algorithm based on the […]
Oct, 24
Floating-Point Arithmetic in Transport Triggered Architectures
Many computational applications have high performance and energy-efficiency requirements which "off-the-shelf" general-purpose processors cannot meet. On the other hand, designing special-purpose hardware accelerators can be prohibitively expensive in terms of development time. One approach to the problem is to design an Application-Specific Instruction set Processor (ASIP), which is programmable, but tailor-made for the task at […]
Oct, 23
Task Parallelism and Data Distribution: An Overview of Explicit Parallel Programming Languages
Programming parallel machines as effectively as sequential ones would ideally require a language that provides high-level programming constructs to avoid the programming errors frequent when expressing parallelism. Since task parallelism is considered more error-prone than data parallelism, we survey six popular and efficient parallel language designs that tackle this difficult issue: Cilk, Chapel, X10, Habanero-Java, […]
Oct, 23
Techniques for designing GPGPU games
The increasing level of realism in digital games depends not only on the enhancement of modeling and rendering effects, but also on the improvement of other aspects such as animation, character artificial intelligence and physics simulation. Normally, games process most of their tasks on the CPU, using the GPU only for graphics processing. Several games […]
Oct, 23
GPUSync: A Framework for Real-Time GPU Management
The integration of graphics processing units (GPUs) into real-time systems has recently become an active area of research. However, prior research on this topic has failed to produce real-time GPU allocation methods that fully exploit the available parallelism in GPU-enabled systems. In this paper, a GPU management framework called GPUSync is described that enables increased […]
Oct, 23
Overdetermined Shooting Methods for Computing Standing Water Waves with Spectral Accuracy
A high-performance shooting algorithm is developed to compute time-periodic solutions of the free-surface Euler equations with spectral accuracy in double and quadruple precision. The method is used to study resonance and its effect on standing water waves. We identify new nucleation mechanisms in which isolated large-amplitude solutions, and closed loops of such solutions, suddenly exist […]
Oct, 23
An Approach for Traffic Forecast with GPU Computing & Cellular Automata Model
Traffic forecasting is of practical interest to modern society, mainly for minimizing the effects of traffic jams caused by road saturation, as well as for predicting the impact of road interventions. To this end, one family of computational methods that represent basic traffic characteristics is based on Cellular Automata (CA). Moreover, the simulation of […]
Oct, 22
A (ir)regularity-aware task scheduler for heterogeneous platforms
This paper addresses the design, implementation and validation of an effective scheduling scheme for both regular and irregular applications on heterogeneous platforms. The scheduler uses an empirical performance model to dynamically schedule the workload, organized into a given number of chunks, and follows the Heterogeneous Earliest Finish Time (HEFT) scheduling algorithm, which ranks the tasks […]
Oct, 22
Hybrid MPI/GPU Interpolation for Grid DEM Construction
The proliferation of lidar technology in remote sensing has resulted in extremely large, high-resolution point clouds covering a wide variety of terrain. Constructing a grid digital elevation model (DEM) from these large data sets requires extensive computational resources and ample disk space. We propose a framework for leveraging modern computing resources including multi-core distributed […]
Oct, 22
Streaming Dynamic Coarse-Grained CPU/GPU Workloads with Heterogeneous Pipelines in FastFlow
Software pipelines permit the decomposition of a repetitive sequential process into a succession of distinguishable sub-processes called stages, each of which can be concurrently executed on a distinct processing element. This paper presents a heterogeneous streaming pipeline implementation using the FastFlow skeletal library for a numerical linear algebra code. By introducing minimal memory management, we […]