Posts

Aug, 6

GPU Acceleration of Graph Matching, Clustering, and Partitioning

We consider sequential algorithms for hypergraph partitioning and GPU (i.e., fine-grained shared-memory parallel) algorithms for graph partitioning and clustering. Our investigation into sequential hypergraph partitioning is concerned with the efficient construction of high-quality matchings for hypergraph coarsening and optimisation with respect to general hypergraph partitioning quality metrics. We introduce the l*(l-1)-metric which exactly measures the […]
Aug, 6

GPU Programming in Functional Languages: A Comparison of Haskell GPU Embedded Domain Specific Languages

Graphics Processing Units (GPUs) are known to be excellent computation accelerators. However, their approach to data processing is very different from that of regular CPUs, which makes these devices harder for a regular developer to program. In the past few years, several frameworks were introduced to simplify the programming of GPU devices. Accelerate and Obsidian are […]
Aug, 5

GPGPU Implementation of a Generative Modelling Language

We record and digitally preserve public monuments by describing them using a generative language. Such generative models allow visualisations of the monuments to be computed from descriptions that are suitable for transmission to personal mobile devices. The intent of using a personal mobile device is to make the cultural heritage experience more personal to the […]
Aug, 5

An Ultra-Fast, Optimized and Massively-Parallelized Curvelet Transform Algorithm on GP-GPUs

The Curvelet transform is one of the most powerful time-frequency representations of an image. However, since it is not a fast algorithm, it cannot be employed in most real-time and/or large-scale applications. This paper proposes a novel algorithm to speed up the Curvelet transform by both optimizing it for repetitive Curvelet usage and […]
Aug, 5

Attack Signature Matching using Graphics Processors in High-Performance Intrusion Detection Systems

Network Intrusion Detection Systems (NIDS), which must perform a time-consuming evaluation of every packet received from the network, face a throughput challenge as a result of the increasing speed of network communications and the high volume of Internet threats. In an NIDS, the most important and time-consuming processes are pattern matching and deep inspection of […]
Aug, 5

Simulating a Family of Tissue P Systems Solving SAT on the GPU

In order to provide efficient software tools to deal with large membrane systems, high-throughput simulators are required. Parallel computing platforms are good candidates, since they are capable of partially implementing the inherently parallel nature of the model. In this regard, GPUs (Graphics Processing Units) are now considered highly parallel processors, and they are being […]
Aug, 5

Ray Tracing in the Cloud using MapReduce

We present the Hadoop Online Ray Tracer (HORT), a scalable ray tracing framework for general, pay-as-you-go cloud computing services. Using MapReduce, HORT partitions the computational workload and scene data differently from other distributed-memory ray tracing frameworks. We show that this unique partitioning significantly bounds the data replication costs and inter-process communication. Consequently, HORT is […]
Aug, 3

AQUAgpusph, a free 3D SPH solver accelerated with OpenCL

In this paper, AQUAgpusph, a new free SPH software licensed under GPLv3 and accelerated using OpenCL, will be described. Its main differences with respect to other GPU-based SPH implementations will be discussed, focusing first on the fact that it is accelerated with OpenCL, second on the wide range of solid boundary condition enforcing methods have […]
Aug, 3

Strategies for Optimization of Parallel Programs

Multi-core processors are present in most forms of computing, from pocket-size smartphones to supercomputers. Consequently, parallel and concurrent programming has reemerged as a pressing concern for everyone interested in exploiting all the potential computational power in these machines. Writing parallel, and especially concurrent, programs is not a trivial task as it requires a different […]
Aug, 2

Real-Time Electroholography Using a Multi-GPU Environmental PC

We report real-time electroholography using a compact system composed of a multi-GPU environmental PC with four Kepler-architecture GPUs. Our system can calculate a 1,920×1,024-pixel computer-generated hologram (CGH) from a 3D object composed of 10,240 points in 40.3 ms.
Aug, 2

DRiVE: An Example of Distributed Rendering in Virtual Environments

Most Virtual Reality (VR) applications use rendering methods which implement local illumination models, simulating only direct interaction of light with 3D objects. They do not take into account the energy exchange between the objects themselves, making the resulting images look non-optimal. The main reason for this is the simulation of global illumination having a high […]
Aug, 2

Large-Scale Sound Field Rendering in Rectangular Room with Specular Reflection

Sound field rendering is a technique for computing the sound field from three-dimensional numerical models constructed in the computer, and it follows the same concept as graphics rendering in computer graphics. In this paper, a GPU (Graphics Processing Unit) cluster system is applied to the sound field rendering for a large […]

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org