12591

Posts

Jul, 28

OMP2HMPP: HMPP Source Code Generation from Programs with Pragma Extensions

High-performance computing are based more and more in heterogeneous architectures and GPGPUs have become one of the main integrated blocks in these, as the recently emerged Mali GPU in embedded systems or the NVIDIA GPUs in HPC servers. In both GPGPUs, programming could become a hurdle that can limit their adoption, since the programmer has […]
Jul, 28

Understanding the ISA impact on GPU Architecture

The wide spread acceptance of GPU for parallel computation has created the demand for general purpose capabilities in GPU. In response, Industry is coming up rapidly with better architecture to support general purpose processing on GPUs. NVIDIA has come up with Tesla, Fermi and Kepler architecture. General Purpose Graphics Processing Units (GPGPU) are widely being […]
Jul, 28

Agent-based crowd simulation using GPU computing

The purpose of the research is to investigate agent-based approaches to virtual crowd simulation. Crowds are ubiquitous and are becoming an increasingly common phenomena in modern society, particularly in urban settings. As such, crowd simulation systems are becoming increasingly popular in training simulations, pedestrian modelling, emergency simulations, and multimedia. One of the primary challenges in […]
Jul, 28

Simulating spiking neural networks on massively parallel graphical processing units using a code generation approach with GeNN

A major challenge in computational neuroscience is to achieve high performance for real-time simulations of full size brain networks. Recent advances in GPU technology provide massively parallel, low-cost and efficient hardware that is widely available on the computer market. However, the comparatively low-level programming that is necessary to create an efficient GPU-compatible implementation of neuronal […]
Jul, 28

Improved Finite Difference Schemes for a 3-D Viscothermal Wave Equation on a GPU

Viscothermal effects in air lead to a damping of high frequencies over time. Such effects cannot be neglected in large-scale room acoustics simulations for the full audible bandwidth. In this study, full-bandwidth room acoustics is modelled using a variant of the three-dimensional wave equation including viscothermal losses in air following from a simplification of the […]
Jul, 28

CUDT: A CUDA Based Decision Tree Algorithm

Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in […]
Jul, 26

Towards fast and certified multiple-precision libraries

Many numerical problems require higher precision than the conventional floating-point (single, double) formats. One solution is to use multiple precision libraries such as GNU MPFR, which allow the manipulation of very high precision numbers. But their generality (they are able to handle numbers with millions of digits), is a quite heavy alternative when medium precision […]
Jul, 26

Parallel Computations for Hierarchical Agglomerative Clustering using CUDA

Graphics Processing Units (GPU) in today’s desktops can well be thought of as a high performance parallel processor. Traditionally, parallel computing is the usage of multiple computing resources to execute computational problems simultaneously. Such computations are possible using multi-core CPUs or computers with multiple CPUs or by using a network of computers in parallel. Today’s […]
Jul, 26

Performance Efficient DNA Sequence Detection on GPU Using Parallel Pattern Matching Approach

Bioinformatics is the field of science which applies computer science and information technology to the problems of biological science. One of the most useful applications of bioinformatics is sequence analysis. Sequence analysis, which is the process of subjecting a DNA, RNA to any wide range of analytical approaches, involves methodologies like sequence alignment and searches […]
Jul, 26

ReGen: Optimizing Genetic Selection Algorithms for Heterogeneous Computing

GenSel is a genetic selection analysis tool used to determine which genetic markers are informational for a given trait. Performing genetic selection related analyses is a time consuming and computationally expensive task. Due to an expected increase in the number of genotyped individuals, analysis times will increase dramatically. Therefore, optimization efforts must be made to […]
Jul, 26

Parallel solutions of static Hamilton-Jacobi equations for simulations of geological folds

Two new algorithms for numerical solution of static Hamilton-Jacobi equations are presented. These algorithms are designed to work efficiently on different parallel computing architectures, and numerical results for multicore CPU and GPU implementations are reported and discussed. The numerical experiments show that the proposed solution strategies scale well with the computational power of the hardware. […]
Jul, 24

ADHA: Automatic Data layout framework for Heterogeneous Architectures

Data layouts play a crucial role in determining the performance of a given application running on a given architecture. Existing parallel programming frameworks for both multicore and heterogeneous systems leave the onus of selecting a data layout to the programmer. Therefore, shifting the burden of data layout selection to optimizing compilers can greatly enhance programmer […]

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org