high performance computing on graphics processing units: hgpu.org

Posts

Dec, 8

The Sixth International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, HEART 2015

The HEART symposium is an international forum on state-of-the-art research in high-performance and power-efficient computing using accelerator technologies such as FPGAs, GPGPUs, and/or specialized accelerators. The sixth edition of HEART will take place in Boston, MA, USA. The Sixth International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART) is a forum to present and […]

Dec, 8

Computer Graphics International, CGI’15

Computer Graphics International is one of the oldest truly international conferences in Computer Graphics and one of the five most important ones worldwide. It is an essential yearly meeting where academics present their latest models and technologies, and explore new trends and ideas. In previous years, it has been held in many different places […]

Dec, 8

19th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, KES 2015

The conference encompasses a broad spectrum of subjects related to intelligent systems. The following list provides examples of applicable topics; however, the list is not meant to exclude other applicable areas. Generic Topics of Interest: Automated Design and Configuration of Sensory Systems, Self-x principles in Intelligent Engineering Systems, Knowledge-Based Systems, Expert Systems, Cognitive Systems, Neural Networks, […]

Dec, 8

International Conference on Parallel Computing 2015, ParCo2015

Section 1: Algorithms. Design, analysis, and implementation of parallel algorithms in science and engineering, focusing on issues such as: scalability and speedup; efficient utilization of the memory hierarchy; communication and synchronization; data management and exploration; energy efficiency. The parallel computing aspects should be emphasized. Section 2: Software and Architectures. Software engineering for developing and maintaining […]

Dec, 7

Massively Parallel A* Search on a GPU

A* search is a fundamental topic in artificial intelligence. Recently, general-purpose computation on graphics processing units (GPGPU) has been widely used to accelerate numerous computational tasks. In this paper, we propose the first parallel variant of the A* search algorithm such that the search process of an agent can be accelerated by a […]

CUDA

Dec, 7

Big Integer Multiplication with CUDA FFT (cuFFT) Library

It is well recognized in the computer algebra theory and systems communities that the Fast Fourier Transform (FFT) can be used for multiplying polynomials. Theory predicts that it is fast for "large enough" polynomials. The basic idea is to use fast polynomial multiplication to perform fast integer multiplication. We can achieve really fast FFT multiplication […]

CUDA
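
The idea in the excerpt above, multiplying integers by multiplying polynomials with the FFT, can be illustrated with a minimal CPU-only sketch. This is not the paper's cuFFT implementation: the function names `fft` and `multiply` are hypothetical, the transform is a plain recursive radix-2 FFT in pure Python, and base 10 digits are used as coefficients only for readability. Floating-point rounding limits this sketch to moderately sized inputs.

```python
import cmath


def fft(a, invert=False):
    """Recursive radix-2 FFT; len(a) must be a power of two.

    The inverse transform is returned unscaled (the caller divides by n).
    """
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], invert)
    odd = fft(a[1::2], invert)
    sign = -1 if invert else 1
    out = [0j] * n
    for k in range(n // 2):
        w = cmath.exp(sign * 2j * cmath.pi * k / n)  # twiddle factor
        t = w * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def multiply(x, y):
    """Multiply two non-negative integers via FFT polynomial multiplication."""
    if x == 0 or y == 0:
        return 0
    # Decimal digits, least significant first, act as polynomial coefficients.
    a = [int(d) for d in reversed(str(x))]
    b = [int(d) for d in reversed(str(y))]
    # Pad to a power of two large enough to hold the full product.
    n = 1
    while n < len(a) + len(b):
        n *= 2
    fa = fft(a + [0] * (n - len(a)))
    fb = fft(b + [0] * (n - len(b)))
    # Pointwise product in the frequency domain is convolution of the digits.
    prod = fft([p * q for p, q in zip(fa, fb)], invert=True)
    coeffs = [int(round(c.real / n)) for c in prod]
    # Evaluate the product polynomial at 10 (Horner's rule); this also
    # resolves the carries between digit positions.
    result = 0
    for c in reversed(coeffs):
        result = result * 10 + c
    return result
```

A quick check is to compare `multiply(x, y)` against Python's built-in `x * y`. A GPU version would replace the recursive `fft` with batched library transforms and would need a more careful choice of digit base to control rounding error.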

Dec, 7

Rendering Volumetric Haptic Shapes in Mid-Air using Ultrasound

We present a method for creating three-dimensional haptic shapes in mid-air using focused ultrasound. This approach applies the principles of acoustic radiation force, whereby the non-linear effects of sound produce forces on the skin which are strong enough to generate tactile sensations. This mid-air haptic feedback eliminates the need for any attachment of actuators or […]

OpenCL

Dec, 7

Parallel GPU Processing for Fast Radio Signal Propagation Computation in GRASS-RaPlaT

Radio propagation simulation tools are important for prediction and verification of the radio signal coverage by individual transmitters or transmitter networks such as mobile phone cellular networks. In the case of a large geographic area with a relatively high resolution, the simulation can become computationally demanding, taking a considerable amount of time to complete. Parallel […]

OpenCL

Dec, 7

Neural Networks through Shared Maps in Mobile Devices

We introduce a hybrid system composed of a convolutional neural network and a discrete graphical model for image recognition. This system improves upon traditional sliding window techniques for analysis of an image larger than the training data by effectively processing the full input scene through the neural network in less time. The final result is […]

OpenGL

Dec, 5

CUBPT: Lock-free bulk insertions to B+ tree on GPU architecture

The B+-tree is one of the most widely used index structures. To improve the insertion process, several batch algorithms have been proposed, all of which use one thread to complete one node insertion and cannot make full use of the GPU's parallel throughput. So, a batch building and insertion method on GPU named CUBPT is proposed in this paper. During the […]

CUDA

Dec, 5

Coulomb and Landau Gauge Fixing in GPUs using CUDA and MILC

In this work, we present the GPU implementation of the overrelaxation method and the steepest descent method with Fourier acceleration for Landau and Coulomb gauge fixing using CUDA for SU(N) with N>2. A multi-GPU implementation of the overrelaxation method is also presented using MPI and CUDA. The GPU performance was measured on BlueWaters and compared against […]

Dec, 5

Software Polarization Spectrometer "PolariS"

We have developed a software-based polarization spectrometer, PolariS, to acquire full-Stokes spectra with a very high spectral resolution of 61 Hz. The primary aim of PolariS is to measure the magnetic fields in dense star-forming cores by detecting the Zeeman splitting of molecular emission lines. The spectrometer consists of a commercially available digital sampler and […]

CUDA