Posts
Nov, 8
Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines
We examine the problem of optimizing classification tree evaluation for on-line and real-time applications by using GPUs. Looking at trees with continuous attributes often used in image segmentation, we first put the existing algorithms for serial and data-parallel evaluation on solid footings. We then introduce a speculative parallel algorithm designed for single instruction, multiple data […]
Nov, 7
Flocking Implementation for the Blender Game Engine
In this thesis, we discuss the development of a new Boids system that simulates flocking behavior inside the Blender Game Engine and within the framework of the Real-Time Particles System (RTPS) library developed by Ian Johnson. The collective behavior of Boids is characterized as an emergent behavior caused by following three steering behaviors: separation, alignment, […]
Nov, 7
High-Level Design for FPGA-based Multiprocessor Accelerators
Field programmable gate arrays (FPGAs) have the potential to accelerate scientific computing applications due to their highly parallel architecture. However, for programming these architectures efficiently, hardware description languages (HDL), such as Verilog or VHDL, have to be used. Many application developers are not familiar with these HDL languages, because they traditionally develop their applications using […]
Nov, 7
GPUinspiral – a low-latency, high-performance implementation of the matched-filter gravitational wave search algorithm
A very high performance search pipeline has been developed for the search for gravitational wave signals originating from coalescing compact binary systems in the M<35 MSUN mass range. The goal of this research is to provide a solution to some of the so far computationally unfeasible data analysis methods such as for example the filtering […]
Nov, 7
MELT-a Translated Domain Specific Language Embedded in the GCC Compiler
The GCC free compiler is a very large software, compiling source in several languages for many targets on various systems. It can be extended by plugins, which may take advantage of its power to provide extra specific functionality (warnings, optimizations, source refactoring or navigation) by processing various GCC internal representations (Gimple, Tree, …). Writing plugins […]
Nov, 7
Functional Programming for High-Performance Computing on Heterogeneous Architectures
Heterogeneous architectures become dominant in high-performance computing platforms but programming them remains really hard, especially because high-performance programs are usually written using low-level languages (C, Fortran, OpenMP…) and frameworks (CUDA, OpenCL…). Mid-level frameworks have been introduced to automatically perform management of distributed memory and scheduling on different devices, allowing applications to only submit tasks and […]
Nov, 7
Fast TV-L1 Optical Flow for Interactivity
Vision is a natural tool for human-computer interaction, since it pro- vides visual feedback to the user and mimics some human behaviors. It requires however the fast and robust computation of motion primi- tives, which remains a difficult problem. In this work, we propose to apply some recent mathematical results about convex optimization to the […]
Nov, 7
Keeneland: Bringing heterogeneous GPU computing to the computational science community
The Keeneland project’s goal is to develop and deploy an innovative, GPU-based high-performance computing system for the NSF computational science community.
Nov, 7
A GPU accelerated interactive interface for exploratory functional connectivity analysis of fMRI data
Functional connectivity analysis is a way to investigate how different parts of the brain are connected and interact. A common measure of connectivity is the temporal correlation between a reference voxel time series and all the other time series in a functional MRI data set. An fMRI data set generally contains more than 20,000 within-brain […]
Nov, 7
Acceleration of physics simulation engine through OpenCL
Nowadays, physics simulation is a relevant topic in several domains, from scientific areas like medicine to entertainment purposes such as movie’s effects, computer animation and games. To make easier the production of faster simulations, developers are using physics engines because they provide a variety of features like rigid and deformable body simulation, fluids dynamics and […]
Nov, 7
Computer Simulation of Dark Matter Effects on Galaxy Rotation
The main goal of this project is to develop a computer program to model the rotation of a galaxy including dark matter. The computer program will be used to answer these questions: (1) How does dark matter affect rotational curves in galaxies; (2) how accurately can this be modeled; (3) what will happen when the […]
Nov, 7
Fast Computation of Dipole Radiation in Stratified Background Using Graphics Processing Unit
We present the GPUs computation acceleration for a very recurrent electromagnetic problem which is the calculation of the field radiated by electric dipoles in a multilayer structure (Green’s tensor in stratified background), based on the well-known Sommerfeld integrals. Using an optimized parallelization scheme, huge computation acceleration is obtained. Applications of such a work are very […]