6926

Posts

Jan, 7

On-the-Fly Computing on Many-Core Processors in Nuclear Applications

Many nuclear applications still require more computational power than the current computers can provide. Furthermore, some of them require dedicated machines, because they must run constantly or no delay is allowed. To satisfy these requirements, we introduce computer accelerators which can provide higher computational power with lower prices than the current commodity processors. However, the […]
Jan, 7

Graduate Operating Systems: Project Report

Due to the high demand for secure Internet usage, an improvement of the SSL performance is needed. This paper describes a technique to improve the performance of SSL by creating a CPU/GPU hybrid proxy to sit in front of a web server to only handle the SSL overheads. This will allow the utilization of high […]
Jan, 7

Multiphase Fluid Simulations on a Multiple GPGPU PC Using Unsplit Time Integration VSIAM3

This talk presents the implementation of simulations on multiphase fluid dynamics on hardware of multiple GPGPU architecture by using robust and efficient numerical methods. An unsplit formulation for the advection computation is proposed to take the place of the original split formulation in the so-called VSIAM3 method. The new formulation improves dimensional symmetry of numerical […]
Jan, 6

GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement

In hardware-aware high performance computing, block- asynchronous iteration and mixed precision iterative refinement are two techniques that are applied to leverage the computing power of SIMD accelerators like GPUs. Although they use a very different approach for this purpose, they share the basic idea of compensating the convergence behaviour of an inferior numerical algorithm by […]
Jan, 6

Artifact-Free JPEG Decompression with Total Generalized Variation

We propose a new model for the improved reconstruction of JPEG (Joint Photographic Experts Group) images. Given a JPEG compressed image, our method first determines the set of possible source images and then specifically chooses one of these source images satisfying additional regularity properties. This is realized by employing the recently introduced Total Generalized Variation […]
Jan, 6

Hyper neural network on OpenCL

The goal of this thesis is to design and implement a hyper neural network that has a topology with limited number inputs of individual neurons and uses genetic programming as the learning algorithm. Parallelization of this neural network is done with use of OpenCL standard which allows running it on wide range of devices. From […]
Jan, 6

Lossless Compression of Variable-Precision Floating-Point Buffers on GPUs

In this work, we explore the lossless compression of 32-bit floating-point buffers on graphics hardware. We first adapt a state-of-the-art 16-bit floating-point color and depth buffer compression scheme for operation on 32-bit data and propose two specific enhancements: dynamic bucket selection and a Fibonacci encoder. Next, we describe a unified codec for any type of […]
Jan, 6

Optimizing a Biomedical Imaging Orientation Score Framework

A branch of Biomedical image processing involves analyzing images containing elongates structures. The enhancement of these structures in noisy image data is often required to enable automatic image analysis. A framework for such noise reduction based on Coherence Enhancing Diffusion (CED) using Orientation Scores (OS) has been developed. However, owing to the high computational complexity […]
Jan, 6

CodePy

The C/C++ metaprogramming toolkit for Python [16], CodePy [2], is analysed according to its source code generation possibility and its way to generate extension modules for Python. The combination of both results in generating C code in a Python script and executing it from within the same script. Insights are given on how this roundtrip […]
Jan, 6

Low-power Task Scheduling for GPU Energy Reduction

Graphics processing units (GPU) have been intensively used by high-performance computing applications. However, GPU’s large power consumption is a big issue coexisting with the high parallelism. Although Dynamic Voltage and Frequency Scaling (DVFS) [1] has been heavily studied and successfully applied to real products for saving CPU power consumption, DVFS is still relatively new for […]
Jan, 6

Multiple-GPU Scalability of Phase-Field Simulation for Dendritic Solidification

Mechanical properties of metallic materials like steel depend on the solidification process. In order to study the morphology of the microstructure in the materials, the phase-field model derived from the non-equilibrium statistical physics is applied and the interface dynamics is solved by GPU computing. Since very high performance is required, 3-dimensional simulations have not been […]
Jan, 6

Efficient 3D reconstruction of large-scale urban environments from street-level video

Recovering the 3-dimensional (3D) structure of a scene from 2-dimensional (2D) images is a fundamental problem in computer vision. This technology has many applications in computer graphics, entertainment, robotics, transportation, manufacturing, security, etc. One application is 3D mapping. For example, Google Earth and Microsoft Bing Maps provide a 3D virtual replica of many of the […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: