Posts
Jul, 15
Coupling between Meshless FEM Modeling and Rendering on GPU for Real-time Physically-based Volumetric Deformation
For real-time rendering of physically-based volumetric deformation, a meshless finite element method (FEM) is proposed and implemented on the new-generation Graphics Processing Unit (GPU). A tightly coupled deformation and rendering pipeline is defined for seamless modeling and rendering: First, the meshless FEM model exploits the vertex shader stage and the transform feedback mechanism of the […]
Jul, 15
ab-Stream: A Framework for programming Many-core
The common approach to program many-core processor is to write processor-specific code with low level APIs for different processors, which could achieve good performance but would result in serious portability issues: programmers are required to write a specific version code for target architecture. Therefore, we present ab-Stream, an extensible framework for programming many-threaded processor based […]
Jul, 15
Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU
This paper presents results of an implementation of code generator for fast general matrix multiply (GEMM) kernels. When a set of parameters is given, the code generator produces the corresponding GEMM kernel written in OpenCL. The produced kernels are optimized for high-performance implementation on GPUs from AMD. Access latencies to GPU global memory is the […]
Jul, 15
Distributed OpenCL Distributing OpenCL Platform on Network Scale
This paper presents a framework that extends OpenCL by distributing computing process to many computing resources connected via network and enables the computing resources to run in parallel. Using JSON RPC (Remote Procedure Call technique relying on JavaScript Object Notation) in communication layer, Distributed OpenCL framework provides platform and operating system independency. Using this framework, […]
Jul, 15
A Performance Model for Memory Bandwidth Constrained Applications on Graphics Engines
Graphics engines are excellent execution platforms for high-throughput computations that exploit a large degree of available parallelism. The achieved performance is, however, highly dependent on the access patterns that the application imposes on the memory subsystem. Here, we propose an analytic model that helps improve the understanding of the performance of memory-limited kernels that employ […]
Jul, 14
Optimizing All-to-All and Allgather Communications on GPGPU Clusters
High Performance Computing (HPC) is rapidly becoming an integral part of Science,Engineering and Business. Scientists and engineers are leveraging HPC solutions to run their applications that require high bandwidth, low latency, and very high compute capabilities. General Purpose Graphics Processing Units (GPGPUs)are becoming more popular within the HPC community because of their highly parallel structure, […]
Jul, 14
New Techniques for Spectral Image Acquisition and Analysis
This thesis describes typical spectral imaging techniques and spectral image analysis algorithms that are in general use. Three developed spectral imaging systems are proposed. The first imaging system consists of two line scanning based spectral cameras. These cameras are combined in one simultaneous measuring process, which can be used for capturing a wide range of […]
Jul, 14
Implementing the Approximate Message Passing (AMP) Algorithm on a GPU
We consider the recovery of sparse signals from a limited number of noisy observations using the AMP algorithm. In this paper, we present two fast implementations of this algorithm that run on a CPU and on a GPU and which can either be used for arbitrary unstructured measurement matrices or take advantage of the structure […]
Jul, 14
A Survey of Cloud Lighting and Rendering Techniques
The rendering of participating media still forms a big challenge for computer graphics. This remark is particularly true for real-world clouds with their inhomogeneous density distributions, large range of spatial scales and different forms of appearance. We survey techniques for cloud visualization and classify them relative to the type of volume representation, lighting and rendering […]
Jul, 14
GPU Based Computation of the Structural Tensor for Real-Time Figure Detection
In this paper we present a real-time realization of the method of detection of local structures in images of predefined orientation. The method is based on an analysis of the structural tensor computed in monochrome and color images. Thanks to the GPU implementation of the low-level feature detection an order-of-magnitude speed-up was achieved compared to […]
Jul, 13
International Conference on Parallel Computing 2013, ParCo2013
ParCo2013 continues the tradition of the international conferences on parallel computing started in Berlin, Germany in 1983. This makes it one of the longest running international conferences on parallel computing. Over the years the conference established itself as the foremost platform for exchanging know-how on the newest parallel computing strategies, technologies, methods and tools. The […]
Jul, 13
International Conference on Computational Physics, ICCP 2013
The XXXIV International Conference on Computational Physics is the premier forum for the presentation of new advances and research results in the fields of Computational Physics. The conference will bring together leading academic scientists, researchers and scholars in the domain of interest from around the world. Topics of interest for submission include, but are not […]