Posts
Apr, 22
High-Speed Private Information Retrieval Computation on GPU
A Private Information Retrieval (PIR) scheme is a protocol in which a user retrieves a record out of n from a replicated database, while hiding from the database which record has been retrieved, as long as the different replicas do not collude. A specially interesting sub-field of research, called single-database PIR, deals with the schemes […]
Apr, 22
Challenges of mapping financial analytics to many-core architecture
Summary form only given. In the past 20 years there has been an explosive growth of the variety of traded financial instruments, from European and American options to a more complex, alas ill-fated, credit derivatives. The rapid increase in computational power coupled with the use of mathematical tools for valuing these instruments and estimating the […]
Apr, 22
MITHRA: Multiple data independent tasks on a heterogeneous resource architecture
With the advent of high-performance COTS clusters, there is a need for a simple, scalable and fault-tolerant parallel programming and execution paradigm. In this paper, we show that the popular MapReduce programming model can be utilized to solve many interesting scientific simulation problems with much higher performance than regular cluster computers by leveraging GPGPU accelerators […]
Apr, 22
Fast generating of a digital hologram using general-purpose computation on graphics processing units
In this paper, we propose a method for fast generating a digital hologram using General-Purpose Computation on Graphics Processing Units (GPGPU). This method can reduce the computational time of generating a digital hologram by using the parallel processing with CUDA. And we demonstrate the effectiveness of our algorithm through a variety of experiment.
Apr, 22
Preliminary implementation of two parallel programs for fractal image coding on GPUs
GPGPU (General Purpose computing on Graphic Processing Unit) attracts a great deal of attention, which is used for general-purpose computations like numerical calculations as well as graphic processing. In this paper, we implement Fractal image coding algorithms on GPUs by using CUDA (Compute Unified Device Architecture) and evaluate the effectiveness of the shared memory using […]
Apr, 22
Parallel Zigzag Scanning and Huffman Coding for a GPU-based MPEG-2 Encoder
GPUs excel in parallel computations, so they are very efficient calculating the discrete cosine transform of spatial domain images, as required for video encoding. The last steps of MPEG-2 compression, however, are inherently sequential since they require a serial processing of the resulting DCT coefficients. As that can easily become a bottleneck in GPUbased video […]
Apr, 22
Accelerating the numerical simulation of magnetic field lines in tokamaks using the GPU
trip3d is a field line simulation code that numerically integrates a set of nonlinear magnetic field line differential equations. The code is used to study properties of magnetic islands and stochastic or chaotic field line topologies that are important for designing non-axisymmetric magnetic perturbation coils for controlling plasma instabilities in future machines. The code is […]
Apr, 22
Scalable Clustering Using Graphics Processors
We present new algorithms for scalable clustering using graphics processors. Our basic approach is based on k-means. By changing the order of determining object labels, and exploiting the high computational power and pipeline of graphics processing units (GPUs) for distance computing and comparison, we speed up the k-means algorithm substantially. We introduce two strategies for […]
Apr, 22
GPU accelerated simulations of 3D deterministic particle transport using discrete ordinates method
Graphics Processing Unit (GPU), originally developed for real-time, high-definition 3D graphics in computer games, now provides great faculty in solving scientific applications. The basis of particle transport simulation is the time-dependent, multi-group, inhomogeneous Boltzmann transport equation. The numerical solution to the Boltzmann equation involves the discrete ordinates (Sn) method and the procedure of source iteration. […]
Apr, 22
AMD Fusion Developer Summit 2011, AFDS 2011
Heterogeneous computing is moving into the mainstream, and a broader range of applications are already on the way. As the provider of world-class CPUs, GPUs, and APUs, AMD offers unique insight into these technologies and how they interoperate. Attend the AMD Fusion Developer Summit to learn about the opportunities that lie ahead.
Apr, 21
Pretty Good Accuracy in Matrix Multiplication with GPUs
With systems such as Road Runner, there is a trend in super computing to offload parallel tasks to special purpose co-processors, composed of many relatively simple scalar processors. The cheaper commodity class equivalent of such a processor would be the graphics card, potentially offering super computer power within the confines of a desktop PC. Graphics […]
Apr, 21
Fast Variable Center-Biased Windowing for High-Speed Stereo on Programmable Graphics Hardware
We present a high-speed dense stereo algorithm that achieves both good quality results and very high disparity estimation throughput on the graphics processing unit (GPU). The key idea is a variable center-biased windowing approach, enabling an adaptive selection of the most suitable support patterns with varying sizes and shapes. As the fundamental construct for variable […]