Posts
Jun, 11
2nd International Workshop on GPUs and Scientific Applications, GPUScA 2011
Held in conjunction with PACT 2011. GPUs are cost-effective platforms for computational intensive applications providing tremendous peak performance. However, it is a major challenge to deliver the intrinsic performance of such architectures to end applications. The goal of this workshop is to bring together GPU experts with computational science experts. The workshop addresses programming approaches […]
Jun, 11
IEEE 18th International Symposium on High Performance Computer Architecture, HPCA 2012
The International Symposium on High-Performance Computer Architecture (HPCA 2012) provides a high-quality forum for scientists and engineers to present their latest research findings in this rapidly-changing field. Authors are invited to submit papers on all aspects of high-performance computer architecture. Topics of interest include, but are not limited to: Processor, cache and memory architectures Parallel […]
Jun, 10
Accelerating light scattering simulations of nanostructures by reconfigurable computing
In order to characterize nanostructures and nanosurfaces in production processes, measuring methods based on light scattering gain increasing importance. Thus the simulation capability of laser light scattering on surfaces with a size of several hundred or thousand wavelenghts in diameter and light scattering models on the nanometer scale are required to validate these new measurement […]
Jun, 10
Massively LDPC Decoding on Multicore Architectures
Unlike usual VLSI approaches necessary for the computation of intensive Low-Density Parity-Check (LDPC) code decoders, this paper presents flexible software-based LDPC decoders. Algorithms and data structures suitable for parallel computing are proposed in this paper to perform LDPC decoding on multicore architectures. To evaluate the efficiency of the proposed parallel algorithms, LDPC decoders were developed […]
Jun, 10
CUDA Based Fast Implementation of Very Large Matrix Computation
CUDA (Compute Unified Device Architecture) acceleration of very large scale matrix-vector and matrix-matrix multiplication is presented in this paper. The intrinsic parallelism in the matrix computations are exploited thoroughly. By dividing the entire matrix computation to multiple sub-groups, scalable performance improvement can be achieved using multiple GPUs. The key operations are accelerated by GPU. And […]
Jun, 10
Planetary-Scale Terrain Composition
Many interrelated planetary height map and surface image map data sets exist, and more data are collected each day. Broad communities of scientists require tools to compose these data interactively and explore them via real-time visualization. While related, these data sets are often unregistered with one another, having different projection, resolution, format, and type. We […]
Jun, 10
The Research of Real-Time Shadow Rendering Algorithm of Virtual Scenes
Shadow scenes by shadow mapping has long suffered from the problem of under-sampling artifacts due to too little shadow map resolution leading to so-called perspective and projection aliasing. On this issue, we present a new practical real-time shadow mapping algorithm. Firstly we sample the scene from the eye-point on the GPU to get the needed […]
Jun, 10
Accelerating Multi-layer Perceptron based short term demand forecasting using Graphics Processing Units
Load forecasting plays a vitally important role in the operation and planning of the power system in a deregulated electricity market. A large variety of methods have been proposed for load forecasting. In this paper, we introduce the Graphics Processing Units (GPU) based computing to accelerate the short term load forecasting with multi-layer perceptron (MLP). […]
Jun, 10
The scoring sequences on profile Hidden Markov Models with delete states elimination by GPUs
A profile Hidden Markov Model (HMM) is well suited for representing profiles of multiple sequences alignments, and it has been becoming the main method of multiple sequences alignments in bioinformatics. The scoring of sequences on profile HMMs is compute-intensive, especially when there are many Markov models and many states in each model. A parallel algorithm […]
Jun, 10
Real-time rain simulation in cartoon style
An efficient method for simulating cartoon style rain in 3D environment is proposed here. By taking advantage of the parallelism and programmability of GPUs (graphic processing units), real-time interaction can be achieved. Splashing of raindrop is simulated using collision detection, series of stylized textures and rotations of point sprites. To simulate wind-driven raining effect, the […]
Jun, 10
Real-time rendering of large-scale tree scene
High-quality, realistic visualization of vegetation and tree model is always a long-standing goal of complex virtual natural scene. Rendering a photo-realistic forest scene in real time has an important significance in simulating the growing tree. In this paper, we present a method of 3D tree modeling and a hybrid rendering algorithm of large-scale forest scene […]
Jun, 10
Handwritten Digit Recognition with a Committee of Deep Neural Nets on GPUs
The competitive MNIST handwritten digit recognition benchmark has a long history of broken records since 1998. The most recent substantial improvement by others dates back 7 years (error rate 0.4%) . Recently we were able to significantly improve this result, using graphics cards to greatly speed up training of simple but deep MLPs, which achieved […]