Posts

Oct, 25

On the Efficiency of CPU and Hybrid CPU-GPU Systems in Computational Biology Tasks

The complexity and diversity of computational biology tasks require a deliberate approach to computational resource management. We analyzed the performance of common CPU and hybrid CPU-GPU hardware configurations in molecular dynamics and homology modeling tasks. Our results show that on dual-processor nodes it is overall more efficient to execute two […]
Oct, 25

Medical imaging using CUDA

As multiple sclerosis is known to cause atrophy and deformation in the brain, it also influences the shape and size of the corpus callosum. Longitudinal studies try to quantify these changes using medical image analysis techniques for measuring and analyzing the shape and size of a corpus callosum cross-section embedded in a specially selected measurement […]
Oct, 25

CUVLE: Variable-Length Encoding on CUDA

Data compression is the process of representing information in a compact form, in order to reduce the storage requirements and, hence, communication bandwidth. It has been one of the critical enabling technologies for the ongoing digital multimedia revolution for decades. In the variable-length encoding (VLE) compression method, most frequently occurring symbols are replaced by codes […]
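As a rough illustration of the variable-length encoding idea described above, the sketch below builds a Huffman code in Python, so that more frequent symbols receive shorter bit strings. Huffman coding is used here only as a familiar stand-in; it is an assumption and not necessarily the code construction used by CUVLE, and the function names `huffman_code` and `encode` are hypothetical.

```python
import heapq
from collections import Counter

def huffman_code(data):
    """Build a prefix-free code in which frequent symbols get shorter bit strings.

    Illustrative Huffman construction; not taken from the CUVLE paper.
    """
    freq = Counter(data)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, unique tie-breaker, {symbol: code built so far}).
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        # Merge the two lightest subtrees, prefixing their codes with 0 and 1.
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]

def encode(data, code):
    """Concatenate the code word of every symbol in the input."""
    return "".join(code[s] for s in data)

data = "aaaaaaaabbbc"
code = huffman_code(data)
bits = encode(data, code)
```

For this input, a fixed-length code would need 2 bits per symbol (24 bits in total), while the variable-length code spends a single bit on the dominant symbol `a`.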
Oct, 25

5th International Conference on Computer Communication and Management, ICCCM 2015

Submission Deadline: 2015-03-01. Publication: Conference papers can be selected for publication in the International Journal of Computer and Communication Engineering (IJCCE) or the Journal of Advanced Management Science (JOAMS); excellent papers will be selected for publication in the International Journal of e-Education, e-Business, e-Management and e-Learning (IJEEEE). Topic: A. Computing • Parallel and Distributed Computing • High-Performance Computing • […]
Oct, 25

4th International Conference on Industrial and Intelligent Information, ICIII 2015

Submission Deadline: 2015-03-01. Publication: All accepted papers of ICIII 2015 will be published in the following journals with ISSN: * Journal of Industrial and Intelligent Information (ISSN: 2301-3745, DOI: 10.12720/jiii), and will be indexed by Ulrich’s Periodicals Directory, Google Scholar (http://scholar.google.com/), EBSCO, Engineering & Technology Digital Library (http://www.etlibrary.org/) and Electronic Journals Library. Topic: Track 1. Neural networks […]
Oct, 25

4th International Conference on System Engineering and Modeling, ICSEM 2015

Submission Deadline: 2015-03-01. Publication: Submitted papers can be selected for publication in one of the following journals: * International Journal of Computer and Communication Engineering (IJCCE) (ISSN: 2010-3743), Abstracting/Indexing: EI (INSPEC, IET), Google Scholar, Engineering & Technology Digital Library, ProQuest, Crossref, and Electronic Journals Library * International Journal of Modeling and Optimization (IJMO), Abstracting/Indexing: Engineering & […]
Oct, 24

cufftShift: High Performance CUDA-accelerated FFT-shift Library

For embarrassingly parallel algorithms, a Graphics Processing Unit (GPU) outperforms a traditional CPU on price-per-flop and price-per-watt by at least one order of magnitude. This has led to the mapping of signal and image processing algorithms, and consequently their applications, to run entirely on GPUs. This paper presents CUFFTSHIFT, a ready-to-use GPU-accelerated library that implements […]
Oct, 24

Query Optimization in Heterogeneous CPU/GPU Environment for Time Series Databases

In recent years, the processing and exploration of time series have attracted noticeable interest. Growing volumes of data and the need for efficient processing have pushed research in new directions, including hardware-based solutions. Graphics Processing Units (GPUs) have significantly more applications than just rendering images. They are also used in general-purpose computing to solve […]
Oct, 24

Gaussian Process Models with Parallelization and GPU acceleration

In this work, we present an extension of Gaussian process (GP) models with sophisticated parallelization and GPU acceleration. The parallelization scheme arises naturally from the modular computational structure w.r.t. datapoints in the sparse Gaussian process formulation. Additionally, the computational bottleneck is implemented with GPU acceleration for further speed up. Combining both techniques allows applying Gaussian […]
Oct, 24

Monitoring Large-scale Microblog on GPUs

To monitor the spread of bad information in a microblog system, large-scale microblog data must be processed in real time. This requires highly cost-effective parallel schemes. A parallel processing method on GPUs is proposed to monitor massive microblog data. The proposed scheme can fully exploit the GPU's ability to schedule massive numbers of threads for data-intensive tasks. The detailed […]
Oct, 24

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment

Although an integral histogram enables histogram computation of a sub-area in constant time, constructing the integral histogram requires O(nm) steps for an n × m image. This construction time can be reduced using a parallel prefix sum algorithm. Mark Harris proposed an efficient parallel prefix sum and implemented it using CUDA GPGPU. Mark Harris’ algorithm has […]
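The relationship between prefix sums and the integral histogram can be sketched in a few lines of sequential Python. This is only a minimal CPU illustration under stated assumptions, not the CUDA algorithm from the paper: pixel values are assumed to be already quantized to bin indices, and the names `integral_histogram` and `region_histogram` are hypothetical. The per-row `accumulate` call marks the step that a parallel prefix sum (scan) would replace on a GPU.

```python
from itertools import accumulate

def integral_histogram(image, bins):
    """Return H where H[y][x][b] counts pixels of bin b in rows < y and columns < x.

    Assumes every pixel value is already a bin index in range(bins).
    """
    rows, cols = len(image), len(image[0])
    # An extra zero row and column keep the region query free of edge cases.
    H = [[[0] * bins for _ in range(cols + 1)] for _ in range(rows + 1)]
    for y in range(rows):
        for b in range(bins):
            # Inclusive prefix sum along the row: the step a parallel scan would speed up.
            row_scan = list(accumulate(1 if v == b else 0 for v in image[y]))
            for x in range(cols):
                H[y + 1][x + 1][b] = H[y][x + 1][b] + row_scan[x]
    return H

def region_histogram(H, top, left, bottom, right):
    """Histogram of rows top..bottom-1 and columns left..right-1, four lookups per bin."""
    bins = len(H[0][0])
    return [
        H[bottom][right][b] - H[top][right][b] - H[bottom][left][b] + H[top][left][b]
        for b in range(bins)
    ]

image = [
    [0, 1, 1],
    [2, 1, 0],
    [0, 0, 2],
]
H = integral_histogram(image, bins=3)
whole = region_histogram(H, 0, 0, 3, 3)
lower_right = region_histogram(H, 1, 1, 3, 3)
```

Building `H` touches every pixel once per bin, but each later query costs four lookups per bin regardless of the region size, which is the constant-time property mentioned above.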
Oct, 22

Introducing CURRENNT – the Munich open-source CUDA RecurREnt Neural Network Toolkit

In this article, we introduce CURRENNT, an open-source parallel implementation of deep recurrent neural networks (RNNs) supporting graphics processing units (GPUs) through NVIDIA’s Compute Unified Device Architecture (CUDA). CURRENNT supports uni- and bidirectional RNNs with Long Short-Term Memory (LSTM) memory cells which overcome the vanishing gradient problem. To our knowledge, CURRENNT is the first publicly […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors