Posts

Sep, 8

A Lightweight Approach to Performance Portability with targetDP

Leading HPC systems achieve their status through use of highly parallel devices such as NVIDIA GPUs or Intel Xeon Phi many-core CPUs. The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer. In this paper we describe targetDP, a lightweight abstraction layer which allows grid-based applications […]
Sep, 8

QSL Squasher: A Fast Quasi-Separatrix Layer Map Calculator

Quasi-Separatrix Layers (QSLs) are a useful proxy for the locations where current sheets can develop in the solar corona, and give valuable information about the connectivity in complicated magnetic field configurations. However, calculating QSL maps even for 2-dimensional slices through 3-dimensional models of coronal magnetic fields is a non-trivial task as it usually involves tracing […]
Sep, 8

A Non-linear GPU Thread Map for Triangular Domains

There is a stage in the GPU computing pipeline where a grid of thread-blocks, in parallel space, is mapped onto the problem domain, in data space. Since the parallel space is restricted to a box type geometry, the mapping approach is typically a k-dimensional bounding box (BB) that covers a p-dimensional data space. Threads that […]
Sep, 8

OpenCL/CUDA algorithms for parallel decoding of any irregular LDPC code using GPU

This article provides a scalable parallel approach of an iterative LDPC decoder, presented in a tutorial-based style. The proposed approach can be implemented in applications supporting massive parallel computing. The proposed mapping is suitable for decoding any irregular LDPC code without the limitation of the maximum node degree. The implementation of the LDPC decoder with […]
Sep, 8

International Conference on Innovation in Artificial Intelligence (ICIAI), 2017

Paper Publication: All accepted papers of ICIAI 2017 will be published in the International Conference Proceedings, which will be indexed by Ei Compendex and Scopus.
Sep, 8

The 7th International Workshop on Computer Science and Engineering (WCSE), 2017

Registered and presented papers of WCSE 2017 will be published in the conference proceedings, which will be indexed by Scopus and Ei Compendex.
Sep, 8

International Conference on Information and Computer Technologies (ICICT), 2017

Publication and Indexing: Accepted and registered full papers will be published in the conference proceedings, which will be included in major databases such as Ei, Scopus, etc. Invited Keynote Speakers: Prof. S. Arumuga perumal, head of Department of Computer Science, S.T.Hindu College, India; Prof. Dr. R. Sivakumar, head of Department of Electronics and Communication Engineering at […]
Sep, 8

3rd International Conference on Knowledge and Software Engineering (ICKSE), 2017

For papers submitted to ICKSE 2017, we offer the following publication options: 1. Publication in proceedings, which will be indexed by EI Compendex, Scopus, and ISI CPCS. 2. Publication in the International Journal of Knowledge Engineering, which will be indexed by Proquest, Google Scholar, etc. There are two methods for submitting your paper: 1. By our electric […]
Sep, 8

8th International Conference on Computer Technologies and Development (ICCTD), 2017

For papers submitted to ICCTD 2017, we offer the following publication options: 1. Publication in Proceedings. Submissions will be peer reviewed by conference committees, and accepted papers will be published in proceedings, which will be indexed by EI Compendex, Scopus, and ISI CPCS. 2. Publication in Journal. Submissions will be reviewed by the conference committees […]
Sep, 6

cf4ocl: a C framework for OpenCL

OpenCL is an open standard for parallel programming of heterogeneous compute devices, such as GPUs, CPUs, DSPs or FPGAs. However, the verbosity of its C host API can hinder application development. In this paper we present cf4ocl, a software library for rapid development of OpenCL programs in pure C. It aims to reduce the verbosity […]
Sep, 5

Parallel Dictionary Learning Algorithms for Sparse Representations

Sparse representations are used intensively in signal processing applications such as image coding, denoising, echo channel modeling, compression, classification and many others. Recent research has shown encouraging results when the sparse signals are created through the use of a learned dictionary. The current study focuses on finding new methods and algorithms that have a parallel form […]
Sep, 5

Parallel Tree Traversal for Nearest Neighbor Query on the GPU

The similarity search problem is found in many application domains, including computer graphics, information retrieval, statistics, computational biology, and scientific data processing, to name just a few. Recently, several studies have sought to accelerate k-nearest neighbor (kNN) queries using GPUs, but most of these works develop brute-force exhaustive scanning algorithms leveraging a large […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
