Posts
Feb, 26
Full-Speed Deterministic Bit-Accurate Parallel Floating-Point Summation on Multi- and Many-Core Architectures
On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, especially reductions, may become non-deterministic and thus non-reproducible mainly due to non-associativity of floating-point operations. We introduce a solution to compute deterministic sums of floating-point numbers efficiently and with the best possible accuracy. Our multi-level algorithm consists of two main stages: a filtering stage that uses […]
Feb, 26
ShearLab 3D: Faithful Digital Shearlet Transforms based on Compactly Supported Shearlets
Wavelets and their associated transforms are highly efficient when approximating and analyzing one-dimensional signals. However, multivariate signals such as images or videos typically exhibit curvilinear singularities, which wavelets are provably deficient of sparsely approximating and also of analyzing in the sense of, for instance, detecting their direction. Shearlets are a directional representation system extending the […]
Feb, 25
An Evaluation of the GAMA/StarPU Frameworks for Heterogeneous Platforms: the Progressive Photon Mapping Algorithm
Recent evolution of high performance computing moved towards heterogeneous platforms: multiple devices with different architectures, characteristics and programming models, share application workloads. To aid the programmer to efficiently explore these heterogeneous platforms several frameworks have been under development. These dynamically manage the available computing resources through workload scheduling and data distribution, dealing with the inherent […]
Feb, 25
Fast Feature Selection in a GPU Cluster Using the Delta Test
Feature or variable selection still remains an unsolved problem, due to the infeasible evaluation of all the solution space. Several algorithms based on heuristics have been proposed so far with successful results. However, these algorithms were not designed for considering very large datasets, making their execution impossible, due to the memory and time limitations. This […]
Feb, 25
Precision-Aware Soft Error Protection for GPUs
With the advent of general-purpose GPU computing, it is becoming increasingly desirable to protect GPUs from soft errors. For high computation throughout, GPUs must store a significant amount of state and have many execution units. The high power and area costs of full protection from soft errors make selective protection techniques attractive. Such approaches provide […]
Feb, 25
A GaBP-GPU Algorithm of Solving Large-Scale Sparse Linear Systems
According to GaBP (Gaussian Belief Propagation) algorithm, this article presents a GaBP-GPU algorithm of solving large-scale symmetric diagonally dominant sparse linear systems based on GPU. Combined with GaBP-GPU algorithm, a storage format (MCSC) is presented. We extract some diagonally dominant matrices from the University of Florida Sparse Matrix Collection as test examples. The experimental results […]
Feb, 25
Auto-Tuning Dedispersion for Many-Core Accelerators
Dedispersion is a basic algorithm to reconstruct impulsive astrophysical signals. It is used in high sampling-rate radio astronomy to counteract temporal smearing by intervening interstellar medium. To counteract this smearing, the received signal train must be dedispersed for thousands of trial distances, after which the transformed signals are further analyzed. This process is expensive on […]
Feb, 24
2014 3rd International Conference on Industrial and Intelligent Information, ICIII 2014
Publication: All accepted papers of ICIII 2014 will be published in the Journal of Industrial and Intelligent Information (ISSN:2301-3745, DOI: 10.12720/jiii), and will be indexed by Ulrich’s Periodicals Directory, Google Scholar, EBSCO, Engineering & Technology Digital Library and Electronic Journals Library Submission Deadline: 2014-03-20 Call for Paper: 1. Neural networks and Applications 2. Machine Learning […]
Feb, 24
2014 3rd International Conference on System Engineering and Modeling, ICSEM 2014
Submission Deadline: 2014-03-20 Publication: All accepted papers of ICSEM 2014 will be published in the International Journal of Modeling and Optimization (ISSN:2010-3697), and will be included in the Engineering & Technology Digital Library, and indexed by Electronic Journals Library ProQuest,Google Scholar, Crossref, DOAJ and EI (INSPEC, IET). Call for Papers: Information Systems Engineering IS development […]
Feb, 24
2014 4th International Conference on Computer Communication and Management, ICCCM 2014
Publication: All accepted papers of ICCCM 2014 will be published in the following journals with ISSN: * International Journal of Computer and Communication Engineering (ISSN: 2010-3743), which will be indexed by Engineering & Technology Digital Library, Google Scholar, ProQuest, and Crossref. * Journal of Advanced Management Science (ISSN:2168-0787),which will be indexed by Ulrich’s Periodicals Directory, […]
Feb, 23
Real Time Face Detection on GPU Using OpenCL
This paper presents a novel approach for real time face detection using heterogeneous computing. The algorithm uses local binary pattern (LBP) as feature vector for face detection. OpenCL is used to accelerate the code using GPU[1]. Illuminance invariance is achieved using gamma correction and Difference of Gaussian(DOG) to make the algorithm robust against varying lighting […]
Feb, 23
Accelerating Content-Based Image Retrieval via GPU-adaptive Index Structure
A tremendous amount of work has been conducted in content-based image retrieval (CBIR) on designing efficient index structure to accelerate the retrieval process. Most of them improve the retrieval efficiency via complex index structures, and few take into account the parallel implementation of algorithm on underlying hardware. It makes the existing index structures suffer from […]

