
Posts

Sep, 23

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks

As heterogeneous architectures are becoming mainstream for HPC systems, application programmers are looking for programming model implementations that offer both performance and portability across platforms. Two directive-based programming models for accelerator programming that aim at doing this are OpenMP 4/4.5 and OpenACC. Many users want to know the difference between these two programming models, the […]
Sep, 23

Parallel LZ77 Decoding using a GPU

Data compression, as a process, aims to satisfy the modern world’s need for speed and efficiency by reducing the cost of storing and transmitting information. Over the past few years, there have been several attempts to improve the performance and reduce the execution times of older compression algorithms by adapting them to make use of […]
Sep, 23

Support for Parallel Scan in OpenMP

Prefix Scan (or simply scan) is an operator that computes all the partial sums of a vector. A scan operation results in a vector where each element is the sum of the preceding elements in the original vector up to the corresponding position. Scan is a key operation in many relevant problems like sorting, lexical […]
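
As a rough illustration of the definition above, here is a minimal sequential sketch in Python. It is not the OpenMP support the post discusses; the function names `inclusive_scan` and `exclusive_scan` are hypothetical and chosen only for this example. The excerpt's wording fits the inclusive variant, where each output element includes the input element at the same position; the exclusive variant is shown for contrast.

```python
from itertools import accumulate


def inclusive_scan(values):
    """Running sums: out[i] = values[0] + ... + values[i]."""
    return list(accumulate(values))


def exclusive_scan(values):
    """Running sums shifted by one: out[i] = values[0] + ... + values[i-1]."""
    if not values:
        return []
    # Start from the additive identity and drop the final total.
    return [0] + inclusive_scan(values)[:-1]


if __name__ == "__main__":
    data = [3, 1, 4, 1, 5]
    print(inclusive_scan(data))  # [3, 4, 8, 9, 14]
    print(exclusive_scan(data))  # [0, 3, 4, 8, 9]
```

Each output element here depends on the previous one, which is why a parallel scan needs a different formulation than this sequential loop.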
Sep, 23

Scalability Analysis of Synchronous Data-Parallel Artificial Neural Network (ANN) Learners

Artificial Neural Networks (ANNs) have been established as one of the most important algorithmic tools in the Machine Learning (ML) toolbox over the past few decades. ANNs’ recent rise to widespread acceptance can be attributed to two developments: (1) the availability of large-scale training and testing datasets; and (2) the availability of new computer architectures […]
Sep, 23

SoaAlloc: Accelerating Single-Method Multiple-Objects Applications on GPUs

We propose SoaAlloc, a dynamic object allocator for Single-Method Multiple-Objects applications in CUDA. SoaAlloc is the first allocator for GPUs that (a) arranges allocations in a SIMD-friendly Structure of Arrays (SOA) data layout, (b) provides a do-all operation for maximizing the benefit of SOA, and (c) is on par with state-of-the-art memory allocators for raw […]
Sep, 18

International Conference on Image, Video and Signal Processing (IVSP), 2019

The 2019 International Conference on Image, Video and Signal Processing (IVSP 2019) will be held on 25-28 February 2019 in Shanghai, China. IVSP 2019 aims to provide researchers and practitioners from academia and industry with a forum to report on the latest developments in video, image and signal processing, multimedia and computer graphics. The conference […]
Sep, 18

International Joint Conference on Signals, Systems and Computers (CSSC), 2018

Venue: Khalifa University, Abu Dhabi, UAE. Khalifa University (also known as Khalifa University of Science, Technology & Research, or KUSTAR) is a science-focused university located in Abu Dhabi, United Arab Emirates, with a satellite campus in Sharjah. In 2017 it was ranked as the 401st best university in the world by QS rankings. Founded in 2007 […]
Sep, 18

International Joint Conference on Robotics, Automation and Mechatronics (JCRAM), 2018

JCRAM 2018 welcomes researchers, engineers, scientists and industry professionals to an open forum where advances in the field of Robotics, Automation and Mechatronics can be shared and examined. The conference is an ideal platform for keeping up with advances and changes in a constantly evolving field. Publication and Indexing: All accepted papers will be published in […]
Sep, 18

International Joint Conference on Robotics and Artificial Intelligence (JCRAI), 2018

JCRAI 2018 welcomes researchers, engineers, scientists and industry professionals to an open forum where advances in the field of Robotics and Artificial Intelligence can be shared and examined. The conference is an ideal platform for keeping up with advances and changes in a constantly evolving field. Publication and Indexing: All accepted papers will be published in […]
Sep, 18

International Joint Conference on Computer Vision and Pattern Recognition (CCVPR), 2018

CCVPR 2018 welcomes researchers, engineers, scientists and industry professionals to an open forum where advances in the field of Computer Vision and Pattern Recognition can be shared and examined. The conference is an ideal platform for keeping up with advances and changes in a constantly evolving field. Publication and Indexing: All accepted papers will be published […]
Sep, 16

Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are extremely computationally demanding, presenting a large barrier to their deployment on resource-constrained devices. Since such systems are where some of their most useful applications lie (e.g. obstacle detection for mobile robots, vision-based medical assistive technology), significant bodies of work from both machine learning and systems communities have attempted to provide […]
Sep, 16

A deep learning approach to autonomous lunar landing

Over the past few years, within the broad field of Artificial Intelligence (AI), new Machine Learning techniques have come to play a central role, proving to be very powerful and versatile. For this reason, they are expected to take a leading role in space applications, and they are already under study. Thanks to the large availability of […]

* * *


HGPU group © 2010-2019 hgpu.org

All rights belong to the respective authors
