18645

Posts

Dec, 9

Clacc: Translating OpenACC to OpenMP in Clang

OpenACC was launched in 2010 as a portable programming model for heterogeneous accelerators. Although various implementations already exist, no extensible, open-source, production-quality compiler support is available to the community. This deficiency poses a serious risk for HPC application developers targeting GPUs and other accelerators, and it limits experimentation and progress for the OpenACC specification. To […]
Dec, 9

High Performance Portable Tsunami Simulations on Many-core CPU, GPU, and FPGA

Tsunami generated by a submarine earthquake sometimes causes serious damage in a coastal area. To reduce negative effects of tsunami, effective evacuation and disaster prevention are getting interested. We can contribute to that by forecasting arrival time and height of tsunami with computer simulations. However, tsunami simulations always require massive data processing. The shallow water […]
Dec, 9

ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture

Smith-waterman algorithm (SW) is a popular dynamic programming algorithm widely used in Bioinformatics for local biological sequence alignment. Due to the O(n2) high time and space complexity of SW and growing size of biological data, it is crucial to accelerate SW for high performance. In view of the GPU high efficiency in science computation, many […]
Dec, 9

Optimization of a discontinuous finite element solver with OpenCL and StarPU

schnaps is a finite element solver designed to simulate various physical phenomena. It is designed to run on hybrid computers made of several CPUs and GPUs. In order to address the hybrid architectures we rely on the StarPU runtime. StarPU allows to optimize in an incremental way a sequential algorithm in order to migrate to […]
Dec, 9

A Framework for Fast and Efficient Neural Network Compression

Network compression reduces the computational complexity and memory consumption of deep neural networks by reducing the number of parameters. In SVD-based network compression, the right rank needs to be decided for every layer of the network. In this paper, we propose an efficient method for obtaining the rank configuration of the whole network. Unlike previous […]
Dec, 5

International Conference on Frontiers of Neural Networks (ICFNN), 2019

Researchers, scientists, engineers and industry professionals will join together this year at ICFNN 2019, where the latest research will be unveiled and groundbreaking research projects will be presented. The field of  Frontiers of Neural Networks is entering an era of unprecedented change and innovation. ICFNN 2019 presents one of 2019’s premiere opportunities to hear from and network with […]
Dec, 5

International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML), 2019

Researchers, scientists, engineers and industry professionals will join together this year at FAIML 2019, where the latest research will be unveiled and groundbreaking research projects will be presented. The field of Frontiers of Artificial Intelligence and Machine Learning is entering an era of unprecedented change and innovation. FAIML 2019 presents one of 2019’s premiere opportunities to hear from and […]
Dec, 5

3rd World Conference on Clinical Research and Biomarkers, 2019

Theme: Ensure Quality Assurance and Inspection Readiness through Good Clinical Practice and Strategic Partnerships Date: 16-17, 2019 Venue: Miami, Florida, USA E-mail: biomarkersresearch@gmail.com Phone: 7799790002 In the determination of safety and effectiveness of medications, devices, diagnostic products and treatments Clinical Research plays a major role. From a naturally occurring molecule or gene, we can identify […]
Dec, 5

3rd International Conference on Vision, Image and Signal Processing (ICVISP), 2019

ICVISP 2019 welcomes researchers, engineers, scientists and industry professionals to an open forum where advances in the field of Vision, Image and Signal Processing can be shared and examined. The conference is an ideal platform for keeping up with advances and changes to a consistently morphing field. Publication and Indexing All accepted papers will be published […]
Dec, 5

3rd International Symposium on Intelligent Unmanned Systems and Artificial Intelligence (SIUSAI), 2019

SIUSAI 2019 welcomes researchers, engineers, scientists and industry professionals to an open forum where advances in the field of Intelligent Unmanned Systems and Artificial Intelligence can be shared and examined. The conference is an ideal platform for keeping up with advances and changes to a consistently morphing field. Publication and Indexing All accepted papers will be […]
Dec, 2

A Fast and Simple Approach to Merge and Merge Sort using Wide Vector Instructions

Merging and sorting algorithms are the backbone of many modern computer applications. As such, efficient implementations are desired. Recent architectural advancements in CPUs (Central Processing Units), such as wider and more powerful vector instructions, allow for algorithmic improvements. This paper presents a new approach to merge sort using vector instructions. Traditional approaches to vectorized sorting […]
Dec, 2

Mix-and-Match: A Model-driven Runtime Optimisation Strategy for BFS on GPUs

It is universally accepted that the performance of graph algorithms is heavily dependent on the algorithm, the execution platform, and the structure of the input graph. This variability remains difficult to predict and hinders the choice of the right algorithm for a given problem. In this work, we focus on a case study: breadth-first search […]

* * *

* * *

HGPU group © 2010-2018 hgpu.org

All rights belong to the respective authors

Contact us: