
Posts

Nov, 23

GA3C: GPU-based A3C for Deep Reinforcement Learning

We introduce and analyze the computational aspects of a hybrid CPU/GPU implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. Our analysis concentrates on the critical aspects to leverage the GPU’s computational power, including the introduction of a system of queues and a dynamic scheduling […]
Nov, 23

Optimization and Evaluation of VLPL-S Particle-in-cell Code on Knights Landing

VLPL-S code is developed based on the particle-in-cell (PIC) algorithm, which is the mainstream algorithm of plasma behavior research. In this paper, we report our early experience on porting and optimizing the VLPL-S particle-in-cell code on the Knights Landing. By applying general optimization methods such as memory access optimization, thread level parallelism and vectorization to […]
Nov, 22

High performance pattern matching and data remanence on graphics processing units

Pattern matching is an important task in a plethora of different fields ranging from computer science to medical application, but is also a resource consuming problem. With the increase in network link speed, and the tremendous amounts of data generated, serial pattern matching on Central Processing Unit (CPU) is close to being rendered obsolete. The ubiquitous […]
Nov, 22

Processing OLTP Workloads on Hybrid CPU/GPU Systems

In recent times there have been a plethora of researches done on the utilization of co-processors like GPU and FPGA in database management system (DBMS). The reason for this trend is that modern processors have reached a performance threshold. Two major factors that have led to this behaviour are Memory Wall and Power Wall. This […]
Nov, 22

SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

With recent advancing of Internet of Things (IoTs), it becomes very attractive to implement the deep convolutional neural networks (DCNNs) onto embedded/portable systems. Presently, executing the software-based DCNNs requires high-performance server clusters in practice, restricting their widespread deployment on the mobile devices. To overcome this issue, considerable research efforts have been conducted in the context […]
Nov, 22

GPU-accelerated Red Blood Cells Simulations with Transport Dissipative Particle Dynamics

Mesoscopic numerical simulations provide a unique approach for the quantification of the chemical influences on red blood cell functionalities. The transport Dissipative Particles Dynamics (tDPD) method can lead to such effective multiscale simulations due to its ability to simultaneously capture mesoscopic advection, diffusion, and reaction. In this paper, we present a GPU-accelerated red blood cell […]
Nov, 22

Celeris: A GPU-accelerated open source software with a Boussinesq-type wave solver for real-time, interactive simulation and visualization

In this paper, we introduce an interactive coastal wave simulation and visualization software, called Celeris. Celeris is an open source software which needs minimum preparation to run on a Windows machine. The software solves the extended Boussinesq equations using a hybrid finite volume – finite difference method and supports moving shoreline boundaries. The simulation and […]
Nov, 20

2nd International Workshop on Theoretical Approaches to Performance Evaluation, Modeling and Simulation (TAPEMS), 2017

Performance and an aspect of it, energy efficiency, has become a key issue in both high performance and embedded computing. The objective of the 2nd TAPEMS International Workshop on Theoretical Approaches to Performance Evaluation, Modeling and Simulation is to bring together researchers and practitioners from academia and industry to discuss current advances and trends in theoretical […]
Nov, 20

9th International Conference on Bioinformatics and Biomedical Technology (ICBBT), 2017

The primary goal of the conference is to promote research and developmental activities in Bioinformatics and Biomedical Technology. Another goal is to promote scientific information interchange between researchers, developers, engineers, students, and practitioners working in Portugal and abroad. The conference will be held every year to make it an ideal platform for people to share […]
Nov, 20

7th International Conference on Biomedical Engineering and Technology (ICBET), 2017

The objective of the 2017 7th International Conference on Biomedical Engineering and Technology (ICBET 2017) is to provide a platform for researchers, engineers, academicians as well as industrial professionals from all over the world to present their research results and development activities in Biomedical Engineering and Technology. 2017 7th International Conference on Biomedical Engineering and […]
Nov, 20

International Conference on High Performance Compilation, Computing and Communications (HP3C-2017), 2017

You are cordially invited to join us at the International Conference on High Performance Compilation, Computing and Communications (HP3C-2017) in Kuala Lumpur, Malaysia during March 22-24, 2017, with the sponsor of American Society for Research. With the rapid growth in computing and communications technology, the past decade has witnessed a proliferation of powerful parallel and […]
Nov, 19

Evaluation of an OpenCL-Based FPGA Platform for Particle Filter

Particle filter is one promising method to estimate the internal states in dynamical systems, and can be used for various applications such as visual tracking and mobile-robot localization. The major drawback of particle filter is its large computational amount, which causes long computational-time and large power consumption. In order to solve this problem, this paper proposes […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
