Mar, 28

Parallelized Vlasov-Fokker-Planck solver for desktop personal computers

The numerical solution of the Vlasov-Fokker-Planck equation is a well established method to simulate the dynamics, including the self-interaction with its own wake field, of an electron bunch in a storage ring. In this paper we present Inovesa, a modularly extensible program that uses OpenCL to massively parallelize the computation. It allows a standard desktop […]
Mar, 28

APUNet: Revitalizing GPU as Packet Processing Accelerator

Many research works have recently experimented with GPU to accelerate packet processing in network applications. Most works have shown that GPU brings a significant performance boost when it is compared to the CPU-only approach, thanks to its highly-parallel computation capacity and large memory bandwidth. However, a recent work argues that for many applications, the key […]
Mar, 20

Comparing Programmer Productivity in OpenACC and CUDA: an Empirical Investigation

OpenACC has been touted as a "high productivity" API designed to make GPGPU programming accessible to scientific programmers, but to date, no studies have attempted to verify this quantitatively. In this paper, we conduct an empirical investigation of program productivity comparisons between OpenACC and CUDA in the programming time, the execution time and the analysis […]
Mar, 20

A Study of Complex Deep Learning Networks on High Performance, Neuromorphic, and Quantum Computers

Current Deep Learning approaches have been very successful using convolutional neural networks (CNN) trained on large graphical processing units (GPU)-based computers. Three limitations of this approach are: 1) they are based on a simple layered network topology, i.e., highly connected layers, without intra-layer connections; 2) the networks are manually configured to achieve optimal results, and […]
Mar, 20

Exploiting Multi-level Parallelism on a Many-core System for the Application of Hyperheuristics to a Molecular Docking Problem

The solution of Protein-Ligand Docking Problems can be approached through metaheuristics, and satisfactory metaheuristics can be obtained with hyperheuristics searching in the space of metaheuristics implemented inside a parameterized schema. These hyperheuristics apply several metaheuristics, resulting in high computational costs. To reduce execution times, a shared-memory schema of hyperheuristics is used with four levels of […]
Mar, 20

Optimization of Lattice Boltzmann Simulations on Heterogeneous Computers

High-performance computing systems are more and more often based on accelerators. Computing applications targeting those systems often follow a host-driven approach in which hosts offload almost all compute-intensive sections of the code onto accelerators; this approach only marginally exploits the computational resources available on the host CPUs, limiting performance and energy efficiency. The obvious step […]
Mar, 20

Neural Networks for Beginners. A fast implementation in Matlab, Torch, TensorFlow

This report provides an introduction to some Machine Learning tools within the most common development environments. It mainly focuses on practical problems, skipping any theoretical introduction. It is oriented to both students trying to approach Machine Learning and experts looking for new frameworks.
Mar, 19

International Conference on Deep Learning Technologies (ICDLT), 2017

Paper Publication All accepted papers must be written in English and will be published into #International Conference Proceedings Series by ACM, and indexed by Ei Compendex and Scopus. Proceedings ISBN: 978-1-4503-4783-9. Conference Chair Prof. Dr. Q. M. Jonathan Wu, University of Windsor, Canada Prof. Xudong Jiang, Nanyang Technological University, Singapore Submission Methods 1. Full Paper […]
Mar, 19

2nd IEEE Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), 2017

Paper Publication This conference is supported by IEEE.The proceedings will be submitted and reviewed by the IEEE Xplore and Ei Compendex after the conference. Paper Topic (http://www.acirs.org/cfp.html) Underwater/Aerial Robots, Agriculture Robots, Space Robotics, Biomimetic robotics, Intelligent Transport Systems, Networked robots, Mobiligence, Rescue Robots, SWARM Intelligent Robots, Domestic Personal Robots, Visual Servoing/Robot vision, Medical/rehabilitation robotics, Perception/Learning, […]
Mar, 19

9th International Conference on Graphic and Image Processing (ICGIP), 2017

Paper Publication The paper accepted by ICGIP 2017 will be published in conference proceedings in the SPIE Digital Library along with nearly 300,000 papers from other outstanding conferences and articles from SPIE Journals, which will be included in the SPIE Digital Library, and provided to the Web of Science (CPCI), *Scopus, Ei Compendex*, Inspec, Google […]
Mar, 19

2nd IEEE International Conference on Image, Vision and Computing (ICIVC), 2017

Publication All accepted papers must be written in English and will be published in IEEE conference proceedings and indexed by Ei Compendex and Scopus after conference. Submission 1. Full paper(publication and presentation) 2. Abstract (presentation) For full paper, please upload it to the Electronic Submission System (.pdf) https://www.easychair.org/conferences/?conf=icivc2017 For abstract, please send it to icivc@young.ac.cn […]
Mar, 14

GPU accelerated population annealing algorithm

Population annealing is a promising recent approach for Monte Carlo simulations in statistical physics, in particular for the simulation of systems with complex free-energy landscapes. It is a hybrid method, combining importance sampling through Markov chains with elements of sequential Monte Carlo in the form of population control. While it appears to provide algorithmic capabilities […]
Page 8 of 919« First...678910...203040...Last »

* * *

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: