Mar, 20

Optimization of Lattice Boltzmann Simulations on Heterogeneous Computers

High-performance computing systems are increasingly based on accelerators. Computing applications targeting those systems often follow a host-driven approach in which hosts offload almost all compute-intensive sections of the code onto accelerators; this approach only marginally exploits the computational resources available on the host CPUs, limiting performance and energy efficiency. The obvious step […]
Mar, 20

Neural Networks for Beginners. A fast implementation in Matlab, Torch, TensorFlow

This report provides an introduction to some Machine Learning tools within the most common development environments. It mainly focuses on practical problems, skipping any theoretical introduction. It is aimed at both students approaching Machine Learning and experts looking for new frameworks.
Mar, 19

International Conference on Deep Learning Technologies (ICDLT), 2017

Paper Publication: All accepted papers must be written in English and will be published in the International Conference Proceedings Series by ACM and indexed by Ei Compendex and Scopus. Proceedings ISBN: 978-1-4503-4783-9. Conference Chair: Prof. Dr. Q. M. Jonathan Wu, University of Windsor, Canada; Prof. Xudong Jiang, Nanyang Technological University, Singapore. Submission Methods: 1. Full Paper […]
Mar, 19

2nd IEEE Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), 2017

Paper Publication: This conference is supported by IEEE. The proceedings will be submitted to and reviewed by IEEE Xplore and Ei Compendex after the conference. Paper Topics (http://www.acirs.org/cfp.html): Underwater/Aerial Robots, Agriculture Robots, Space Robotics, Biomimetic Robotics, Intelligent Transport Systems, Networked Robots, Mobiligence, Rescue Robots, SWARM Intelligent Robots, Domestic Personal Robots, Visual Servoing/Robot Vision, Medical/Rehabilitation Robotics, Perception/Learning, […]
Mar, 19

9th International Conference on Graphic and Image Processing (ICGIP), 2017

Paper Publication: Papers accepted by ICGIP 2017 will be published in the conference proceedings in the SPIE Digital Library, alongside nearly 300,000 papers from other outstanding conferences and articles from SPIE Journals, and provided to the Web of Science (CPCI), Scopus, Ei Compendex, Inspec, Google […]
Mar, 19

2nd IEEE International Conference on Image, Vision and Computing (ICIVC), 2017

Publication: All accepted papers must be written in English and will be published in the IEEE conference proceedings and indexed by Ei Compendex and Scopus after the conference. Submission: 1. Full paper (publication and presentation) 2. Abstract (presentation). For a full paper, please upload it to the Electronic Submission System (.pdf): https://www.easychair.org/conferences/?conf=icivc2017 For an abstract, please send it to icivc@young.ac.cn […]
Mar, 14

Compiling Parallel Functional Code with Data Parallel Idealised Algol

Graphics Processing Units (GPUs) and other parallel devices are widely available and have the potential for accelerating a wide class of algorithms. However, expert programming skills are required to achieve maximum performance. These devices expose low-level hardware details through imperative programming interfaces, which inevitably results in non-performance-portable programs highly tuned for a specific device. Functional […]
Mar, 14

GPU accelerated population annealing algorithm

Population annealing is a promising recent approach for Monte Carlo simulations in statistical physics, in particular for the simulation of systems with complex free-energy landscapes. It is a hybrid method, combining importance sampling through Markov chains with elements of sequential Monte Carlo in the form of population control. While it appears to provide algorithmic capabilities […]
Mar, 14

Large-scale image analysis using docker sandboxing

With the advent of specialized hardware such as Graphics Processing Units (GPUs), large scale image localization, classification and retrieval have seen increased prevalence. Designing scalable software architecture that co-evolves with such specialized hardware is a challenge in the commercial setting. In this paper, we describe one such architecture (Cortexica) that leverages scalability of GPUs and […]
Mar, 14

Massive Exploration of Neural Machine Translation Architectures

Neural Machine Translation (NMT) has shown remarkable progress over the past few years with production systems now being deployed to end-users. One major drawback of current architectures is that they are expensive to train, typically requiring days to weeks of GPU time to converge. This makes exhaustive hyperparameter search, as is commonly done with other […]
Mar, 14

Model-independent partial wave analysis using a massively-parallel fitting framework

The functionality of GooFit, a GPU-friendly framework for doing maximum-likelihood fits, has been extended to extract model-independent S-wave amplitudes in three-body decays such as $D^+ \to h^+h^+h^-$. A full amplitude analysis is done where the magnitudes and phases of the S-wave amplitudes are anchored at a finite number of $m^2(h^+h^-)$ control points, and a cubic […]
Mar, 10

A Survey of Cache Partitioning Techniques for Multicore Processors

As the number of on-chip cores and the memory demands of applications increase, judicious management of cache resources has become not merely attractive but imperative. Cache partitioning, i.e. dividing cache space between applications based on their memory demands, is a promising approach to provide the capacity benefits of a shared cache with the performance isolation of private caches. […]

* * *


HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors
