
Posts

Mar, 19

International Conference on Deep Learning Technologies (ICDLT), 2017

Paper Publication: All accepted papers must be written in English and will be published in the International Conference Proceedings Series by ACM, and indexed by Ei Compendex and Scopus. Proceedings ISBN: 978-1-4503-4783-9. Conference Chair: Prof. Dr. Q. M. Jonathan Wu, University of Windsor, Canada; Prof. Xudong Jiang, Nanyang Technological University, Singapore. Submission Methods: 1. Full Paper […]
Mar, 19

2nd IEEE Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), 2017

Paper Publication: This conference is supported by IEEE. The proceedings will be submitted to, and reviewed by, IEEE Xplore and Ei Compendex after the conference. Paper Topic (http://www.acirs.org/cfp.html): Underwater/Aerial Robots, Agriculture Robots, Space Robotics, Biomimetic robotics, Intelligent Transport Systems, Networked robots, Mobiligence, Rescue Robots, SWARM Intelligent Robots, Domestic Personal Robots, Visual Servoing/Robot vision, Medical/rehabilitation robotics, Perception/Learning, […]
Mar, 19

9th International Conference on Graphic and Image Processing (ICGIP), 2017

Paper Publication: Papers accepted by ICGIP 2017 will be published in the conference proceedings, which will be included in the SPIE Digital Library along with nearly 300,000 papers from other outstanding conferences and articles from SPIE Journals, and provided to the Web of Science (CPCI), *Scopus, Ei Compendex*, Inspec, Google […]
Mar, 19

2nd IEEE International Conference on Image, Vision and Computing (ICIVC), 2017

Publication: All accepted papers must be written in English and will be published in IEEE conference proceedings and indexed by Ei Compendex and Scopus after the conference. Submission: 1. Full paper (publication and presentation) 2. Abstract (presentation). For a full paper, please upload it to the Electronic Submission System (.pdf): https://www.easychair.org/conferences/?conf=icivc2017 For an abstract, please send it to icivc@young.ac.cn […]
Mar, 14

GPU accelerated population annealing algorithm

Population annealing is a promising recent approach for Monte Carlo simulations in statistical physics, in particular for the simulation of systems with complex free-energy landscapes. It is a hybrid method, combining importance sampling through Markov chains with elements of sequential Monte Carlo in the form of population control. While it appears to provide algorithmic capabilities […]
Mar, 14

Compiling Parallel Functional Code with Data Parallel Idealised Algol

Graphics Processing Units (GPUs) and other parallel devices are widely available and have the potential for accelerating a wide class of algorithms. However, expert programming skills are required to achieve maximum performance. These devices expose low-level hardware details through imperative programming interfaces, which inevitably results in non-performance-portable programs highly tuned for a specific device. Functional […]
Mar, 14

Large-scale image analysis using docker sandboxing

With the advent of specialized hardware such as Graphics Processing Units (GPUs), large scale image localization, classification and retrieval have seen increased prevalence. Designing scalable software architecture that co-evolves with such specialized hardware is a challenge in the commercial setting. In this paper, we describe one such architecture (Cortexica) that leverages scalability of GPUs and […]
Mar, 14

Massive Exploration of Neural Machine Translation Architectures

Neural Machine Translation (NMT) has shown remarkable progress over the past few years with production systems now being deployed to end-users. One major drawback of current architectures is that they are expensive to train, typically requiring days to weeks of GPU time to converge. This makes exhaustive hyperparameter search, as is commonly done with other […]
Mar, 14

Model-independent partial wave analysis using a massively-parallel fitting framework

The functionality of GooFit, a GPU-friendly framework for doing maximum-likelihood fits, has been extended to extract model-independent S-wave amplitudes in three-body decays such as $D^+ \to h^+h^+h^-$. A full amplitude analysis is done where the magnitudes and phases of the S-wave amplitudes are anchored at a finite number of $m^2(h^+h^-)$ control points, and a cubic […]
Mar, 10

A Survey of Cache Partitioning Techniques for Multicore Processors

As the number of on-chip cores and memory demands of applications increase, judicious management of cache resources has become, not merely attractive, but even imperative. Cache partitioning, i.e. dividing cache space between applications based on their memory demands, is a promising approach to provide capacity benefits of shared cache with performance isolation of private caches. […]
Mar, 9

Architectural Principles and Experimentation of Distributed High Performance Virtual Clusters

With the advent of virtualization and Infrastructure-as-a-Service (IaaS), the broader scientific computing community is considering the use of clouds for their scientific computing needs. This is due to the relative scalability, ease of use, advanced user environment customization abilities, and the many novel computing paradigms available for data-intensive applications. However, a notable performance gap exists […]
Mar, 9

Achieving high-performance with a sparse direct solver on Intel KNL

The need for energy-efficient high-end systems has led hardware vendors to design new types of chips for general purpose computing. However, designing or porting a code tailored for these new types of processing units is often considered as a major hurdle for their broad adoption. In this paper, we consider a modern Intel Xeon Phi […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
