15427
Gang Mei, Hong Tian
This paper focuses on evaluating the impact of different data layouts on the computational efficiency of GPU-accelerated Inverse Distance Weighting (IDW) interpolation algorithm. First we redesign and improve our previous GPU implementation that was performed by exploiting the feature of CUDA dynamic parallelism (CDP). Then we implement three versions of GPU implementations, i.e., the naive […]
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally
State-of-the art deep neural networks (DNNs) have hundreds of millions of connections and are both computationally and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources and power budgets. While custom hardware can help the computation, fetching the weights from DRAM can be as much as two orders of magnitude […]
View View   Download Download (PDF)   
C. Kristopher Garrett, Cory Hauck, Judith Hill
We present computational advances and results in the implementation of an entropy-based moment closure, M_N, in the context of linear kinetic equations, with an emphasis on heterogeneous and large-scale computing platforms. Entropy-based closures are known in several cases to yield more accurate results than closures based on standard spectral approximations, such as P_N, but the […]
View View   Download Download (PDF)   
Patrick O. Glauner
Inspired by recent successes of deep learning in computer vision, we propose a novel application of deep convolutional neural networks to facial expression recognition, in particular smile recognition. A smile recognition test accuracy of 99.45% is achieved for the Denver Intensity of Spontaneous Facial Action (DISFA) database, significantly outperforming existing approaches based on hand-crafted features […]
View View   Download Download (PDF)   
Flavio Vella, Giancarlo Carbone, Massimo Bernaschi
Betweenness Centrality (BC) is steadily growing in popularity as a metrics of the influence of a vertex in a graph. The BC score of a vertex is proportional to the number of all-pairs-shortest-paths passing through it. However, complete and exact BC computation for a large-scale graph is an extraordinary challenge that requires high performance computing […]
View View   Download Download (PDF)   
Ashwin Trikuta Srinath
Compact finite difference schemes are widely used in the direct numerical simulation of fluid flows for their ability to better resolve the small scales of turbulence. However, they can be expensive to evaluate and difficult to parallelize. In this work, we present an approach for the computation of compact finite differences and similar tridiagonal schemes […]
View View   Download Download (PDF)   
Justin Crause
The authoring of realistic terrain models is necessary to generate immersive virtual environments for computer games and film visual effects. However, creating these landscapes is difficult – it usually involves an artist spending many hours sculpting a model in a 3D design program. Specialised terrain generation programs exist to rapidly create artificial terrains, such as […]
View View   Download Download (PDF)   
Yifei Li
Measuring the similarity between two streamlines is fundamental to many important flow data analysis and visualization tasks such as feature detection, pattern querying and streamline clustering. This dissertation presents a novel streamline similarity measure inspired by the bag-of-features concept from computer vision. Different from other streamline similarity measures, the proposed one considers both the distribution […]
David Webb
Current embedded multi object tracking system implementations are dominated by the use of Field Programmable Gate Arrays (FPGAs) and Application Specific Integrated Circuits (ASICs) as application accelerators. These offer many of the traits desirable for embedded and real time systems, including; task oriented architectures, deterministic latency and low power requirements. The drawbacks to these approaches […]
View View   Download Download (PDF)   
Teng Li, Vikram K. Narayana, Tarek El-Ghazawi
The past several years have witnessed significant performance improvements in High-Performance Computing (HPC), due to the incorporation of GPUs as co-processors. On one hand, GPU devices are growing significantly in terms of the available number of cores and the memory hierarchy; as a result, effective utilization of the available GPU resources while limiting the system […]
View View   Download Download (PDF)   
Raja H. Singh, Lee Barford, Frederick Harris Jr.
Since the introduction of the Modern Portfolio Theory by Markowitz in the Journal of Finance in 1952, it has been the underlying theory in several portfolio optimization techniques. With the advancement of computers, most portfolio optimization are done by CPUs. Over the years, there have been papers that introduce various optimization methods including those introduced […]
View View   Download Download (PDF)   
Andoni Azkarate Saiz
Deep neural networks have recently gained popularity for improving state-of-the-art machine learning algorithms in diverse areas such as speech recognition, computer vision and bioinformatics. Convolutional networks especially have shown prowess in visual recognition tasks such as object recognition and detection in which this work is focused on. Modern award-winning architectures have systematically surpassed previous attempts […]
View View   Download Download (PDF)   
Page 1 of 52712345...102030...Last »

* * *

* * *

Follow us on Twitter

HGPU group

1735 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

368 people like HGPU on Facebook

HGPU group © 2010-2016 hgpu.org

All rights belong to the respective authors

Contact us: