Posts
Jul, 1
Energy Efficiency Benefits of Reducing the Voltage Guardband on the Kepler GPU Architecture
Energy efficiency of GPU architectures has emerged as an important design criterion for both NVIDIA and AMD. In this paper, we explore the benefits of scaling a general-purpose GPU (GPGPU) core’s supply voltage to the near limits of execution failure. We find that as much as 21% of NVIDIA GTX 680’s core supply voltage guardband […]
Jul, 1
Accelerated Computation of Minimum Enclosing Balls by GPU Parallelization and Distance Filtering
Minimum enclosing balls are used extensively to speed up multidimensional data processing in, e.g., machine learning, spatial databases, and computer graphics. We present a case study of several acceleration techniques that are applicable in enclosing ball algorithms based on repeated farthest-point queries. Parallel GPU solutions using CUDA are developed for both low- and high-dimensional cases. […]
Jul, 1
Parallelizing the Cellular Potts Model on GPU and multi-core CPU: An OpenCL cross-platform study
In this paper, we present the analysis and development of a cross-platform OpenCL parallelization of the Cellular Potts Model (CPM). In general, the evolution of the CPM is time-consuming. Using a data-parallel programming model such as CUDA can accelerate the process, but it is highly dependent on the hardware type and manufacturer. Recently, OpenCL has attracted […]
Jul, 1
High-Level Programming Framework for Executing Streaming Applications on Heterogeneous OpenCL Platforms
As the computer industry approaches the limits of processor speed and transistor size, it has to come up with complex new architectures and make more efficient use of the available processing power. For application developers this can be a difficult task, because they have to be aware of low-level hardware properties and there […]
Jul, 1
4th International Conference on Information Computer Application, ICICA 2015
Submission Deadline: 2014-10-05. Publication: The ICICA 2015 conference proceedings will be published in the International Journal of Computer and Communication Engineering (ISSN: 2010-3743, www.ijcce.org), which will be indexed by Google Scholar, Engineering & Technology Digital Library, ProQuest, and Crossref. Call for Papers: Algorithms, Automated Software Engineering, Bioinformatics and Scientific Computing, Compilers and Interpreters, Computer Animation, Artificial […]
Jul, 1
3rd International Conference on System Modeling and Optimization, ICSMO 2015
Submission Deadline: 2014-09-20. Publication: The ICSMO 2015 conference proceedings will be published in the International Journal of Modeling and Optimization (ISSN: 2010-3697, www.ijmo.org), and will be included in the Engineering & Technology Digital Library and indexed by ProQuest, Google Scholar, and Crossref. Call for Papers: Agent Based Simulation, Analytical and Stochastic Modelling Techniques and […]
Jul, 1
6th International Conference on Computer Modeling and Simulation, ICCMS 2015
Submission Deadline: 2014-09-30. Publication: As usual, all accepted papers for ICCMS 2015 will be published in the International Journal of Computer Theory and Engineering (ISSN: 1793-8201, www.ijcte.org) and will be indexed by Electronic Journals Library, EBSCO, Engineering & Technology Digital Library, Google Scholar, INSPEC, Ulrich’s Periodicals Directory, Crossref, ProQuest, WorldCat, and EI (INSPEC, IET). Call […]
Jul, 1
Kd-tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU
N-body simulations are an important class of numerical simulations for studying a wide range of physical phenomena, for which researchers demand fast and accurate implementations. Due to the computational complexity, simple brute-force methods to solve the long-distance interaction between bodies can only be used for small-scale simulations. Smarter approaches utilize neighbor lists, tree […]
Jul, 1
Fast Galactic Structure Finding using Graphics Processing Units
Cosmological simulations are used by astronomers to investigate large-scale structure formation and galaxy evolution. Structure finding, that is, the discovery of gravitationally bound objects such as dark matter halos, is a crucial step in many such simulations. During recent years, advancing computational capacity has led to halo-finders needing to manage increasingly large simulations. As a […]
Jul, 1
Increasing Deep Neural Network Acoustic Model Size for Large Vocabulary Continuous Speech Recognition
Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recognition systems. Part of the promise of DNNs is their ability to represent increasingly complex functions as the number of DNN parameters increases. This paper investigates the performance of DNN-based hybrid speech recognition systems as DNN model size and training data […]
Jul, 1
The design and verification of Mumax3
We report on the design, verification and performance of mumax3, an open-source GPU-accelerated micromagnetic simulation program. This software solves the time- and space-dependent magnetization evolution in nano- to microscale magnets using a finite-difference discretization. Its high performance and low memory requirements allow for large-scale simulations to be performed in limited time and on […]
Jul, 1
Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units
A finite-difference micromagnetic solver is presented that uses C++ Accelerated Massive Parallelism (C++ AMP). The high-speed performance of a single Graphics Processing Unit (GPU) is demonstrated in comparison with a typical CPU-based solver. The GPU-to-CPU speed-up is shown to be greater than 100 for larger problem sizes. This solver is based […]