17566

Posts

Sep, 12

Optimization of the Brillouin operator on the KNL architecture

Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with N_c=3 colors, N_v=12 right-hand-sides, N_{thr}=256 threads, on lattices of size 32^3*64, using exclusively OMP […]
Sep, 12

GPU-Accelerated Parallel Finite-Difference Time-Domain Method for Electromagnetic Waves Propagation in Unmagnetized Plasma Media

The finite-difference time-domain (FDTD) method has been commonly utilized in the numerical solution of electromagnetic (EM) waves propagation through the plasma media. However, the FDTD method may bring about a significant increment in additional run-times consuming for computationally large and complicated EM problems. Graphics Processing Unit (GPU) computing based on Compute Unified Device Architecture (CUDA) […]
Sep, 12

Sorting with GPUs: A Survey

Sorting is a fundamental operation in computer science and is a bottleneck in many important fields. Sorting is critical to database applications, online search and indexing,biomedical computing, and many other applications. The explosive growth in computational power and availability of GPU coprocessors has allowed sort operations on GPUs to be done much faster than any […]
Sep, 12

A Comparative Study of 2D Numerical Methods with GPU Computing

Graphics Processing Unit (GPU) computing is becoming an alternate computing platform for numerical simulations. However, it is not clear which numerical scheme will provide the highest computational efficiency for different types of problems. To this end, numerical accuracies and computational work of several numerical methods are compared using a GPU computing implementation. The Correction Procedure […]
Sep, 12

Report: Performance comparison between C2075 and P100 GPU cards using cosmological correlation functions

In this report, some cosmological correlation functions are used to evaluate the differential performance between C2075 and P100 GPU cards. In the past, the correlation functions used in this work have been widely studied and exploited on some previous GPU architectures. The analysis of the performance indicates that a speedup in the range from 13 […]
Sep, 10

The 2nd International Conference on Machine Learning and Soft Computing (ICMLSC), 2018

ICMLSC 2018, The 2nd International Conference on Machine Learning and Soft Computing, will take place in Phu Quoc Island, Vietnam, from February 2-4, 2018. ICMLSC 2018 is co-organized by the University of Science, Vietnam and Industrial University of Ho Chi Minh City. ICMLSC 2018 is a not-to-be-missed opportunity that distills the most current knowledge on […]
Sep, 10

10th International Conference on Computer and Automation Engineering (ICCAE), 2018

After the successes of ICCAE 2009 (Bangkok, Thailand), ICCAE 2010 (Singapore), ICCAE 2011 (Chongqing, China), ICCAE 2012 (Mumbai, India), ICCAE 2013 (Bruxelles, Belgium), ICCAE 2014 (Melbourne, Australia), ICCAE 2015 (Bali, Indonesia), ICCAE 2016 (Melbourne, Australia), ICCAE 2017 (Sydney, Australia), 2018 10th International Conference on Computer and Automation Engineering (ICCAE 2018) is going to take place […]
Sep, 10

The 2nd International Conference on Mechanical, Aeronautical and Automotive Engineering (ICMAA), 2018

The 2nd International Conference on Mechanical, Aeronautical and Automotive Engineering, will take place in Singapore, from February 24-26, 2018. ICMAA 2018 is a not-to-be-missed opportunity that distills the most current knowledge on a rapidly advancing discipline in one conference. Join key researchers and established professionals in the field of Mechanical, Aeronautical and Automotive Engineering as […]
Sep, 10

The 2nd International Conference on Cryptography, Security and Privacy (ICCSP), 2018

The 2nd International Conference on Cryptography, Security and Privacy (ICCSP 2018) will be held during March 16-19, 2018, in Guiyang, China. ICCSP 2018 aims to bring together researchers, scientists, engineers, and scholar students to exchange and share their experiences, new ideas, and research results about all aspects of Cryptography, Security and Privacy, and discuss the […]
Sep, 10

The 3rd International Conference on Multimedia and Image Processing (ICMIP), 2018

The 3rd International Conference on Multimedia and Image Processing (ICMIP 2018) will be held in Guiyang, China during March 16-19, 2018. ICMIP has been held successfully in Bandar Seri Begawan, Brunei Darussalam, Wuhan, China in the last two years respectively. The objective of ICMIP is to present the latest research and results of scientists related […]
Sep, 9

Optimization of Spatial Convolution in ConvNets on Intel KNL

Most of the experts admit that the true behavior of the neural network is hard to predict. It is quite impossible to deterministically prove the working of the neural network as the architecture gets bigger, yet, it is observed that it is possible to apply a well engineered network to solve one of the most […]
Sep, 9

Beyond 16GB: Out-of-Core Stencil Computations

Stencil computations are a key class of applications, widely used in the scientific computing community, and a class that has particularly benefited from performance improvements on architectures with high memory bandwidth. Unfortunately, such architectures come with a limited amount of fast memory, which is limiting the size of the problems that can be efficiently solved. […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: