14549

Posts

Sep, 10

2nd International Conference on Knowledge (ICK), 2016

Topics: T1 • Novel Algorithms T2 • Association Rules T3 • Knowledge engineering and management T4 • Classification and T5 • Clustering T6 • Text analysis and text understanding T7 • Machine Learning T8 • Privacy Preserving Data Mining T9 • Statistical Methods T10 • Parallel and Distributed Data Mining T11 • Interactive and Online […]
Sep, 10

International Conference on Advances in Mechanical Design (ICAMD), 2016

Submission Methods: Please log in Electronic Submission System (.pdf). http://www.easychair.org/conferences/?conf=icamd2016 Paper Publication: Paper accepted by ICAMD 2016 will be published in one of the following publications after review process. * International Journal of Mechanical Engineering and Robotics Research (ISSN: 2278-0149) Indexing: Index Corpernicus, ProQuest, UDL, Google Scholar, Open J-Gate; etc. Call 4 Papers: Actuator Systems […]
Sep, 10

7th International Conference on Mechatronics and Manufacturing (ICMM), 2016

Submission Methods: Please log in Electronic Submission System (.pdf). http://www.easychair.org/conferences/?conf=icmm2016 Paper Publication: Paper accepted by ICMM 2016 will be published in one of the following publications after review process. *Applied Mechanics and Materials Journal (ISSN: 1660-9336) Indexing: Volumes are submitted for indexing to Elsevier: SCOPUS and Ei Compendex (CPX). Cambridge Scientific Abstracts (CSA), Chemical Abstracts […]
Sep, 9

Experimentation Procedure for Offloaded Mini-Apps Executed on Cluster Architectures with Xeon Phi Accelerators

A heterogeneous cluster architecture is complex. It contains hundreds, or thousands of devices connected by a tiered communication system in order to solve a problem. As a heterogeneous system, these devices will have varying performance capabilities. To better understand the interactions which occur between the various devices during execution, an experimentation procedure has been devised […]
Sep, 9

Parallel waveform extraction algorithms for the Cherenkov Telescope Array Real-Time Analysis

The Cherenkov Telescope Array (CTA) is the next generation observatory for the study of very high-energy gamma rays from about 20 GeV up to 300 TeV. Thanks to the large effective area and field of view, the CTA observatory will be characterized by an unprecedented sensitivity to transient flaring gamma-ray phenomena compared to both current […]
Sep, 9

A Performance Comparison of Algebraic Multigrid Preconditioners on CPUs, GPUs, and Xeon Phis

Algebraic multigrid preconditioners for accelerating iterative solvers are a popular choice for a broad range of applications, because they are able to obtain asymptotic optimality, yet can be applied in a black-box manner. However, only a few variants of algebraic multigrid preconditioners can fully benefit from finegrained parallelization available on multi- and many-core architectures. Previous […]
Sep, 9

Three storage formats for sparse matrices on GPGPUs

The multiplication of a sparse matrix by a dense vector is a centerpiece of scientific computing applications: it is the essential kernel for the solution of sparse linear systems and sparse eigenvalue problems by iterative methods. The efficient implementation of the sparse matrixvector multiplication is therefore crucial and has been the subject of an immense […]
Sep, 9

Dissecting GPU Memory Hierarchy through Microbenchmarking

Memory access efficiency is a key factor in fully utilizing the computational power of graphics processing units (GPUs). However, many details of the GPU memory hierarchy are not released by GPU vendors. In this paper, we propose a novel fine-grained microbenchmarking approach and apply it to three generations of NVIDIA GPUs, namely Fermi, Kepler and […]
Sep, 8

Accelerating Multiple Compound Comparison Using LINGO-based Load-Balancing Strategies on Multi-GPUs

Compound comparison is an important task for the computational chemistry. By the comparison results, potential inhibitors can be found and then used for the pharmacy experiments. The time complexity of a pairwise compound comparison is O(n^2), where n is the maximal length of compounds. In general, the length of compounds is tens to hundreds, and […]
Sep, 8

Assessing the hardness of SVP algorithms in the presence of CPUs and GPUs

Lattice-based cryptography has been a hot topic in the past decade, since it is believed that lattice-based cryptosystems are immune against attacks operated by quantum computers. The security of this type of cryptography is based on the hardness of algorithms that solve lattice-based problems, namely the Shortest Vector Problem (SVP). Therefore, it is important to […]
Sep, 8

Contributions to the Efficient Use of General Purpose Coprocessors: Kernel Density Estimation as Case Study

The high performance computing landscape is shifting from assemblies of homogeneous nodes towards heterogeneous systems, in which nodes consist of a combination of traditional out-oforder execution cores and accelerator devices. Accelerators, built around GPUs, many-core chips, or FPGAs, are used to offload compute-intensive tasks. These devices provide superior theoretical performance compared to traditional multi-core CPUs, […]
Sep, 8

Accelerating Web Search using GPUs

The amount of content on the Internet is growing rapidly as well as the number of the online Internet users. As a consequence, web search engines need to increase their computing capabilities and data continually while maintaining low search latency and without a significant rise in the cost per query. To serve this larger numbers […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: