17616

Posts

Oct, 22

The Sixth International Workshop on Power-Efficient GPU and Many-core Computing (PEGPUM), 2018

The recent success of advanced mobile platforms coincides with the rising challenge of ensuring a long battery life, and accompanies a larger trend away from increasing processor clock speeds in favor of increasing parallelism. That high performance computing (HPC) is also strongly motivated in this area, as witnessed by the recent Green500 List project, illustrates […]
Oct, 22

10th International Conference on Bioinformatics and Biomedical Technology (ICBBT), 2018

CBBT 2018 is to bring together innovative academics and industrial experts in the field of Bioinformatics and Biomedical Technology to a common forum. The primary goal of the conference is to promote research and developmental activities in Bioinformatics and Biomedical Technology. Another goal is to promote scientific information interchange between researchers, developers, engineers, students, and […]
Oct, 22

The International Conference on Machine Vision and Applications (ICMVA), 2018

The field of machine vision and application, has been growing at a fast pace. As in most fast-developing fields, not all aspects of machine vision that are of interest to active researchers are useful to the designers and users of a vision system for a specific application. This conference is intended to provide a balanced […]
Oct, 22

The 10th International Conference on Digital Image Processing (ICDIP), 2018

History ICDIP conferences have been held annually in Bangkok (Thailand), Singapore, Chengdu (China) (2011, 2016), Kuala Lumpur (Malaysia), Beijing (China), Athens (Greece), Los Angeles (USA), and Hong Kong since 2009. So far the previous eight conference proceedings have been indexed by Ei Compendex and Scopus successfully! Publication Accepted (Registered and Presented) papers will be collected […]
Oct, 22

2nd International Conference on Information System and Data Mining (ICISDM), 2018

The conference will take place at the Florida Polytechnic University, Florida, USA, during April 9-11, 2018. The aim as well as objective of ICISDM 2018 is to present the latest research and results of scientists working in the fields related to Information System and Data Mining. This Symposium provides opportunities for the delegates to exchange […]
Oct, 21

Wilson and Domainwall Kernels on Oakforest-PACS

We report the performance of Wilson and Domainwall Kernels on a new Intel Xeon Phi Knights Landing based machine named Oakforest-PACS, which is co-hosted by University of Tokyo and Tsukuba University and is currently fastest in Japan. This machine uses Intel Omni-Path for the internode network. We compare performance with several types of implementation including […]
Oct, 21

Revisiting the Case of ARM SoCs in High-Performance Computing Clusters

Over the course of the past decade, the explosive popularity of embedded devices such as smartphones and tablets have given rise to ARM SoCs, whose characteristically low power consumption have made them ideal for these types of embedded devices. Recent maturation in the ARM SoC market, which has seen the advent of more powerful 64-bit […]
Oct, 21

Parallel Matching and Clustering Algorithms on GPUs

The main focus of this thesis is on developing efficient algorithms on GPUs for certain matching and clustering problems. Through extensive experiments we show that sparse and unstructured problems can benefit greatly from using GPUs as long as the algorithms are carefully designed. Even though none of the presented algorithms are fundamentally new, they still […]
Oct, 21

How to distribute most efficiently a computation intensive calculation on an Android device to external compute units with an Android API

Is transferring computation intensive calculations to external compute-units the next trend? This master’s thesis researches if it is worth the effort to transfer a matrix multiplication from an Android phone to a System-on-Chip (SoC), using Bluetooth or WebSocket as communication protocols. The SoC solution used in this work is an Intel Altera Cyclone V based […]
Oct, 21

Computation of gray-level co-occurrence matrix based on CUDA and its optimization

As in various fields like scientific research and industrial application, the computation time optimization is becoming a task that is of increasing importance because of its highly parallel architecture. The graphics processing unit is regarded as a powerful engine for application programs that demand fairly high computation capabilities. Based on this, an algorithm was introduced […]
Oct, 15

Flexible FPGA design for FDTD using OpenCL

Compared to classical HDL designs, generating FPGA with high-level synthesis from an OpenCL specification promises easier exploration of different design alternatives and, through ready-to-use infrastructure and common abstractions for host and memory interfaces, easier portability between different FPGA families. In this work, we evaluate the extent of this promise. To this end, we present a […]
Oct, 15

Toward Performance Portability for CPUs and GPUs Through Algorithmic Compositions

The diversity of microarchitecture designs in heterogeneous computing systems allows programs to achieve high performance and energy efficiency, but results in substantial software redevelopment cost for each type or generation of hardware. To mitigate this cost, a performance portable programming system is required. This work presents my solution to the performance portability problem. I argue […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: