Oct, 24

Deep Voice 3: 2000-Speaker Neural Text-to-Speech

We present Deep Voice 3, a fully-convolutional attention-based neural text-to-speech (TTS) system. Deep Voice 3 matches state-of-the-art neural speech synthesis systems in naturalness while training ten times faster. We scale Deep Voice 3 to data set sizes unprecedented for TTS, training on more than eight hundred hours of audio from over two thousand speakers. In […]
Oct, 24

BENCHIP: Benchmarking Intelligence Processors

The increasing attention on deep learning has tremendously spurred the design of intelligence processing hardware. The variety of emerging intelligence processors requires standard benchmarks for fair comparison and system optimization (in both software and hardware). However, existing benchmarks are unsuitable for benchmarking intelligence processors because they lack diversity and representativeness. Also, the lack of a […]
Oct, 24

A Fast and Generic GPU-Based Parallel Reduction Implementation

Reduction operations are extensively employed in many computational problems. A reduction combines all elements of a finite set of numeric values into a single value by means of a combiner function. A parallel reduction, in turn, is the same operation performed concurrently when multiple execution units are available. The current […]
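
The definition above can be illustrated with a minimal sketch of a pairwise (tree) reduction. It is not the implementation from the paper: the function name `tree_reduce` and its parameters are hypothetical, and the code runs sequentially on the CPU. It only shows the combining pattern that a parallel version would distribute across execution units, and it assumes the combiner function is associative.

```python
from operator import add


def tree_reduce(values, combine, identity):
    """Reduce `values` to one value by combining adjacent pairs in passes.

    Hypothetical helper for illustration only. `combine` is assumed to be
    associative; `identity` is returned for an empty input.
    """
    data = list(values)
    if not data:
        return identity
    # Each pass combines disjoint adjacent pairs. The pairs in one pass do
    # not depend on each other, so a parallel version could assign each
    # pair to its own execution unit.
    while len(data) > 1:
        paired = [combine(data[i], data[i + 1])
                  for i in range(0, len(data) - 1, 2)]
        if len(data) % 2 == 1:
            # An odd element has no partner in this pass; carry it forward.
            paired.append(data[-1])
        data = paired
    return data[0]


if __name__ == "__main__":
    print(tree_reduce(range(1, 9), add, 0))
    print(tree_reduce([3, 1, 4, 1, 5], max, float("-inf")))
```

With n elements the loop runs about log2(n) passes, which is why this pattern suits hardware with many execution units. Combining adjacent pairs keeps the operands in their original order, so the sketch needs associativity but not commutativity.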
Oct, 24

Parallel Computing for the Inverse of SPD matrix

In this paper, we propose a high-performance parallel computing method for the inverse of a symmetric positive definite (SPD) matrix. By reusing the inverses of diagonal sub-blocks and building on the latest OpenCL parallel computing framework, this method can effectively improve the computation of the inverse of an SPD matrix. Computing the […]
Oct, 24

GPU acceleration and performance of the particle-beam-dynamics code Elegant

Elegant is an accelerator physics and particle-beam dynamics code widely used for modeling and design of a variety of high-energy particle accelerators and accelerator-based systems. In this paper we discuss a recently developed version of the code that can take advantage of CUDA-enabled graphics processing units (GPUs) to achieve significantly improved performance for a large […]
Oct, 24

Architecting SOT-RAM Based GPU Register File

With the increase in GPU register file (RF) size, its power consumption has also increased. Since the RF sits at the highest level of the cache hierarchy, designing it with memories that have high write latency/energy (e.g., spin transfer torque RAM) can lead to large energy loss. In this paper, we present a spin orbit torque RAM (SOT-RAM) based […]
Oct, 22

The Sixth International Workshop on Power-Efficient GPU and Many-core Computing (PEGPUM), 2018

The recent success of advanced mobile platforms coincides with the rising challenge of ensuring a long battery life, and accompanies a larger trend away from increasing processor clock speeds in favor of increasing parallelism. That high performance computing (HPC) is also strongly motivated in this area, as witnessed by the recent Green500 List project, illustrates […]
Oct, 22

10th International Conference on Bioinformatics and Biomedical Technology (ICBBT), 2018

ICBBT 2018 aims to bring together innovative academics and industrial experts in the field of Bioinformatics and Biomedical Technology in a common forum. The primary goal of the conference is to promote research and developmental activities in Bioinformatics and Biomedical Technology. Another goal is to promote scientific information interchange between researchers, developers, engineers, students, and […]
Oct, 22

The International Conference on Machine Vision and Applications (ICMVA), 2018

The field of machine vision and its applications has been growing at a fast pace. As in most fast-developing fields, not all aspects of machine vision that are of interest to active researchers are useful to the designers and users of a vision system for a specific application. This conference is intended to provide a balanced […]
Oct, 22

The 10th International Conference on Digital Image Processing (ICDIP), 2018

History: ICDIP conferences have been held annually in Bangkok (Thailand), Singapore, Chengdu (China) (2011, 2016), Kuala Lumpur (Malaysia), Beijing (China), Athens (Greece), Los Angeles (USA), and Hong Kong since 2009. So far the previous eight conference proceedings have been indexed by Ei Compendex and Scopus successfully! Publication: Accepted (Registered and Presented) papers will be collected […]
Oct, 22

2nd International Conference on Information System and Data Mining (ICISDM), 2018

The conference will take place at Florida Polytechnic University, Florida, USA, during April 9-11, 2018. The aim of ICISDM 2018 is to present the latest research and results of scientists working in the fields related to Information System and Data Mining. This Symposium provides opportunities for the delegates to exchange […]
Oct, 21

Wilson and Domainwall Kernels on Oakforest-PACS

We report the performance of Wilson and Domainwall Kernels on a new Intel Xeon Phi Knights Landing based machine named Oakforest-PACS, which is co-hosted by the University of Tokyo and Tsukuba University and is currently the fastest in Japan. This machine uses Intel Omni-Path for the internode network. We compare performance with several types of implementation including […]

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors
