15685

Posts

Apr, 9

Optimizing Performance of Recurrent Neural Networks on GPUs

As recurrent neural networks become larger and deeper, training times for single networks are rising into weeks or even months. As such there is a significant incentive to improve the performance and scalability of these networks. While GPUs have become the hardware of choice for training and deploying recurrent models, the implementations employed often make […]
Apr, 9

dMath: A Scalable Linear Algebra and Math Library for Heterogeneous GP-GPU Architectures

A new scalable parallel math library, dMath, is presented in this paper that demonstrates leading scaling when using intranode, or internode, hybrid-parallelism for deep-learning. dMath provides easy-to-use distributed base primitives and a variety of domain-specific algorithms. These include matrix multiplication, convolutions, and others allowing for rapid development of highly scalable applications, including Deep Neural Networks […]
Apr, 9

The 8th International Conf. on Signal Processing Systems (ICSPS), 2016

Publication Accepted papers will be published in the conference proceedings, which will be indexed by EI Compendex; SCOPUS; ULRICH’s Periodicals Directory; INSPEC; etc. Agenda November 21, 2016 – Registration & Conference Materials Collection November 22, 2016 – workshop November 23, 2016 – Keynote Speeches & Participants’ Oral Presentation November 24, 2016 – Academic Visiting The […]
Apr, 9

Internaitonal Conf. on Biomedical Signal and Bioinformatics (ICBSB), 2016

Schedule November 21, 2016 (Monday) Participants Onsite Registration & Conference Materials Collection November 22, 2016 (Tuesday) Opening Ceremony and Keynote Speeches Participants’ Oral Presentation Excellent Paper Awards Ceremony & Dinner Banquet November 23, 2016 (Wednesday) Academic Visit November 24, 2016 (Thusday) Tutorial Registration Tutorial   Conference Venue The Sir Paul Reeves Building at AUT Address: […]
Apr, 9

5th International Conf. on Bioinformatics and Biomedical Science (ICBBS), 2016

ICBBS 2016 Shining Points: 1.Accepted and published papers can be indexed by Embase (Under elsevier) and other data base. 2.Three Outstanding Professors from local Indonesia, Thailand and USA have joined as Keynote Speakers. They are Prof. Tjokorda Gde Tirta Nindhia from Udayana University, Indonesia, Prof. Orawan Siriratpiriya from Environmental Research Institute of Chulaongkorn University, Thailand, […]
Apr, 9

5th International Workshops on Database and Data Mining (ICDDM 2016), 2016

Confernce website: http://www.icddm.org/guide.html Confernce Venue Tibet Hotel, Chengdu —- North Renmin Road, Chengdu, Sichuan Hotel Tel. 86-28 -83183388-2588 | Fax :86 -28-83185678 Hotel website: http://en.tibet-hotel.com/ ★★★All papers of ICDDM 2016 which will be indexed by ★Ei Compendex★★★ Call for paper All topics about Data mining foundations/Mining in targeted application contexts/Methodological aspects and the KDD process/Integrated […]
Apr, 9

3rd Int. Workshop on Networks and Information Security (ICNIS), 2016

As the workshop of ICIVC 2016. It is hosted at *University of Portsmouth, UK*.   Paper Publication All accepted papers must be written in English and will be published into #IEEE conference proceedings#, indexed by **Ei Compendex**.   Conference Chair Prof. Branislav Vuksanovic, University of Portsmouth, Portsmouth, UK Prof. Shahram Latifi, UNIVERSITY OF NEVADA, USA […]
Apr, 9

GLoP: Enabling Massively Parallel Incident Response Through GPU Log Processing

Large industrial systems that combine services and applications, have become targets for cyber criminals and are challenging from the security, monitoring and auditing perspectives. Security log analysis is a key step for uncovering anomalies, detecting intrusion, and enabling incident response. The constant increase of link speeds, threats and users, produce large volumes of log data […]
Apr, 9

A Highly-Efficient Memory-Compression Scheme for GPU-Accelerated Intrusion Detection Systems

Pattern Matching is a computationally intensive task used in many research fields and real world applications. Due to the ever-growing volume of data to be processed, and increasing link speeds, the number of patterns to be matched has risen significantly. In this paper we explore the parallel capabilities of modern General Purpose Graphics Processing Units […]
Apr, 6

An Exploratory Study of High Performance Graphics Application Programming Interfaces

This study was conducted to take an in depth look at the newest application programming interfaces (API) offered to graphics programmers. With the recent releases of Vulkan (2016) and DirectX 12 (2015) from industry giants like the Khronos Group and Microsoft, it’s clear they are pushing for a much lowerlevel, closer-to-hardware approach for future graphics […]
Apr, 6

gpucc: an open-source GPGPU compiler

Graphics Processing Units have emerged as powerful accelerators for massively parallel, numerically intensive workloads. The two dominant software models for these devices are NVIDIA’s CUDA and the cross-platform OpenCL standard. Until now, there has not been a fully open-source compiler targeting the CUDA environment, hampering general compiler and architecture research and making deployment difficult in […]
Apr, 6

Automatic test case reduction of randomly generated OpenCL kernels

While software developers can review their programs to fix bugs, they might not be able to patch a compiler bug due to which their perfectly valid programs misbehave. Moreover, even a formal verification of source code provides no guarantees if the code is miscompiled. For these reasons great effort is spent to increase the reliability […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: