15848

Posts

May, 9

A Graph-based Model for GPU Caching Problems

Modeling data sharing in GPU programs is a challenging task because of the massive parallelism and complex data sharing patterns provided by GPU architectures. Better GPU caching efficiency can be achieved through careful task scheduling among different threads. Traditionally, in the field of parallel computing, graph partition models are used to model data communication and […]
May, 9

Training Neural Networks Without Gradients: A Scalable ADMM Approach

With the growing importance of large network models and enormous training datasets, GPUs have become increasingly necessary to train neural networks. This is largely because conventional optimization algorithms rely on stochastic gradient methods that don’t scale well to large numbers of cores in a cluster setting. Furthermore, the convergence of all gradient methods, including batch […]
May, 9

Parallelizing Word2Vec in Shared and Distributed Memory

Word2Vec is a widely used algorithm for extracting low-dimensional vector representations of words. It generated considerable excitement in the machine learning and natural language processing (NLP) communities recently due to its exceptional performance in many NLP applications such as named entity recognition, sentiment analysis, machine translation and question answering. State-of-the-art algorithms including those by Mikolov […]
May, 7

Parallel Wavelet Schemes for Images

In this paper, we introduce several new schemes for calculation of discrete wavelet transforms of images. These schemes reduce the number of steps and, as a consequence, allow to reduce the number of synchronizations on parallel architectures. As an additional useful property, the proposed schemes can reduce also the number of arithmetic operations. The schemes […]
May, 7

Going Deeper with Embedded FPGA Platform for Convolutional Neural Network

In recent years, Convolutional Neural Network (CNN) based methods have achieved great success in a large number of applications and have been among the most powerful and widely used techniques in computer vision. However, CNN-based methods are computational-intensive and resource-consuming, and thus are hard to be integrated into embedded systems such as smart phones, smart […]
May, 7

JIT-Compilation for Interactive Scientific Visualization

Due to the proliferation of mobile devices and cloud computing, remote simulation and visualization have become increasingly important. In order to reduce bandwidth and (de)serialization costs, and to improve mobile battery life, we examine the performance and bandwidth benefits of using an optimizing query compiler for remote postprocessing of interactive and in-situ simulations. We conduct […]
May, 7

TheanoLM – An Extensible Toolkit for Neural Network Language Modeling

We present a new tool for training neural network language models (NNLMs), scoring sentences, and generating text. The tool has been written using Python library Theano, which allows researcher to easily extend it and tune any aspect of the training process. Regardless of the flexibility, Theano is able to generate extremely fast native code that […]
May, 7

Parallel Pairwise Correlation Computation On Intel Xeon Phi Clusters

Co-expression network is a critical technique for the identification of inter-gene interactions, which usually relies on all-pairs correlation (or similar measure) computation between gene expression profiles across multiple samples. Pearson’s correlation coefficient (PCC) is one widely used technique for gene co-expression network construction. However, all-pairs PCC computation is computationally demanding for large numbers of gene […]
May, 4

3th International Conference on Artificial Intelligence (ICOAI), 2016

Sponsors sponsored by Limerick Institute of Technology, University of Limerick (UL), Mary Immaculate College (MIC) and Irish Technology Leadership Group (ITLG). Conference Chair Prof. Seppo J. Sirkemaa, University of Turku, Finland Local Organising Co-chair Jerald Cavanagh, Limerick Institute of Technology, Ireland Padraig Kirby, Limerick Institute of Technology, Ireland Paper Publication Paper accepted by ICOAI 2016 […]
May, 4

3rd Intl. Conference on Soft Computing and Machine Intelligence, 2016

Topics of interest for submission include, but are not limited to: Advanced Intelligent Systems; Ant Colony Optimization and Swarm Intelligence; Artificial Immune Systems; Artificial Intelligence; Artificial Life; Associative Memory; Automatic Annotation; Bioinformatics and Biological Computing; Case-Based and Temporal Reasoning;   Conference Schedule November 23, 2016: Registration and Collecting conference materials November 24, 2016: Keynote speeches […]
May, 4

Post-Moore’s Era Supercomputing Workshop (PMES), 2016

This interdisciplinary workshop is organized to explore the scientific issues, challenges, and opportunities for supercomputing beyond the scaling limits of Moore’s Law, with the ultimate goal of keeping supercomputing at the forefront of computing technologies beyond the physical and conceptual limits of current systems. Continuing progress of supercomputing beyond the scaling limits of Moore’s Law […]
May, 4

International Conf. on System Engineering Management (ICSEM), 2016

Publication International Journal of Modeling and Optimization (ISSN: 2010-3697) Indexed by Engineering & Technology Digital Library, ProQuest, Crossref, Electronic Journals Library, DOAJ, Google Scholar, EI (INSPEC, IET). Conference Schedule 1) July 22, 2016—Conference Materials Collection 2) July 23, 2016—Keynote Speeches & Oral Presentations 3) July 24, 2016—One-day tour in Beijing Conference Chairs Prof. Dr. Chen-Huei […]

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org