14527

Posts

Sep, 8

Accelerating Web Search using GPUs

The amount of content on the Internet is growing rapidly as well as the number of the online Internet users. As a consequence, web search engines need to increase their computing capabilities and data continually while maintaining low search latency and without a significant rise in the cost per query. To serve this larger numbers […]
Sep, 7

A Survey Of Architectural Techniques for Near-Threshold Computing

Energy efficiency has now become the primary obstacle in scaling the performance of all classes of computing systems. Low-voltage computing and specifically, near-threshold voltage computing (NTC), which involves operating the transistor very close to and yet above its threshold voltage, holds the promise of providing many-fold improvement in energy efficiency. However, use of NTC also […]
Sep, 5

Waste Not, Want Not! Managing relational data in asymmetric memories

In this thesis, we study the management of relational data in modern, i.e., asymmetric computer systems. We explore different strategies to identify asymmetries in persistent data, map them to asymmetries in the memory landscape and, eventually, exploit them to increase query processing performance. To this end, we study memory conscious decomposition and storage of data […]
Sep, 5

Virtualizing Data Parallel Systems for Portability, Productivity, and Performance

Computer systems equipped with graphics processing units (GPUs) have become increasingly common over the last decade. In order to utilize the highly data parallel architecture of GPUs for general purpose applications, new programming models such as OpenCL and CUDA were introduced, showing that data parallel kernels on GPUs can achieve speedups by several orders of […]
Sep, 5

Parallel Execution of the ASP Computation – an Investigation on GPUs

This paper illustrates the design and implementation of a conflict-driven ASP solver that is capable of exploiting the Single-Instruction Multiple-Thread parallelism offered by General Purpose Graphical Processing Units (GPUs). Modern GPUs are multi-core platforms, providing access to large number of cores at a very low cost, but at the price of a complex architecture with […]
Sep, 5

Convolutional Neural Network for Sentence Classification

The goal of a Knowledge Base-supported Question Answering (KB-supported QA) system is to answer a query natural language by obtaining the answer from a knowledge database, which stores knowledge in the form of (entity, relation, value) triples. QA systems understand questions by extracting entity and relation pairs. This thesis aims at recognizing the relation candidates […]
Sep, 5

On GPU-Accelerated Fast Direct Solvers and Their Applications in Image Denoising

This dissertation focuses on block cyclic reduction (BCR) type fast direct solvers, graphics processing unit (GPU) computation, and image denoising. The fast direct solvers are specialized methods for solving certain types of linear systems. They take into account specific characteristics of the system and are therefore able to solve the system much more efficiently than […]
Sep, 3

Advanced Simulation Library: Expanding software ecosystem for the DSP/FPGA/GPU market

Advanced Simulation Library is a free and open source multiphysics simulation software package and a tool for solving Partial Differential Equations. It has significant user base across many areas of engineering and science, from both industrial and academic organizations. ASL utilizes only the methods that allow efficient parallelization: Lattice Boltzmann Methods, Explicit Finite Difference, Matrix […]
Sep, 3

International Conference on Mechanical Engineering and Electrical Systems (ICMES 2015), 2015

All accepted and registered papers will be recommended to publish into the journals below: * Journal of Electronic Science and Technology (JEST, ISSN: 1674-862X) Abstracting/Indexing: SCOPUS; EI INSPECT; included in DOAJ, CA, CSCD, CNKI, Sciencepaper Online. * International Journal of Mechanical Engineering and Robotics Research (IJMERR, ISSN: 2278-0149, DOI: 10.18178) Abstracting/ Indexing: Index Corpernicus, ProQuest, […]
Sep, 3

International Conference on Computer Systems and Instrumentation (ICCSI), 2015

Submission Deadline: 2015.10.05 Submission Method: Please log in the Electronic Submission System (.pdf only) to submit your paper: http://www.easychair.org/conferences/?conf=iccsi2015 Registration: http://www.iccsi.org/reg.html (250-350USD) Topic: (http://www.iccsi.org/cfp.html) Systems theory and practice Systems engineering System Instrumentation Embedded systems and systems-on-the-chip Target-oriented systems [emulation, simulation, prediction, etc.] Specialized systems [sensor-based, mobile, multimedia, biometrics, etc.] Validation systems Security and protection systems […]
Sep, 3

OpenCL 2.0 for FPGAs using OCLAcc

Designing hardware is a time-consuming and complex process. Realization of both, embedded and high-performance applications can benefit from a design process on a higher level of abstraction. This helps to reduce development time and allows to iteratively test and optimize the hardware design during development, as common in software development. We present our tool, OCLAcc, […]
Sep, 3

Exploiting Hyper-Loop Parallelism in Vectorization to Improve Memory Performance on CUDA GPGPU

Memory performance is of great importance to achieve high performance on the Nvidia CUDA GPU. Previous work has proposed specific optimizations such as thread coarsening, caching data in shared memory, and global data layout transformation. We argue that vectorization based on hyper loop parallelism can be used as a unified technique to optimize the memory […]

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: