Sep, 8

Contributions to the Efficient Use of General Purpose Coprocessors: Kernel Density Estimation as Case Study

The high performance computing landscape is shifting from assemblies of homogeneous nodes towards heterogeneous systems, in which nodes consist of a combination of traditional out-oforder execution cores and accelerator devices. Accelerators, built around GPUs, many-core chips, or FPGAs, are used to offload compute-intensive tasks. These devices provide superior theoretical performance compared to traditional multi-core CPUs, […]
Sep, 8

Accelerating Web Search using GPUs

The amount of content on the Internet is growing rapidly as well as the number of the online Internet users. As a consequence, web search engines need to increase their computing capabilities and data continually while maintaining low search latency and without a significant rise in the cost per query. To serve this larger numbers […]
Sep, 7

A Survey Of Architectural Techniques for Near-Threshold Computing

Energy efficiency has now become the primary obstacle in scaling the performance of all classes of computing systems. Low-voltage computing and specifically, near-threshold voltage computing (NTC), which involves operating the transistor very close to and yet above its threshold voltage, holds the promise of providing many-fold improvement in energy efficiency. However, use of NTC also […]
Sep, 5

Waste Not, Want Not! Managing relational data in asymmetric memories

In this thesis, we study the management of relational data in modern, i.e., asymmetric computer systems. We explore different strategies to identify asymmetries in persistent data, map them to asymmetries in the memory landscape and, eventually, exploit them to increase query processing performance. To this end, we study memory conscious decomposition and storage of data […]
Sep, 5

Virtualizing Data Parallel Systems for Portability, Productivity, and Performance

Computer systems equipped with graphics processing units (GPUs) have become increasingly common over the last decade. In order to utilize the highly data parallel architecture of GPUs for general purpose applications, new programming models such as OpenCL and CUDA were introduced, showing that data parallel kernels on GPUs can achieve speedups by several orders of […]
Sep, 5

Parallel Execution of the ASP Computation – an Investigation on GPUs

This paper illustrates the design and implementation of a conflict-driven ASP solver that is capable of exploiting the Single-Instruction Multiple-Thread parallelism offered by General Purpose Graphical Processing Units (GPUs). Modern GPUs are multi-core platforms, providing access to large number of cores at a very low cost, but at the price of a complex architecture with […]
Sep, 5

Convolutional Neural Network for Sentence Classification

The goal of a Knowledge Base-supported Question Answering (KB-supported QA) system is to answer a query natural language by obtaining the answer from a knowledge database, which stores knowledge in the form of (entity, relation, value) triples. QA systems understand questions by extracting entity and relation pairs. This thesis aims at recognizing the relation candidates […]
Sep, 5

On GPU-Accelerated Fast Direct Solvers and Their Applications in Image Denoising

This dissertation focuses on block cyclic reduction (BCR) type fast direct solvers, graphics processing unit (GPU) computation, and image denoising. The fast direct solvers are specialized methods for solving certain types of linear systems. They take into account specific characteristics of the system and are therefore able to solve the system much more efficiently than […]
Sep, 3

Advanced Simulation Library: Expanding software ecosystem for the DSP/FPGA/GPU market

Advanced Simulation Library is a free and open source multiphysics simulation software package and a tool for solving Partial Differential Equations. It has significant user base across many areas of engineering and science, from both industrial and academic organizations. ASL utilizes only the methods that allow efficient parallelization: Lattice Boltzmann Methods, Explicit Finite Difference, Matrix […]
Sep, 3

International Conference on Mechanical Engineering and Electrical Systems (ICMES 2015), 2015

All accepted and registered papers will be recommended to publish into the journals below: * Journal of Electronic Science and Technology (JEST, ISSN: 1674-862X) Abstracting/Indexing: SCOPUS; EI INSPECT; included in DOAJ, CA, CSCD, CNKI, Sciencepaper Online. * International Journal of Mechanical Engineering and Robotics Research (IJMERR, ISSN: 2278-0149, DOI: 10.18178) Abstracting/ Indexing: Index Corpernicus, ProQuest, […]
Sep, 3

International Conference on Computer Systems and Instrumentation (ICCSI), 2015

Submission Deadline: 2015.10.05 Submission Method: Please log in the Electronic Submission System (.pdf only) to submit your paper: http://www.easychair.org/conferences/?conf=iccsi2015 Registration: http://www.iccsi.org/reg.html (250-350USD) Topic: (http://www.iccsi.org/cfp.html) Systems theory and practice Systems engineering System Instrumentation Embedded systems and systems-on-the-chip Target-oriented systems [emulation, simulation, prediction, etc.] Specialized systems [sensor-based, mobile, multimedia, biometrics, etc.] Validation systems Security and protection systems […]
Sep, 3

Fast GPU-based calculations in few-body quantum scattering

A principally novel approach towards solving the few-particle (many-dimensional) quantum scattering problems is described. The approach is based on a complete discretization of few-particle continuum and usage of massively parallel computations of integral kernels for scattering equations by means of GPU. The discretization for continuous spectrum of a few-particle Hamiltonian is realized with a projection […]
Page 5 of 831« First...34567...102030...Last »

* * *

* * *

Follow us on Twitter

HGPU group

1577 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

293 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: