Papers on (.txt-file)

“Local Rank Differences” Image Feature Implemented on GPU Download

.NET High Performance Computing Download

10×10: A General-purpose Architectural Approach to Heterogeneity and Energy Efficiency Download

190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs Download

2D and 3D level-set algorithms on GPU

2D Triangulation of Polygons on CUDA Download

2D/3D image registration on the GPU Download

2HOT: An Improved Parallel Hashed Oct-Tree N-Body Algorithm for Cosmological Simulation Download

2PARMA: Parallel Paradigms and Run-time Management Techniques for Many-Core Architectures Download

3-SAT on CUDA: Towards a massively parallel SAT solver Download

3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs Download

3D Edge Bundling for Geographical Data Visualization Download

3D finite difference computation on GPUs using CUDA Download

3D finite element numerical integration on GPUs Download

3D GPU Architecture using Cache Stacking: Performance, Cost, Power and Thermal analysis Download


3D Hydrodynamic Simulation of Classical Nova Explosions Download

3D Information Extraction Based on GPU Download

3D Modeling, Distance and Gradient Computation for Motion Planning: A Direct GPGPU Approach Download

3D Non-Local Means denoising via multi-GPU Download

3D nonrigid registration via optimal mass transport on the GPU Download

3D Recursive Gaussian IIR on GPU and FPGAs: A Case Study for Accelerating Bandwidth-Bounded Applications Download

3D Registration Based on Normalized Mutual Information: Performance of CPU vs. GPU Implementation Download

3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy Download

3D vision of electromagnetic fields in antenna and microwave technique

3D-color video camera Download

3DES ECB Optimized for Massively Parallel CUDA GPU Architecture Download

3I: A tool for visualizing and processing in parallel 2D & 3D images Download

42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence Download

4kUHD H264 wireless live video streaming using CUDA Download

5.6: GPU enhancement of FDTD-PIC plasma-wave simulations

A (ir)regularity-aware task scheduler for heterogeneous platforms Download

A 3D Convex Hull Algorithm for Graphics Hardware Download Package

A 3D radiative transfer framework. VIII. OpenCL implementation Download

A 3D radiative transfer framework: XIII. OpenCL implementation Download

A 57mW embedded mixed-mode neuro-fuzzy accelerator for intelligent multi-core processor

A balanced programming model for emerging heterogeneous multicore systems Download

A Batched GPU Algorithm for Set Intersection Download

A Bi-objective Optimization Framework for Query Plans Download

A biomolecular electrostatics solver using Python, GPUs and boundary elements that can handle solvent-filled cavities and Stern layers Download Package

A block-asynchronous relaxation method for graphics processing units Download

A Braille Conversion Service Using GPU and Human Interaction by Computer Vision Download

A breadth-first course in multicore and manycore programming Download

A capabilities-aware framework for using computational accelerators in data-intensive computing Download

A case for neuromorphic ISAs Download

A Case Study for Petascale Applications in Astrophysics: Simulating Gamma-Ray Bursts Download

A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU

A case study on porting scientific applications to GPU/CUDA Download

A CG-based Poisson solver on a GPU-cluster

A characterization and analysis of PTX kernels Download

A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads Download

A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU

A class of communication-avoiding algorithms for solving general dense linear systems on CPU/GPU parallel machines Download

A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures Download

A closer look at GPUs Download

A Cloud Computing Service Architecture of a Parallel Algorithm Oriented to Scientific Computing with CUDA and Monte Carlo Download

A cluster for CS education in the manycore era Download

A Co-Prime Blur Scheme for Data Security in Video Surveillance Download

A Coarse Grain Reconfigurable Architecture for sequence alignment problems in bio-informatics

A code motion technique for accelerating general-purpose computation on the GPU Download

A Code Optimization Framework for Performance Portability of GPU Kernels onto Custom Accelerators Download

A Code Transformation Framework for Scientific Applications on Structured Grids Download

A code-based analytical approach for using separate device coprocessors in computing systems

A collision detection algorithm using adaptive particle sensor

A Common GPU n-Dimensional Array for Python and C Download Package

A Comparative Analysis of GPU Implementations of Spectral Unmixing Algorithms Download

A comparative analysis of the performance and deployment overhead of parallelized Finite Difference Time Domain (FDTD) algorithms on a selection of high performance multiprocessor computing systems Download

A comparative benchmarking of the FFT on Fermi and Evergreen GPUs

A comparative study of GPU programming models and architectures using neural networks

A Comparative Study of Neighborhood Filters for Artifact Reduction in Iterative Low-Dose CT Download

A Comparative Study of OpenACC Implementations Download

A Comparative Study of Parallel Algorithms for the Girth Problem Download

A Comparative Study on ASIC, FPGAs, GPUs and General Purpose Processors in the O(N^2) Gravitational N-body Simulation Download

A comparison between parallelization approaches in molecular dynamics simulations on GPUs Download Package

A Comparison of Algebraic Multigrid Preconditioners using Graphics Processing Units and Multi-Core Central Processing Units Download

A comparison of CPU and GPU performance for Fourier pseudospectral simulations of the Navier-Stokes, Cubic Nonlinear Schrodinger and Sine Gordon Equations Download

A Comparison of CPU and OpenCL Parallelization Methods for Correlation and Graph Layout Algorithms used in the Network Analysis of High Dimensional Data Download Package

A comparison of CPUs, GPUs, FPGAs, and massively parallel processor arrays for random number generation Download

A Comparison of FPGA and GPU for Real-Time Phase-based Optical Flow, Stereo, and Local Image Features

A Comparison of Gradient Estimation Methods for Volume Rendering on Unstructured Meshes Download

A Comparison of Many-threaded Differential Evolution and Genetic Algorithms on CUDA Download

A Comparison of Modern GPU and CPU Architectures: And the Common Convergence of Both Download

A comparison of period finding algorithms Download

A Comparison of Sequential and GPU Implementations of Iterative Methods to Compute Reachability Probabilities Download

A Comparison of Statistical Techniques for Detecting Side-Channel Information Leakage in Cryptographic Devices Download

A Comparison of Two Methods for Geometric Milling Simulation Accelerated by GPU Download

A Comparison of xPU Platforms Exemplified with Ray Tracing Algorithms

A Compile-Time Managed Multi-Level Register File Hierarchy Download

A Compiler and Runtime for Heterogeneous Computing Download

A compiler for high performance computing with many-core accelerators Download

A compiler framework for optimization of affine loop nests for gpgpus Download

A compiler toolkit for array-based languages targeting CPU/GPU hybrid systems Download Package

A Complete Description of the UnPython and Jit4GPU Framework Download

A complete modular resultant algorithm targeted for realization on graphics hardware Download

A comprehensive analysis and parallelization of an image retrieval algorithm Download

A Comprehensive Performance Comparison of CUDA and OpenCL Download

A comprehensive study of Dynamic Memory Management in OpenCL kernels Download

A Computational Comparison of Basis Updating Schemes for the Simplex Algorithm on a CPU-GPU System Download

A Computational Model of Afterimages Download

A computationally efficient and scalable approach for privacy preserving kNN classification Download


Brief statistics for this page

Titles: 100

Download open PDFs: 86

Package packages: 6

Page 1 of 84

* * *

Free GPU computing nodes at

Registered users can now run their OpenCL applications on our free GPU computing nodes. We provide 1 minute of compute time per run on two nodes with two AMD and one nVidia graphics processing units, respectively. There is no limit on the number of runs.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 1055T @ 2.8GHz
  • RAM: 12GB
  • OS: OpenSUSE 11.4
  • SDK: AMD APP SDK 2.8
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 5.0.35, AMD APP SDK 2.8
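
Before uploading a project, it can help to check which OpenCL devices your own code would see on a machine. The following is a minimal sketch, assuming the third-party pyopencl package and an OpenCL runtime are installed; the helper name list_devices is hypothetical and is not part of the service described here. Device indices are reported per OpenCL platform, so they may not match the "GPU device 0/1" numbering in the list above.

```python
"""List OpenCL platforms and devices (assumes the third-party pyopencl package)."""


def list_devices():
    """Return (platform, device index, device name, clock MHz, memory MiB) tuples.

    Returns an empty list when pyopencl or an OpenCL runtime is unavailable.
    """
    try:
        import pyopencl as cl  # assumption: pyopencl is installed
    except ImportError:
        return []

    found = []
    try:
        for platform in cl.get_platforms():
            # Indices restart at 0 for every platform (e.g. AMD vs. nVidia).
            for index, device in enumerate(platform.get_devices()):
                found.append((
                    platform.name.strip(),
                    index,
                    device.name.strip(),
                    device.max_clock_frequency,               # MHz
                    device.global_mem_size // (1024 * 1024),  # bytes -> MiB
                ))
    except cl.Error:
        # No usable platform or driver: return whatever was collected so far.
        return found
    return found


if __name__ == "__main__":
    for row in list_devices():
        print("%s | device %d: %s, %d MHz, %d MiB" % row)
```

On a machine without OpenCL the script prints nothing; otherwise it prints one line per device, which can be compared against the node specifications above.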

Completed OpenCL projects should be uploaded via the User dashboard (see the instructions and example there). The terminal output logs from compilation and execution will be provided to the user.

The information you send will be treated according to our Privacy Policy.

HGPU group © 2010-2014

All rights belong to the respective authors
