high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia GeForce 8400 GS

Parallel Computation of Functions on Set Partitions

Chetan Rokhade

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce 8400 GS, Thesis

May 25, 2014 by hgpu

Fast Feature Selection in a GPU Cluster Using the Delta Test

Alberto Guillen, M. Isabel Garcia Arenas, Mark van Heeswijk, Dusan Sovilj, Amaury Lendasse, Luis Javier Herrera, Hector Pomares, Ignacio Rojas

Tags: Algorithms, Computer science, CUDA, GPU cluster, Nearest neighbour, nVidia, nVidia GeForce 8400 GS, nVidia GeForce 9800 GTX, nVidia GeForce GTS 450

February 25, 2014 by hgpu

High-Speed Turbo Equalization for GPP-based Software Defined Radios

Michael Schwall, Friedrich K. Jondral

Tags: Algorithms, Filtering, nVidia, nVidia GeForce 8400 GS, OpenCL, Signal processing

December 31, 2013 by hgpu

Hidden Surface Removal Using BSP Tree with CUDA

Murat Uysal, Baha Sen, Canan Celik

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8400 GS, Rendering

July 12, 2013 by hgpu

Evaluating the Performance of Legacy Applications on Emerging Parallel Architectures

Simon John Pennycook

Tags: Algorithms, ATI, ATI FirePro V7800, Benchmarking, Computer science, CUDA, MPI, nVidia, nVidia GeForce 8400 GS, nVidia GeForce 9800 GT, nVidia GeForce GTX 680, OpenCL, Performance, Tesla C1060, Tesla C2050, Thesis

May 21, 2013 by hgpu

Automated Tool to Generate Parallel CUDA code from a Serial C Code

Akhil Jindal, Nikhil Jindal, Divyashikha Sethia

Tags: Code generation, Computer science, CUDA, nVidia, nVidia GeForce 8400 GS

August 2, 2012 by hgpu

A stand-alone Finite Difference Time Domain (FDTD) simulation for Integrated Optoelectronics Laboratory

Sathya Swaroop Ganta

Tags: CUDA, Differential equations, Electrodynamics, FDTD, Finite difference, Finite-difference time-domain, Maxwell's equations, nVidia, nVidia GeForce 8400 GS, Optoelectronics, Python, Tesla C2075, Thesis

July 13, 2012 by hgpu

CUDA Implementation of Parallel Algorithms for Animal Noseprint Identification

Vincent Stanley Dayes

Tags: Algorithms, CUDA, Image processing, Image registration, nVidia, nVidia GeForce 8400 GS, Thesis

May 29, 2012 by hgpu

Algorithm Construction for GPGPU

Mattias Svanstrom, Simon Hossjer

Tags: Algorithms, ATI, ATI Radeon HD 5650, Computer science, Matrix multiplication, nVidia, nVidia GeForce 8400 GS, nVidia GeForce GTX 560 Ti, OpenCL, Sorting

April 19, 2012 by hgpu

Visualization of Pareto Solutions by Spherical Self-Organizing Map and It’s acceleration on a GPU

Masato Yoshimi, Takuya Kuhara, Kaname Nishimoto, Mitsunori Miki, Tomoyuki Hiroyasu

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8400 GS, nVidia GeForce GTX 280, Optimization, Self-organizing map, Tesla C1060, Visualization

April 5, 2012 by hgpu

Acceleration of Solving Maxwell’s Equations Using Cluster of GPUs

E. Arianyan, S. A. Motamedi, M. Hekmatpanah, I. Arianyan

Tags: Algorithms, CUDA, Differential equations, Electrodynamics, FDTD, Finite difference, Finite-difference time-domain, Maxwell's equations, nVidia, nVidia GeForce 8400 GS

March 10, 2012 by hgpu

Optimization of the Particle-based Volume Rendering for GPUs with Hiding Data Transfer Latency

Kyoko Nakao, Erika Matsui, Naoko Yoshii, Masami Takata, Kazuki Joe

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8400 GS, Optimization, Rendering, Tesla C1060

November 30, 2011 by hgpu
