high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia GeForce 8600 GT

Research on the fast Fourier transform of image based on GPU

Feifei Shen, Zhenjian Song, Congrui Wu, Jiaqi Geng, Qingyun Wang

Tags: Algorithms, FFT, Image processing, Mathematical Software, nVidia, nVidia GeForce 8600 GT

June 1, 2015 by hgpu

GPU Computations in Heterogeneous Grid Environments

Marcus Hinders

Tags: ATI, ATI Radeon HD 5870, Biology, Computational biology, Computer science, Grid, Heterogeneous systems, nVidia, nVidia GeForce 8600 GT, nVidia GeForce GTX 280, OpenCL, Thesis

October 30, 2011 by hgpu

Optimizing a Near-duplicate Document Detection System with SIMD Technologies

Xinpan Yuan, Jun Long, Hao Zhang, Zuping Zhang, Weihua Gui

Tags: Algorithms, Clustering, Computer science, nVidia, nVidia GeForce 8600 GT, OpenCL, Optimization

October 24, 2011 by hgpu

Parallel programming with NVIDIA CUDA

Alejandro Segovia

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce 8600 GT, Package, Tutorial

September 11, 2011 by hgpu

Exploring reconfigurable architectures for explicit finite difference option pricing models

Qiwei Jin, David B. Thomas, Wayne Luk

Tags: CUDA, Finance, Finite difference, FPGA, nVidia, nVidia GeForce 8600 GT, Tesla C1060

August 3, 2011 by hgpu

Articulated object tracking by rendering consistent appearance parts

Zachary Pezzementi, Sandrine Voros, Gregory D. Hager

Tags: Computer science, Filtering, GLSL, Machine learning, nVidia, nVidia GeForce 8600 GT, nVidia GeForce 8800 GTX, OpenGL, Rendering

July 30, 2011 by hgpu

View-Dependent Real-Time Rendering of Large Outdoor Scenes

Xiangkun Zhao, Fengxia Li, Yufeng Chen, Shouyi Zhan

Tags: Clustering, Computer science, Hierarchical clustering, nVidia, nVidia GeForce 8600 GT, Real-time graphics, Rendering

July 28, 2011 by hgpu

High-performance bankruptcy prediction model using Graphics Processing Units

Bernardete Ribeiro, Noel Lopes, Catarina Silva

Tags: Finance, Neural networks, nVidia GeForce 8600 GT, nVidia GeForce GTX 280, Package

July 16, 2011 by hgpu

Option pricing with multi-dimensional quadrature architectures

Anson H.T. Tse, David B. Thomas, Wayne Luk

Tags: Finance, FPGA, nVidia, nVidia GeForce 8600 GT, Performance, Tesla C1060

July 14, 2011 by hgpu

Design Exploration of Quadrature Methods in Option Pricing

Anson H. T. Tse, David Thomas, Wayne Luk

Tags: CUDA, Finance, FPGA, nVidia, nVidia GeForce 8600 GT, Tesla C1060

June 14, 2011 by hgpu

Accelerating biomedical signal processing algorithms with parallel programming on graphic processor units

Evdokimos I. Konstantinidis, Christos A. Frantzidis, Lazaros Tzimkas, Costas Pappas, Panagiotis D. Bamidis

Tags: CUDA, Medicine, nVidia, nVidia GeForce 8600 GT, nVidia GeForce 9600 GT, nVidia GeForce GTS 250, Signal processing

June 6, 2011 by hgpu

PyCUDA and PyOpenCL: A Scripting-Based Approach to GPU Run-Time Code Generation

Andreas Klöckner, Nicolas Pinto, Yunsup Lee, Bryan Catanzaro, Paul Ivanov, Ahmed Fasih

Tags: Code generation, Computer science, CUDA, High-level Languages, nVidia, nVidia GeForce 8600 GT, nVidia GeForce 9400 M, nVidia GeForce GTX 295, nVidia GeForce GTX 480, OpenCL, Package, PyOpenCL, Python, Software Engineering, Tesla C1060

March 30, 2011 by hgpu

Recent source codes

Code examples for paper on SYCL backend of Kokkos - IWOCL 2024

Experiences with implementing Kokkos’ SYCL backend

ROCm's implementation of Gromacs

GROMACS on AMD GPU-Based HPC Platforms: Using SYCL for Performance and Portability

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 seconds

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs
