high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia GeForce 9800 GX2

A model of dynamic compilation for heterogeneous compute platforms

Andrew Kerr

View

Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 280, OpenCL, PTX, Thesis

June 29, 2013 by hgpu

MATLAB and Python for GPU Computing

Jose Unpingco, Juan Carlos Chaves

View

Download (PDF)

Tags: Computer science, CUDA, Image processing, nVidia, nVidia GeForce 9800 GTS, nVidia GeForce 9800 GX2, Python, Signal processing, Tesla C1060, Tesla C2050

February 15, 2013 by hgpu

An Environment to Support GPU and Multicore Programming for Rapid, High Performance, Application Deployment

James Laurence Brock

View

Download (PDF)

Tags: ATI, ATI Radeon HD 5870, Computer science, CUDA, Heterogeneous systems, Image reconstruction, Monte Carlo simulation, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 560 Ti, OpenCL, Tesla C1060, Tesla S1070, Thesis

October 26, 2012 by hgpu

Task Performance with List-Mode Data

Luca Caucci

View

Download (PDF)

Tags: Algorithms, Computer science, Data acquisition, Image reconstruction, Medicine, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 295, Tesla C1060, Tesla C2050, Thesis, Tomography

September 23, 2012 by hgpu

Performance Analysis on Several GPU Architectures of an Algorithm for Noise Removal

M. G. Sanchez, V. Vidal, J. Bataller, G. Verdu

View

Download (PDF)

Tags: Algorithms, CUDA, Image processing, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GT 120, Signal denoising, Tesla M2050

September 1, 2012 by hgpu

Performance models for CUDA streams on NVIDIA GeForce series

Juan Gomez-Luna, Jose Maria Gonzalez-Linares, Jose Ignacio Benavides, Nicolas Guil

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GTS, nVidia GeForce 9800 GX2, nVidia GeForce GTX 260, nVidia GeForce GTX 280, nVidia GeForce GTX 480, nVidia GeForce GTX 580, Performance

July 9, 2012 by hgpu

Accelerating Sparse Matrix Vector Multiplication on Many-Core GPUs

Weizhi Xu, Zhiyong Liu, Dongrui Fan, Shuai Jiao, Xiaochun Ye, Fenglong Song, Chenggang Yan

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 295, Performance, Sparse matrix

March 21, 2012 by hgpu

How well do STARLAB and NBODY compare? II: Hardware and accuracy

P. Anders, H. Baumgardt, E. Gaburov, S. Portegies Zwart

View

Download (PDF)

Tags: Algorithms, Astrophysics, Cosmology and Extragalactic Astrophysics, Galaxy Astrophysics, GPU cluster, Instrumentation and Methods for Astrophysics, N-body simulation, nVidia, nVidia GeForce 9800 GX2

January 30, 2012 by hgpu

A New Approach to rCUDA

Jose Duato, Antonio J. Pena, Federico Silla, Juan C. Fernandez, Rafael Mayo, Enrique S. Quintana-Orti

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia GeForce 9800 GX2, Virtualization

January 23, 2012 by hgpu

Parallel unmixing of remotely sensed hyperspectral images on commodity graphics processing units

Sergio Sanchez, Abel Paz, Gabriel Martin, Antonio Plaza

View

Download (PDF)

Tags: CUDA, Image processing, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 275, Tesla C1060

January 13, 2012 by hgpu

Methodology of control and supervision of web connected mobile robots with CUDA technology application

Janusz Bedkowski, Andrzej Maslowski

View

Download (PDF)

Tags: Algorithms, Artificial intelligence, Computer science, CUDA, nVidia, nVidia GeForce 9800 GX2, nVidia GeForce GTX 280, nVidia Quadro FX 1600 M, nVidia Quadro FX 3700

December 25, 2011 by hgpu

The Optimization of Algorithms in the Process of Temporal Data Mining Using the Compute Unified Device Architecture

Alexandru Pirjan

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, Data mining, MapReduce, nVidia, nVidia GeForce 8800 GTS, nVidia GeForce 9800 GX2, nVidia GeForce GTX 280, nVidia GeForce GTX 480, Optimization, Review

November 24, 2011 by hgpu

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

KIS-S: A GPU-Aware Kubernetes Inference Simulator with RL-Based Auto-Scaling

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing

ParEval: A Parallel Code Evaluation Benchmark

ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

Libra: Synergizing CUDA and Tensor Cores for High-Performance Sparse Matrix Multiplication

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

No More Shading Languages: Compiling C++ to Vulkan Shaders

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

A model of dynamic compilation for heterogeneous compute platforms

MATLAB and Python for GPU Computing

An Environment to Support GPU and Multicore Programming for Rapid, High Performance, Application Deployment

Task Performance with List-Mode Data

Performance Analysis on Several GPU Architectures of an Algorithm for Noise Removal

Performance models for CUDA streams on NVIDIA GeForce series

Accelerating Sparse Matrix Vector Multiplication on Many-Core GPUs

How well do STARLAB and NBODY compare? II: Hardware and accuracy

A New Approach to rCUDA

Parallel unmixing of remotely sensed hyperspectral images on commodity graphics processing units

Methodology of control and supervision of web connected mobile robots with CUDA technology application

The Optimization of Algorithms in the Process of Temporal Data Mining Using the Compute Unified Device Architecture

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)