high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia Quadro FX 2000

vSMC: Parallel Sequential Monte Carlo in C++

Yan Zhou

View

Download (PDF)

Source codes

Tags: Algorithms, Bayesian, Mathematical Software, Mathematics, nVidia, nVidia Quadro FX 2000, OpenCL, Package, Physics, Signal processing, Statistics

July 1, 2013 by hgpu

Cropped Quad-Tree Based Solid Object Colouring with CUDA

Abdullah Cavusoglu, Baha Sen, Caner Ozcan, Salih Gorgunoglu

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 560 Ti, nVidia Quadro FX 2000, OpenGL, Rendering

June 30, 2013 by hgpu

A Distributed CPU-GPU Framework for Pairwise Alignments on Large-Scale Sequence Datasets

Da Li, Kittisak Sajjapongse, Huan Truong, Gavin Conant, Michela Becchi

View

Download (PDF)

Tags: Bioinformatics, Biology, Computational biology, CUDA, GPU cluster, Heterogeneous systems, MPI, nVidia, nVidia Quadro FX 2000, Sequence alignment, Tesla K20

May 11, 2013 by hgpu

An implementation for quad-tree based solid object coloring using CUDA

Baha Sen, Caner Ozcan, Nesrin Aydin Atasoy

View

Download (PDF)

Tags: 3D Graphics and Realism, Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 560 Ti, nVidia Quadro FX 2000

January 9, 2013 by hgpu

Interactive Refactoring for GPU Parallelization of Affine Loops

Kostadin Damevski, Madhan Muralimanohar

View

Download (PDF)

Tags: Code generation, Computer science, CUDA, Heterogeneous systems, nVidia, nVidia Quadro FX 2000

January 7, 2013 by hgpu

Parallelisation of Shallow Water Simulation for Heterogeneous Architectures

Michail Emmanouil Pappas

View

Download (PDF)

Tags: CUDA, Fluid dynamics, nVidia, nVidia Quadro FX 2000, OpenCL, Thesis

December 18, 2012 by hgpu

Using Graphics Processing Units to Parallelize the FDK Algorithm for Tomographic Image Reconstruction

Joel Sanchez Dominguez, Luiz Fernando de Oliveira, Nilton Alves Junior, Joaquim Teixeira de Assis

View

Download (PDF)

Tags: Computed tomography, CT, CUDA, Image reconstruction, Medicine, nVidia, nVidia GeForce 9400 M, nVidia Quadro FX 2000

November 18, 2012 by hgpu

Teaching Parallel Programming Models on a Shallow-Water Code

Alexander Breuer, Michael Bader

View

Download (PDF)

Source codes

Tags: CUDA, Education, Finite volume method, Fluid dynamics, MPI, nVidia, nVidia Quadro FX 2000, OpenMP, Package

July 8, 2012 by hgpu

Fast and accurate digital signal processing realized with GPGPU technology

Adam Dabrowski, Pawel Pawlowski, Mateusz Stankiewicz, Filip Misiorek

View

Download (PDF)

Tags: CUDA, nVidia, nVidia Quadro FX 2000, nVidia Quadro FX 580, Signal processing

May 26, 2012 by hgpu

An Introduction to the OpenCL Programming Model

Jonathan Thompson, Kristofer Schlachter

View

Download (PDF)

Tags: ATI, ATI Radeon HD 5850, ATI Radeon HD 6750 M, Computer science, Matrix multiplication, nVidia, nVidia Quadro FX 2000, OpenCL, Overview, Tutorial

May 16, 2012 by hgpu

Real-Time Ultrasound Biomicroscopy with Optoacoustic Arrays

Ya Shu

View

Download (PDF)

Tags: CUDA, Data acquisition, Microscopy, nVidia, nVidia Quadro FX 2000, Signal processing, Thesis, Ultrasound

January 24, 2012 by hgpu

Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines

Jason Spencer

View

Download (PDF)

Tags: Computer science, Computer vision, CUDA, nVidia, nVidia Quadro FX 2000, Optimization, Pattern recognition

November 8, 2011 by hgpu

Specx: Speculative task-based runtime system

Specx: a C++ task-based runtime system for heterogeneous distributed architectures

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

KISim: Kubernetes Intelligent Scheduling Simulator

KIS-S: A GPU-Aware Kubernetes Inference Simulator with RL-Based Auto-Scaling

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

Libra: Synergizing CUDA and Tensor Cores for High-Performance Sparse Matrix Multiplication

exa-AMD: Exascale Accelerated Materials Discovery

Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

No More Shading Languages: Compiling C++ to Vulkan Shaders

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

vSMC: Parallel Sequential Monte Carlo in C++

Cropped Quad-Tree Based Solid Object Colouring with CUDA

A Distributed CPU-GPU Framework for Pairwise Alignments on Large-Scale Sequence Datasets

An implementation for quad-tree based solid object coloring using CUDA

Interactive Refactoring for GPU Parallelization of Affine Loops

Parallelisation of Shallow Water Simulation for Heterogeneous Architectures

Using Graphics Processing Units to Parallelize the FDK Algorithm for Tomographic Image Reconstruction

Teaching Parallel Programming Models on a Shallow-Water Code

Fast and accurate digital signal processing realized with GPGPU technology

An Introduction to the OpenCL Programming Model

Real-Time Ultrasound Biomicroscopy with Optoacoustic Arrays

Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines

Recent source codes

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

KISim: Kubernetes Intelligent Scheduling Simulator

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

Most viewed papers (last 30 days)