high performance computing on graphics processing units: hgpu.org

hgpu.org » Tesla

Microbranching in mode-I fracture using large scale simulations of amorphous and perturbed lattice models

Shay I. Heizler, David A. Kessler

View

Download (PDF)

Tags: Condensed matter, CUDA, nVidia, Soft Condensed Matter, Statistical Mechanics, Tesla

April 1, 2015 by hgpu

Enabling Efficient Use of MPI and PGAS Programming Models on Heterogeneous Clusters with High Performance Interconnects

Sreeram Potluri

View

Download (PDF)

Tags: Computer science, CUDA, Heterogeneous systems, Intel Xeon Phi, MPI, nVidia, OpenCL, Tesla, Tesla C2075, Tesla K20, Tesla M2090

September 29, 2014 by hgpu

HSApriori: High Speed Association Rule Mining using Apriori Based Algorithm for GPU

D.William Albert, K.Fayaz, D.Veerabhadra Babu

View

Download (PDF)

Tags: Algorithms, Computer science, Data mining, Java, Machine learning, nVidia, OpenCL, Tesla

August 26, 2014 by hgpu

Understanding the ISA impact on GPU Architecture

Mayank Vinodbhai Kothiya

View

Download (PDF)

Tags: Computer science, CUDA, GPGPU-sim, Hardware Architecture, nVidia, PTX, Tesla, Thesis

July 28, 2014 by hgpu

Multicore Computing: Algorithms, Architectures, and Applications

Sanguthevar Rajasekaran, Lance Fiondella, Mohamed Ahmed, Reda A. Ammar

View

Download (PDF)

Tags: Algorithms, Biology, Book, Computer science, CUDA, Linear Algebra, nVidia, String matching, Tesla

January 6, 2014 by hgpu

Tesla vs. Xeon Phi vs. Radeon A Compiler Writer’s Perspective

Brent Leback, Douglas Miles, Michael Wolfe

View

Download (PDF)

Tags: ATI, Compilers, Computer science, Intel Xeon Phi, nVidia, OpenCL, Performance, Tesla

December 18, 2013 by hgpu

Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology

Hovhannes Bantikyan

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, FFT, nVidia, Security, Tesla

April 7, 2013 by hgpu

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs

Ahmad Lashgar, Amirali Baniasadi, Ahmad Khonsari

View

Download (PDF)

Tags: Computer science, Energy-efficient computing, GPGPU-sim, nVidia, Tesla

January 17, 2013 by hgpu

Acceleration of Monte-Carlo Molecular Simulations on Hybrid Computing Architectures

Claus Braun, Stefan Holst, Hans-Joachim Wunderlich, Juan Manuel Castillo, Joachim Gross

View

Download (PDF)

Tags: CUDA, Hybrid computing, Molecular simulation, nVidia, Physics, Tesla, Thermodynamics

November 5, 2012 by hgpu

GPU-to-GPU and Host-to-Host multipattern string matching on a GPU

Xinyan Zha, Sartaj Sahni

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, String matching, Tesla

August 3, 2012 by hgpu

Linearised inversion with GPUs

Chris Leader, Robert Clapp

View

Download (PDF)

Source codes

Tags: CUDA, Image processing, nVidia, Package, Tesla

July 13, 2012 by hgpu

Affine Vector Cache for memory bandwidth savings

Sylvain Collange, Alexandre Kouyoumdjian

View

Download (PDF)

Tags: Benchmarking, Computer science, CUDA, Memory model, nVidia, Tesla

December 16, 2011 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Microbranching in mode-I fracture using large scale simulations of amorphous and perturbed lattice models

Enabling Efficient Use of MPI and PGAS Programming Models on Heterogeneous Clusters with High Performance Interconnects

HSApriori: High Speed Association Rule Mining using Apriori Based Algorithm for GPU

Understanding the ISA impact on GPU Architecture

Multicore Computing: Algorithms, Architectures, and Applications

Tesla vs. Xeon Phi vs. Radeon A Compiler Writer’s Perspective

Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs

Acceleration of Monte-Carlo Molecular Simulations on Hybrid Computing Architectures

GPU-to-GPU and Host-to-Host multipattern string matching on a GPU

Linearised inversion with GPUs

Affine Vector Cache for memory bandwidth savings

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)