high performance computing on graphics processing units: hgpu.org

Posts

Feb, 10

International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware, CIGPU 2011

The fourth International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware (CIGPU 2011) will be held as a workshop in the GECCO-2011 conference in Dublin 12-16 July 2011. CIGPU 2011 is the fourth workshop on the use of GPUs, games consoles and other consumer hardware for evolutionary algorithms and other computational […]

Feb, 9

25th International Conference on Supercomputing, ICS’11

ICS (International Conference on Supercomputing) is the premier international forum for the presentation of research results in high-performance computing systems.Papers are solicited on all aspects of research, development, and application of large-scale, high-performance experimental and commercial systems. The list of topics includes (but not limited to): Computationally challenging scientific and commercial applications, particularly studies and […]

Feb, 9

9th International Conference on Parallel Processing and Applied Mathematics, PPAM 2011

The PPAM 2011 conference, ninth in a series, will cover topics in parallel and distributed processing, including theory and applications, as well as applied mathematics. The focus will be on models, algorithms, and software tools which facilitate efficient and convenient utilization of modern parallel and distributed computing architectures, as well as on large-scale applications, and […]

Feb, 9

3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy

Recently we have developed an algorithm for reconstructing volumetric images and extracting 3D tumor motion information from a single x-ray projection. We have demonstrated its feasibility using a digital respiratory phantom with regular breathing patterns. In this work, we present a detailed description and a comprehensive evaluation of the improved algorithm. The algorithm was improved […]

CUDA

Feb, 9

High-precision molecular dynamics simulation of UO2-PuO2: superionic transition in uranium dioxide

Our series of articles is devoted to high-precision molecular dynamics simulation of mixed actinide-oxide (MOX) fuel in the rigid ions approximation using high-performance graphics processors (GPU). In this article we assess the 10 most relevant interatomic sets of pair potential (SPP) by reproduction of the Bredig superionic phase transition (anion sublattice premelting) in uranium dioxide. […]

CUDA

Feb, 9

High-precision molecular dynamics simulation of UO2-PuO2: pair potentials comparison

Our series of articles is devoted to high-precision molecular dynamics simulation of mixed actinide-oxide (MOX) fuel in the rigid ions approximation using high-performance graphics processors (GPU). In the first article we assess 10 most relevant interatomic sets of pair potentials (SPP) by reproduction of solid phase properties of uranium dioxide (UO2) – temperature dependences of […]

CUDA

Feb, 9

Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units – Radial Distribution Function Histogramming

The calculation of radial distribution functions (RDFs) from molecular dynamics trajectory data is a common and computationally expensive analysis task. The rate limiting step in the calculation of the RDF is building a histogram of the distance between atom pairs in each trajectory frame. Here we present an implementation of this histogramming scheme for multiple […]

CUDA

Feb, 9

Accelerating urban fast response Lagrangian dispersion simulations using inexpensive graphics processor parallelism

Owing to the potential consequences associated with accidental or deliberate releases of chemical or biological agents in urban areas, fast response urban dispersion models must rapidly provide solutions that can be easily analyzed by researchers and emergency responders. In this paper, we describe a novel application of an existing Lagrangian dispersion modeling system to achieve […]

Feb, 9

Next-generation acceleration and code optimization for light transport in turbid media using GPUs

A highly optimized Monte Carlo (MC) code package for simulating light transport is developed on the latest graphics processing unit (GPU) built for general-purpose computing from NVIDIA – the Fermi GPU. In biomedical optics, the MC method is the gold standard approach for simulating light transport in biological tissue, both due to its accuracy and […]

CUDA

Feb, 9

Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Good old on-line back-propagation for plain multi-layer perceptrons yields a very low 0.35% error rate on the famous MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images, and graphics cards to greatly speed up learning.

CUDA

Feb, 9

Deep, Big, Simple Neural Nets for Handwritten Digit Recognition

Good old online backpropagation for plain multilayer perceptrons yields a very low 0.35% error rate on the MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images to avoid overfitting, and graphics cards to greatly speed up learning. Good […]

CUDA

Feb, 9

Real-time GPU color-based segmentation of football players

In this paper, we propose a multi-camera application capable of processing high resolution images and extracting features based on colors patterns over graphic processing units (GPU). The goal is to work in real time under the uncontrolled environment of a sport event like a football match. Since football players are composed for diverse and complex […]

CUDA

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Posts

International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware, CIGPU 2011

25th International Conference on Supercomputing, ICS’11

9th International Conference on Parallel Processing and Applied Mathematics, PPAM 2011

3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy

High-precision molecular dynamics simulation of UO2-PuO2: superionic transition in uranium dioxide

High-precision molecular dynamics simulation of UO2-PuO2: pair potentials comparison

Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units – Radial Distribution Function Histogramming

Accelerating urban fast response Lagrangian dispersion simulations using inexpensive graphics processor parallelism

Next-generation acceleration and code optimization for light transport in turbid media using GPUs

Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Deep, Big, Simple Neural Nets for Handwritten Digit Recognition

Real-time GPU color-based segmentation of football players

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)