high performance computing on graphics processing units: hgpu.org

Posts

Feb, 10

GLSV: Graphics library stereo vision for OpenGL

This work proposes the development of an auxiliary library for use with OpenGL, to facilitate the creation of graphic applications incorporating stereoscopic representation. This library, christened graphics library stereo vision (GLSV), is designed to remove all calculations involving knowledge of stereo vision theory from the task performed by the programmer without the latter having to […]

OpenGL

Feb, 10

Scaling LAPACK panel operations using parallel cache assignment

In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high performance Level 3 BLAS. The Level 3 BLAS have excellent weak scaling, but panel processing tends to be bus bound, and thus scales with bus speed rather […]

Feb, 10

Random-access rendering of general vector graphics

We introduce a novel representation for random-access rendering of antialiased vector graphics on the GPU, along with efficient encoding and rendering algorithms. The representation supports a broad class of vector primitives, including multiple layers of semitransparent filled and stroked shapes, with quadratic outlines and color gradients. Our approach is to create a coarse lattice in […]

Feb, 10

Precomputed Atmospheric Scattering

We present a new and accurate method to render the atmosphere in real time from any viewpoint from ground level to outer space, while taking Rayleigh and Mie multiple scattering into account. Our method reproduces many effects of the scattering of light, such as the daylight and twilight sky color and aerial perspective for all […]

OpenGL

Feb, 10

Comparing FPGAs to Graphics Accelerators and the Playstation 2 Using a Unified Source Description

Field programmable gate arrays (FPGAs), graphics processing units (GPUs) and Sony’s Playstation 2 vector units offer scope for hardware acceleration of applications. We compare the performance of these architectures using a unified description based on A Stream Compiler (ASC) for FPGAs, which has been extended to target GPUs and PS2 vector units. Programming these architectures […]

Feb, 10

Cg: a system for programming graphics hardware in a C-like language

The latest real-time graphics architectures include programmable floating-point vertex and fragment processors, with support for data-dependent control flow in the vertex processor. We present a programming language and a supporting system that are designed for programming these stream processors. The language follows the philosophy of C, in that it is a hardware-oriented, general-purpose language, rather […]

Feb, 10

International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware, CIGPU 2011

The fourth International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware (CIGPU 2011) will be held as a workshop in the GECCO-2011 conference in Dublin 12-16 July 2011. CIGPU 2011 is the fourth workshop on the use of GPUs, games consoles and other consumer hardware for evolutionary algorithms and other computational […]

Feb, 9

25th International Conference on Supercomputing, ICS’11

ICS (International Conference on Supercomputing) is the premier international forum for the presentation of research results in high-performance computing systems.Papers are solicited on all aspects of research, development, and application of large-scale, high-performance experimental and commercial systems. The list of topics includes (but not limited to): Computationally challenging scientific and commercial applications, particularly studies and […]

Feb, 9

9th International Conference on Parallel Processing and Applied Mathematics, PPAM 2011

The PPAM 2011 conference, ninth in a series, will cover topics in parallel and distributed processing, including theory and applications, as well as applied mathematics. The focus will be on models, algorithms, and software tools which facilitate efficient and convenient utilization of modern parallel and distributed computing architectures, as well as on large-scale applications, and […]

Feb, 9

3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy

Recently we have developed an algorithm for reconstructing volumetric images and extracting 3D tumor motion information from a single x-ray projection. We have demonstrated its feasibility using a digital respiratory phantom with regular breathing patterns. In this work, we present a detailed description and a comprehensive evaluation of the improved algorithm. The algorithm was improved […]

CUDA

Feb, 9

High-precision molecular dynamics simulation of UO2-PuO2: superionic transition in uranium dioxide

Our series of articles is devoted to high-precision molecular dynamics simulation of mixed actinide-oxide (MOX) fuel in the rigid ions approximation using high-performance graphics processors (GPU). In this article we assess the 10 most relevant interatomic sets of pair potential (SPP) by reproduction of the Bredig superionic phase transition (anion sublattice premelting) in uranium dioxide. […]

CUDA

Feb, 9

High-precision molecular dynamics simulation of UO2-PuO2: pair potentials comparison

Our series of articles is devoted to high-precision molecular dynamics simulation of mixed actinide-oxide (MOX) fuel in the rigid ions approximation using high-performance graphics processors (GPU). In the first article we assess 10 most relevant interatomic sets of pair potentials (SPP) by reproduction of solid phase properties of uranium dioxide (UO2) – temperature dependences of […]

CUDA

high performance computing on graphics processing units: hgpu.org

Posts

GLSV: Graphics library stereo vision for OpenGL

Scaling LAPACK panel operations using parallel cache assignment

Random-access rendering of general vector graphics

Precomputed Atmospheric Scattering

Comparing FPGAs to Graphics Accelerators and the Playstation 2 Using a Unified Source Description

Cg: a system for programming graphics hardware in a C-like language

International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware, CIGPU 2011

25th International Conference on Supercomputing, ICS’11

9th International Conference on Parallel Processing and Applied Mathematics, PPAM 2011

3D tumor localization through real-time volumetric x-ray imaging for lung cancer radiotherapy

High-precision molecular dynamics simulation of UO2-PuO2: superionic transition in uranium dioxide

High-precision molecular dynamics simulation of UO2-PuO2: pair potentials comparison

Recent source codes

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

LC Framework

pplx-garden: Perplexity open source garden for inference technology

Atlas CLI: Machine Learning (ML) Lifecycle & Transparency Manager

transformers_tvm: Implementation of Encoder Decoder transformer on TVM

OpScanner

INT v.s. FP: A framework to compare low-bit integer and float-point formats

AutoDock-GPU: AutoDock for GPUs and other accelerators

NCCLX: collective communication framework

Tutoring LLM into a Better CUDA Optimizer

Most viewed papers (last 30 days)