high performance computing on graphics processing units: hgpu.org

Posts

Jun, 19

Design of a fully programmable shader processor for low power mobile devices

In this paper, we propose a novel architecture for a general graphics shader processor without dedicated hardware. Mobile devices now require a high-performance graphics processor that is also small and low-power. The proposed shader processor is a GP-GPU (General-Purpose computing on Graphics Processing Units) to execute the whole OpenGL ES […]

OpenGL

Jun, 19

CST: Constructive Solid Trimming for Rendering BReps and CSG

To eliminate the need to evaluate the intersection curves in explicit representations of surface cutouts or of trimmed faces in BReps of CSG solids, we advocate using constructive solid trimming (CST). A CST face is the intersection of a surface with a Blist representation of a trimming CSG volume. We propose a new GPU-based CSG […]

OpenGL

Jun, 18

ClearView: An Interactive Context Preserving Hotspot Visualization Technique

Volume rendered imagery often includes a barrage of 3D information, such as the shape, appearance and topology of complex structures, and thus quickly overwhelms the user. In particular, when focusing on a specific region, a user cannot observe the relationship between various structures without a mental picture of the entire data. In this paper […]

Jun, 18

A Novel Computing-Enhanced Cloud Storage Model Supporting Combined Service Aware

Cloud storage has been increasing in popularity recently due to its ability to deliver virtualized storage on demand over a network. As the amount of digital resources continues to grow at an astounding rate, more and more intelligent devices (such as GPUs) are embedded as computing units to enhance the performance of the storage system. How […]

Jun, 18

Legolizer: A Real-Time System for Modeling and Rendering LEGO Representations of Boundary Models

In this work we propose a method for converting triangular meshes into LEGO bricks through a voxel representation of boundary meshes. We present a novel voxelization approach that uses points sampled from a surface model to define which cubes (voxels) and their associated colors will compose the model. All steps of the algorithm were implemented […]

OpenGL
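The voxelization idea in the Legolizer abstract (sampled surface points decide which voxels are occupied and what color they get) can be illustrated with a minimal sketch. This is not the paper's algorithm, which the truncated abstract does not specify. The function name `voxelize`, the uniform grid, and the choice to average the colors of points falling in the same voxel are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def voxelize(points, colors, voxel_size):
    """Map colored surface samples to occupied voxels (hypothetical helper).

    points: iterable of (x, y, z) floats
    colors: iterable of (r, g, b) values, one per point
    Returns {voxel_index: averaged_color}, where voxel_index is an integer
    (i, j, k) triple. Averaging is an assumption, not the paper's rule.
    """
    # Per-voxel accumulator: [sum_r, sum_g, sum_b, sample_count]
    sums = defaultdict(lambda: [0.0, 0.0, 0.0, 0])
    for (x, y, z), (r, g, b) in zip(points, colors):
        # A point belongs to the grid cell whose index is floor(coordinate / size).
        key = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        acc = sums[key]
        acc[0] += r
        acc[1] += g
        acc[2] += b
        acc[3] += 1
    # A voxel is occupied if at least one sample landed in it.
    return {
        key: (acc[0] / acc[3], acc[1] / acc[3], acc[2] / acc[3])
        for key, acc in sums.items()
    }


points = [(0.1, 0.2, 0.3), (0.4, 0.4, 0.4), (1.5, 0.2, 0.1)]
colors = [(255, 0, 0), (0, 0, 255), (0, 255, 0)]
voxels = voxelize(points, colors, voxel_size=1.0)
```

Here the first two samples share voxel (0, 0, 0) and their colors are averaged, while the third sample occupies voxel (1, 0, 0) on its own. A brick-placement step would then operate on the occupied voxels.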

Jun, 18

Binary Segmentation of Video Sequences in Real Time

We present a method for real-time foreground-background video segmentation that may be used in applications such as background substitution, analysis of surveillance camera footage, and highway car detection. Our approach implements a probabilistic segmentation based on the binary Quadratic Markov Measure Fields models (QMMFs). That framework regularizes the likelihood of each pixel […]

CUDA

Jun, 18

Processing of synthetic Aperture Radar data with GPGPU

Synthetic aperture radar processing is a complex task that involves advanced signal processing techniques and intense computational effort. While the first issue has now reached a mature stage, the question of how to produce accurately focused images in real-time, without mainframe facilities, is still under debate. The recent introduction of general-purpose graphic processing units seems […]

CUDA

Jun, 18

Direct Volume Editing

In this work we present basic methodology for interactive volume editing on GPUs, and we demonstrate the use of these methods to achieve a number of different effects. We present fast techniques to modify the appearance and structure of volumetric scalar fields given on Cartesian grids. Similar to 2D circular brushes as used in surface […]

Jun, 18

Coming Soon: Research in a Cloud

Graphics processing units (GPUs) provide an impressive amount of computing power at an unprecedentedly low cost. The authors describe the main features of HOOMD (highly optimized object oriented molecular dynamics), a software package that makes molecular dynamics simulations on GPUs available to general users. Also, a news sidebar by Joshua A. Anderson and Alex Travesset […]

CUDA

Jun, 18

CUDA-Based Jacobi’s Iterative Method

Solving linear equations is a common problem in science and engineering, and accelerating the solution process is of great significance. Modern GPUs are high-performance many-core processors suited to large-scale parallel computing, and they offer a novel way to accelerate that process. A GPU based parallel Jacobi’s iterative solver for dense […]

CUDA
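For readers unfamiliar with the Jacobi iteration named in the entry above, the following is a minimal CPU sketch in plain Python. It is not the paper's CUDA implementation. The function name `jacobi`, the stopping tolerance, and the 2x2 example system are assumptions chosen for illustration. The point it shows is that each component of the new iterate depends only on the previous iterate, so a GPU version could plausibly assign one thread per row.

```python
def jacobi(a, b, iterations=100, tol=1e-10):
    """Approximately solve a x = b for a dense matrix a (list of rows).

    Convergence is guaranteed when a is strictly diagonally dominant;
    this sketch does not check that condition.
    """
    n = len(b)
    x = [0.0] * n  # initial guess
    for _ in range(iterations):
        # Each x_new[i] uses only values from the previous iterate x,
        # so all n updates are independent of one another.
        x_new = [
            (b[i] - sum(a[i][j] * x[j] for j in range(n) if j != i)) / a[i][i]
            for i in range(n)
        ]
        # Stop once successive iterates agree to within tol.
        if max(abs(x_new[i] - x[i]) for i in range(n)) < tol:
            return x_new
        x = x_new
    return x


# Strictly diagonally dominant example: 4x + y = 6, 2x + 5y = 12.
a = [[4.0, 1.0], [2.0, 5.0]]
b = [6.0, 12.0]
solution = jacobi(a, b)
```

For this system the exact solution is x = 1, y = 2, and the iterates approach it quickly because the matrix is strictly diagonally dominant.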

Jun, 18

Environment Lighting for Point Sampled Geometry

Point sampled geometry has recently gained significant interest due to the tremendous advances in the technology of 3D scanning and the representational simplicity afforded by avoiding any need for explicit connectivity information. Its use in creating high-quality rendered images is, however, still limited. To date, most renderings of point sampled surfaces use the Phong illumination […]

Jun, 18

Interactive Visualization of Molecular Surface Dynamics

Molecular dynamics simulations of proteins play a growing role in various fields such as pharmaceutical, biochemical and medical research. Accordingly, the need for high quality visualization of these protein systems is rising. Highly interactive visualization techniques are especially needed for the analysis of time-dependent molecular simulations. Besides various other molecular representations the surface representations are of […]

OpenGL

Recent source codes

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

LC Framework

pplx-garden: Perplexity open source garden for inference technology

Atlas CLI: Machine Learning (ML) Lifecycle & Transparency Manager

transformers_tvm: Implementation of Encoder Decoder transformer on TVM

OpScanner

INT v.s. FP: A framework to compare low-bit integer and float-point formats

AutoDock-GPU: AutoDock for GPUs and other accelerators

NCCLX: collective communication framework

Tutoring LLM into a Better CUDA Optimizer

Most viewed papers (last 30 days)