Chih-Sheng Lin, Chih-Wei Hsieh, Hsi-Ya Chang, Pao-Ann Hsiung
Recently, heterogeneous system architectures are becoming mainstream for achieving high performance and power efficiency. In particular, many-core graphics processing units (GPUs) now play an important role for computing in heterogeneous architectures. However, for application designers, computational workload still needs to be distributed to heterogeneous GPUs manually and remains inefficient. In this paper, we propose a […]
View View   Download Download (PDF)   
Chih-Sheng Lin
Recently, a hybrid system consisting of general-purpose processors (CPU) and accelerators such as graphic processing units (GPUs) have become mainstream system architecture design for achieving high performance and power efficiency. However, this growing trend is forcing programmers to address issues and challenges in adapting legacy serial programs into heterogeneous parallel programs. To alleviate the burden […]
View View   Download Download (PDF)   
Yiding Han
This dissertation presents research focusing on reshaping the design paradigm of electronic design automation (EDA) applications to embrace the computational throughput of a massively parallel computing architecture. The EDA industry has gone through major evolution in algorithm designs over the past several decades, delivering improved and more sophisticated design tools. Today, these tools provide a […]
View View   Download Download (PDF)   
Shousheng Liu, Ge Chen, Chunyong Ma, Yong Han
The existing matrix palette algorithms for skeletal animation are accelerated by the technique GPGPU based on GLSL or CUDA. Because GLSL is extended from graphics library OpenGL, it couples the rendering and calculations together closely and forces itself not convenient to reuse, meanwhile CUDA is designed only for NVIDIA GPUs. In this paper GPGPU based […]
View View   Download Download (PDF)   
Yuan Zhao, Xinchang Zhang, Zhen Zhang, Lu Wang, Yueming Hu
Because Cellular Automata (CA) is a dynamic system with inherent parallelism, many studies are focused on mapping CA to the parallel system in order to obtain high performance computing capability, such as using clusters, supercomputers and networks of computers. But the application of these systems are too expensive and difficult to use on the occasions […]
View View   Download Download (PDF)   
Xueqin Zhang, Yifeng Zhang, Chunhua Gu
The network anomaly detection technology based on support vector machine (SVM) can efficiently detect unknown attacks or variants of known attacks, however, it cannot be used for detection of large-scale intrusion scenarios due to the demand of computational time. The graphics processing unit (GPU) has the characteristics of multi-threads and powerful parallel processing capability. Based […]
View View   Download Download (PDF)   
Jie Guo, Jingui Pan
Monte Carlo based photorealistic image synthesis has proven to be one of the most flexible and powerful rendering techniques, but is plagued with undesirable artifacts known as Monte Carlo noise. We present an adaptive filtering method designed for Monte Carlo rendering systems that counteracts noise while respecting sharp features. The filter operates as a post-process […]
View View   Download Download (PDF)   
Lukasz Swierczewski
Modern computers have graphics cards with much higher theoretical efficiency than conventional CPU. The paper presents application possibilities GPU CUDA acceleration for encryption of data using the new architecture tailored to the 3DES algorithm, characterized by increased security compared to the normal DES. The algorithm used in ECB mode (Electronic Codebook), in which 64-bit data […]
View View   Download Download (PDF)   
L.F.C.C. Mallens
In recent years, graphics processing units (GPUs) became more and more popular as high performance processing units. Due to the availability of hundreds of cores, code fragments speed up significantly when they are transformed from CPU functions to GPU kernels. The transformation process is non-trivial and therefore error prone. Developing correct and efficient GPU accelerated […]
View View   Download Download (PDF)   
Sulan Zhang, Qingsheng Zhu, Ji Liu, Lingqiu Zeng
When L-systems are applied to large and detailed 3D objects, the inherent serial geometric interpretation limits the speed of image generation. To accelerate the interpreting procedure, a Graphic Processing Unit (GPU) based method utilizing Compute Unified Device Architecture (CUDA) is proposed in this paper. The focused approach involves two phases: first is a sequential scan […]
View View   Download Download (PDF)   
Heru Suhartanto, Arry Yanuar, Ari Wibisono
One of application that needs high performance computing resources is molecular d ynamic. There is some software available that perform molecular dynamic, one of these is a well known GROMACS. Our previous experiment simulating molecular dynamics of Indonesian grown herbal compounds show sufficient speed up on 32 n odes Cluster computing environment. In order to […]
View View   Download Download (PDF)   
F.Pardo, P.Lopez, D. Cabello
Infrared thermography is an attractive technique for non-destructive evaluation processes and particularly for detecting shallowly buried mines. Its use consists of subjecting the area under inspection to a source of natural or artificial heating/cooling process and studying the soil’s response by means of the analysis of its thermal evolution given by a temporal sequence of […]
View View   Download Download (PDF)   
Page 1 of 3123

* * *

* * *

Follow us on Twitter

HGPU group

1665 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

339 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: