13192

Posts

Nov, 29

Parallel kNN on GPU Architecture Using OpenCL

In data mining applications, one of the useful algorithms for classification is the kNN algorithm. The kNN search has a wide usage in many research and industrial domains like 3-dimensional object rendering, content-based image retrieval, statistics, biology (gene classification), etc. In spite of some improvements in the last decades, the computation time required by the […]
Nov, 25

4th International Conference on Software and Computer Applications, ICSCA 2015

Submission Deadline: 2015-04-10 Topics: Software Engineering AI and Knowledge based software engineering Artificial Intelligence Aspect-orientation and feature interaction Business Process Reengineering & Science Communication Systems and Networks Component-Based Software Engineering Computer & Software Engineering Computer Animation and Design Contents Computer Game Development, User Modeling and Management Computer supported cooperative work Cost Modeling and Analysis Data […]
Nov, 25

Improving GPU Performance by Regrouping CPU-Memory Data

In order to fast effective analysis of large complex systems, high-performance computing is essential. NVIDIA Compute Unified Device Architecture (CUDA)-assisted central processing unit (CPU) / graphics processing unit (GPU) computing platform has proven its potential to be used in high-performance computing. In CPU/GPU computing, original data and instructions are copied from CPU main memory to […]
Nov, 25

A Self-Optimizing Framework for Developing Metrology Software on Massive Parallel Processor Architectures

Standard PC hardware rapidly increases in parallel computing power in form of multicore CPUs and general purpose GPUs. To take advantage of this situation it is necessary to create specialized code. This is a very time consuming and therefore an expensive task. One approach on solving this problem is the OpenCL (Open Computing Language) standard. […]
Nov, 25

Anisotropic interfacial tension, contact angles, and line tensions: A graphics-processing-unit-based Monte Carlo study of the Ising model

As a generic example for crystals where the crystal-fluid interface tension depends on the orientation of the interface relative to the crystal lattice axes, the nearest neighbor Ising model on the simple cubic lattice is studied over a wide temperature range, both above and below the roughening transition temperature. Using a thin film geometry $L_x […]
Nov, 25

Ageing at the Spin-Glass/Ferromagnet Transition: Monte Carlo Simulation using GPUs

We study the the non-equilibrium ageing behaviour of the +/-J Edwards-Anderson model in three dimensions for samples of size up to N=128^3 and for up to 10^8 Monte Carlo sweeps. In particular we are interested in the change of the ageing when crossing from the spin-glass phase to the ferromagnetic phase. The necessary long simulation […]
Nov, 25

Scalable Verification Techniques for Data-Parallel Programs

This thesis is about scalable formal verification techniques for software. A verification technique is scalable if it is able to scale to reasoning about real (rather than synthetic or toy) programs. Scalable verification techniques are essential for practical program verifiers. In this work, we consider three key characteristics of scalability: precision, performance and automation. We […]
Nov, 21

4th International Conference on Computer Technology and Science, ICCTS 2015

Publication: Selected submission paper will be recommended to publish into one of the journals below: *IJCTE: Abstracting/Indexing: Index Copernicus,Electronic Journals Library, EBSCO, Engineering & Technology Digital Library, Google Scholar, Ulrich’s Periodicals Directory, Crossref, ProQuest, WorldCat, and EI (INSPEC, IET), Cabell’s Directories. *IJCCE: Abstracting/Indexing: EI (INSPEC, IET), Google Scholar, Engineering & Technology Digital Library, ProQuest, and […]
Nov, 20

A GPU-based framework for efficient image processing

This thesis tries to answer how to design a framework for image processing on the GPU, supporting the common environments OpenGL GLSL, OpenCL and CUDA. An generalized view of GPU image processing is presented. The framework is called gpuip and is implemented in C++ but also wrapped with Python-bindings. The framework is cross-platform and works […]
Nov, 20

Using CUDA architecture for computer simulations of thermomechanical phenomena

This paper presents a simulation of the casting solidification process performed on graphics processors compatible with nVidia CUDA architecture. Indispensable for the parallel implementation of a computer simulation of the solidification process, it was necessary to modify the numerical model. The new approach shown in this paper allows the process of matrix building to be […]
Nov, 20

Automatic Performance Tuning of Pipeline Patterns for Heterogeneous Parallel Architectures

Heterogeneous parallel architectures combining conventional multicore CPUs with GPUs and other types of accelerators promise significant performance gains compared to homogeneous systems. However, exploiting the full potential of such systems is becoming more and more challenging often forcing programmers to combine different programming models and parallelization strategies. A promising approach to coping with the increased […]
Nov, 20

CL2QCD – Lattice QCD based on OpenCL

We present the Lattice QCD application CL2QCD, which is based on OpenCL and can be utilized to run on Graphic Processing Units as well as on common CPUs. We focus on implementation details as well as performance results of selected features. CL2QCD has been successfully applied in LQCD studies at finite temperature and density and […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: