11720

Posts

Mar, 18

2014 4th International Conference on Information Communication and Management, ICICM 2014

2014-08-01 All accepted paper will be published in the Lecture Notes on Information Theory (LNIT, ISSN: 2301-3788, www.lnit.org ),which will be indexed by Ulrich’s Periodicals Directory, EBSCO, Engineering & Technology Digital Library, Crossref and Electronic Journals Digital Library. Information Engineering Artificial Intelligence Bioinformatics Software Engineering VLSI Design and Fabrication Photonic Technologies Parallel and Distributed Computing […]
Mar, 18

Optimising OpenCL kernels for the ARM Mali-T600 GPUs

OpenCL is a relatively young industry-backed standard API that aims to provide functional portability across systems equipped with computational accelerators such as GPUs: a standard-conforming OpenCL program can be executed on any standard-conforming OpenCL implementation. OpenCL, however, does not address the issue of performance portability: transforming an OpenCL program to achieve higher performance on one […]
Mar, 18

A Unified Approach for Registration and Depth in Depth from Defocus

Depth from Defocus (DFD) suggests a simple optical set-up to recover the shape of a scene through imaging with shallow depth of field. Although numerous methods have been proposed for DFD, less attention has been paid to the particular problem of alignment between the captured images. The inherent shift-variant defocus often prevents standard registration techniques […]
Mar, 18

Programming Frameworks for Distributed Smartphone Computing

In this thesis we described two frameworks for distributed smartphone computing, one for applications with compute intensive tasks and another one for applications that take contextual sensor information into account. Both frameworks provide a common structure for the development of distributed smartphone applications, thereby extending the possible distribution model options for distributed smartphone applications. Both […]
Mar, 18

Dynamic Load Balancing using Graphics Processors

To get maximum performance on the many-core graphics processors, it is important to have an even balance of the workload so that all processing units contribute equally to the task at hand. This can be hard to achieve when the cost of a task is not known beforehand and when new sub-tasks are created dynamically […]
Mar, 18

Towards a Unified Sentiment Lexicon Based on Graphics Processing Units

This paper presents an approach to create what we have called a Unified Sentiment Lexicon (USL). This approach aims at aligning, unifying, and expanding the set of sentiment lexicons which are available on the web in order to increase their robustness of coverage. One problem related to the task of the automatic unification of different […]
Mar, 17

Prototyping methodology of image processing applications on heterogeneous parallel systems

The work presented in this thesis takes place in a context of growing demand for image and video applications on parallel embedded systems. The limitations and lack of flexibility of current design with parallel embedded systems make increasingly complicated to implement applications, particularly on heterogeneous systems. But Open Computing Language (OpenCL) is a new framework […]
Mar, 17

An optimized algorithm for discrete element system analysis using CUDA

In this paper a parallel computing algorithm for discrete element systems is presented. The discrete model is consisted of finite elements and contacts among the elements. The algorithm is realized using C++ and CUDA and was optimized for NVIDIA GPUs. As a result, the performance of the GPU code is 43 times faster than the […]
Mar, 17

Shape Transformation of Multidimensional Density Functions using Distribution Interpolation of the Radon Transforms

In this paper, we extend 1D distribution interpolation to 2D and 3D by using the Radon transform. Our algorithm is fundamentally different from previous shape transformation techniques, since it considers the objects to be interpolated as density distributions rather than level sets of Implicit Functions (IF). First, we perform distribution interpolation on the precalculated Radon […]
Mar, 17

CPU/GPGPU/HW comparison of an Eigenfaces face recognition system

In this Master Thesis it has been established the specifications for developing a face recognition system in a variety of platforms at the same time: MATLAB running in a personal computer, C code in an embedded microprocessor (MicroBlaze), a simpler reconfigurable hardware for an FPGA-based platform, a flexible hardware for higher performance, and finally a […]
Mar, 17

Minimal models for finite particles in fluctuating hydrodynamics

This thesis is devoted to the development of efficient numerical solvers for fluctuating hydrodynamics, in particular, for flows with immersed particles. In the first part of the thesis we develop numerical solvers able to work in a broad number of flow regimes with a high computational performance. To derive thermodynamically consistent set of equations in […]
Mar, 15

Multi-GPU cluster wave propagation and OpenGL visualization

The inherent issues of properly deploying finite difference calculations onto GPUs are described and solutions are suggested. A speedup of 60x is achieved over the CPU version. Four visualization methods were implemented using OpenGL and compared in terms of the clarity of their visual result. A combination of hedgehogs and slices was deemed to give […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: