Nov, 5

Graphics Processing Unit-Based Computer-Aided Design Algorithms for Electronic Design Automation

This dissertation presents research focusing on reshaping the design paradigm of electronic design automation (EDA) applications to embrace the computational throughput of a massively parallel computing architecture. The EDA industry has gone through major evolution in algorithm designs over the past several decades, delivering improved and more sophisticated design tools. Today, these tools provide a […]
Nov, 5

GPU Acceleration of k-Nearest Neighbor Search in Face Classifier based on Eigenfaces

Face recognition is a specialized case of object recognition, and has broad applications in security, surveillance, identity management, law enforcement, human-computer interaction, and automatic photo and video indexing. Because human faces occupy a narrow portion of the total image space, specialized methods are required to identify faces based on subtle differences. One such method is […]
Nov, 5

Parallelization techniques of the x264 video encoder

Higher video quality is demanded by the users of any kind of video stream service, including web applications, High Definition broadcast terrestrial services, etc. All of those video streams are encoded first using a compression format, one of them is H.264/MPEG-4 AVC. The main issue is that the better the quality of the video the […]
Nov, 5

Highly optimized simulations on single- and multi-GPU systems of 3D Ising spin glass

We present a highly optimized implementation of a Monte Carlo (MC) simulator for the three-dimensional Ising spin-glass model with bimodal disorder, i.e., the 3D Edwards-Anderson model running on CUDA enabled GPUs. Multi-GPU systems exchange data by means of the Message Passing Interface (MPI). The chosen MC dynamics is the classic Metropolis one, which is purely […]
Nov, 5

A GPU-Based Wide-Band Radio Spectrometer

The Graphics Processing Unit (GPU) has become an integral part of astronomical instrumentation, enabling high-performance online data reduction and accelerated online signal processing. In this paper, we describe a wide-band reconfigurable spectrometer built using an off-the-shelf GPU card. This spectrometer, when configured as a polyphase filter bank (PFB), supports a dual-polarization bandwidth of up to […]
Nov, 3

Profiling of Data-Parallel Processors

Profiling data can help to improve an application with respect to various objectives like execution time, energy consumption or even thermal sensor placement for an upcoming device. This survey reviews state-of-the-art profiling tools for dataparallel processors like Nsight, PAPI and TAU as well as Lynx. Additionally, the attained knowledge is utilized to detect the bottleneck […]
Nov, 3

A Fast Poisson Solver with Periodic Boundary Conditions for GPU Clusters in Various Configurations

Fast Poisson solvers using the Fast Fourier Transform on uniform grids are especially suited for parallel implementation, making them appropriate for portability on graphical processing unit (GPU) devices. The goal of the following work was to implement, test, and evaluate a fast Poisson solver for periodic boundary conditions for use on a variety of GPU […]
Nov, 3

Bounds on the Energy Consumption of Computational Kernels

As computing devices evolve with successive technology generations, many machines target either the mobile or high-performance computing/datacenter environments. In both of these form factors, energy consumption often represents the limiting factor on hardware and software efficiency. On mobile devices, limitations in battery technology may reduce possible hardware capability due to a tight energy budget. On […]
Nov, 3

A GPU-based Framework for Real-time Free Viewpoint Television

Thesis addresses two main problems of Free Viewpoint TV: generation of arbitrary viewpoint in real-time and its delivery to end-user. For the first problem a GPU-based algorithm capable of generating free viewpoints from a network of fixed HD video cameras was developed. We used a space-sweep algorithm to estimate depth information. The view generation sub-system […]
Nov, 3

Performance Optimization Using Partitioned SpMV on GPUs and Multicore CPUs

This paper presents a sparse matrix partitioning strategy to improve the performance of SpMV on GPUs and multicore CPUs. This method has wide adaptability for different types of sparse matrices, and is different from existing methods which only adapt to some particular sparse matrices. In addition, our partitioning method can obtain dense blocks by analyzing […]
Nov, 3

International Workshop on Pattern Recognition, ICOPR 2015

Submission Deadline: 2015-03-01 Publication: All papers for the ICOPR 2015 will be published in the IJSPS (ISSN: 2315-4535) as one volume, and will be indexed by Ulrich’s Periodicals Directory, Google Scholar, EBSCO, Engineering & Technology Digital Library, etc. Topics: Track 1: Computer Vision Vision sensors Early vision Low-level vision Biologically motivated vision Illumination and reflectance […]
Nov, 3

4th International Conference on Software and Information Engineering, ICSIE 2015

Submission Deadline: 2015-03-01 Publication: Selected submission paper will be recommended to publish into one of the journals below: * Journal of Lecture Notes on Software Engineering (LNSE, ISSN: 2301-3559) Abstracting/ Indexing: EI (INSPEC, IET), DOAJ, Electronic Journals Library, Engineering & Technology Digital Library, Ulrich’s Periodicals Directory, International Computer Science Digital Library (ICSDL), ProQuest and Google […]
Page 4 of 768« First...23456...102030...Last »

* * *

* * *

Like us on Facebook

HGPU group

184 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1311 peoples are following HGPU @twitter

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: