Sep, 23

Advanced Optimizations of An Implicit Navier-Stokes Solver on GPGPU

General-purpose computing on graphics processing units (GPGPU) is a massive fine-grain parallel computation platform, which is is particularly attractive for CFD tasks due to its potential of one or two magnitudes of performance improvement with relatively low capital investment. Many successful attempts have been reported in recent years (see, for example [1, 2, 3, 4, […]
Sep, 23

Explicit Integration with GPU Acceleration for Large Kinetic Networks

We demonstrate the first implementation of recently-developed fast explicit kinetic integration algorithms on modern graphics processing unit (GPU) accelerators. Taking as a generic test case a Type Ia supernova explosion with an extremely stiff thermonuclear network having 150 isotopic species and 1604 reactions coupled to hydrodynamics using operator splitting, we demonstrate the capability to solve […]
Sep, 23

Computational Gravitational Dynamics with Modern Numerical Accelerators

We review the recent optimizations of gravitational N-body kernels for running them on graphics processing units (GPUs), on single hosts and massive parallel platforms. For each of the two main N-body techniques, direct summation and tree-codes, we discuss the optimization strategy, which is different for each algorithm. Because both the accuracy as well as the […]
Sep, 23

An optimized GPU implementation of a 2D free surface simulation model on unstructured meshes

This work is related with the implementation of a finite volume method to solve the 2D Shallow Water Equations on Graphic Processing Units (GPU). The strategy is fully oriented to work efficiently with unstructured meshes which are widely used in many fields of Engineering. Due to the design of the GPU cards, structured meshes are […]
Sep, 20

High performance histogramming on massively parallel processors

Histogramming is a technique by which input datasets are mined to extract features and patterns. Histograms have wide range of uses in computer vision, machine learning, database processing, quality control for manufacturing, and many applications benefit from advance knowledge about the distribution of data. Computing a histogram is, essentially, the antithesis of parallel processing. Without […]
Sep, 20

Parallel Hierarchical Clustering on the GPU

Clustering is a basic task in exploratory data analysis. It is used to partition elements of a set into disjoint groups, so-called clusters, such that elements within a group are similar to each other, but dissimilar to elements of other groups. Several clustering algorithms exist, which can be applied depending on the type of dataset […]
Sep, 20

Interactive Ray Tracing with Data Locality Optimizations

Ray tracing denotes a class of rendering algorithms that are well-known for their flexibility and their capability of generating highly realistic images of three dimensional models. However, due to the heavy computational requirements, it has traditionally been used for offline rendering. Improving the performance of ray tracing has been an active area of research and […]
Sep, 20

Parallel Primitive Optimization for GPU and Multicore

This thesis focuses on the use of automatic code generation to combine different classes of optimizations to find the best optimization for parallel reduction in OpenCL on various devices. It also introduces the optimizations used. In the end the results of the combinations will be evaluated and discussed.
Sep, 20

On the evaluation of matrix polynomials using several GPGPUs

Computing a matrix polynomial is the basic process in the calculation of functions of matrices by the Taylor method. One of the most efficient techniques for computing matrix polynomials is based on the Paterson-Stockmeyer method. Inspired by this method, we propose in this work a recursive algorithm and an efficient implementation that exploit the heterogeneous […]
Sep, 19

International Conference on Virtual Reality, ICVR 2015

Submission Deadline: 2015-02-05 Publication: Accepted papers will be published in International Journal of Computer Theory and Engineering (IJCTE,ISSN: 1793-8201,DOI: 10.7763/IJCTE ) Abstracting/Indexing: Electronic Journals Library, EBSCO, Engineering & Technology Digital Library, Google Scholar, INSPEC, Ulrich’s Periodicals Directory, Crossref, ProQuest, WorldCat, and EI (INSPEC, IET). Topics: Immersive gaming 3D interaction for VR/AR/MR Input devices for VR/AR/MR […]
Sep, 19

7th International Conference on Digital Image Processing, ICDIP 2015

Submission Deadline: 2015-02-10 Publication: All accepted papers for the ICDIP 2015 will be published in the conference proceeding by SPIE, which will be included in SPIE Digital Library and indexed by Ei Compendex and ISI proceeding. Topics: Image acquisition Image processing Medical image processing Pattern recognition and analysis Visualization Image coding and compression Super-resolution imagingImage […]
Sep, 19

GPU Technology Conference 2015, GTC 2015

Call for Submissions for NVIDIA’s GPU Technology Conference (GTC), scheduled for March 17-20, 2015 at the San Jose Convention Center in California, is now open and we want to hear from you! At the epicenter of visual computing, GTC is the world’s largest and most important GPU developer conference. If you’re doing creative and groundbreaking […]
Page 10 of 763« First...89101112...203040...Last »

* * *

* * *

Like us on Facebook

HGPU group

171 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1282 peoples are following HGPU @twitter

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: