Topi Siro, Ari Harju
We implement the Lanczos algorithm on an Intel Xeon Phi coprocessor and compare its performance to a multi-core Intel Xeon CPU and an NVIDIA graphics processor. The Xeon and the Xeon Phi are parallelized with OpenMP and the graphics processor is programmed with CUDA. The performance is evaluated by measuring the execution time of a […]
View View   Download Download (PDF)   
Joshua A. Anderson, M. Eric Irrgang, Sharon C. Glotzer
We design and implement HPMC, a scalable hard particle Monte Carlo simulation toolkit, and release it open source as part of HOOMD-blue. HPMC runs in parallel on many CPUs and many GPUs using domain decomposition. We employ BVH trees instead of cell lists on the CPU for fast performance, especially with large particle size disparity, […]
View View   Download Download (PDF)   
Jens Glaser, Andrew S. Karas, Sharon C. Glotzer
We present an algorithm to simulate the many-body depletion interaction between anisotropic colloids in an implicit way, integrating out the degrees of freedom of the depletants, which we treat as an ideal gas. Because the depletant particles are statistically independent and the depletion interaction is short-ranged, depletants are randomly inserted in parallel into the excluded […]
Ursula Iturraran-Viveros, Miguel Molero-Armenta
Graphics processing units (GPUs) have become increasingly powerful in recent years. Programs exploring the advantages of this architecture could achieve large performance gains and this is the aim of new initiatives in high performance computing. The objective of this work is to develop an efficient tool to model 2D elastic wave propagation on parallel computing […]
T. Takaki, R. Rojas, M Ohno, T. Shimokawabe, T. Aoki
A GPU code has been developed for a phase-field lattice Boltzmann (PFLB) method, which can simulate the dendritic growth with motion of solids in a dilute binary alloy melt. The GPU accelerated PFLB method has been implemented using CUDA C. The equiaxed dendritic growth in a shear flow and settling condition have been simulated by […]
View View   Download Download (PDF)   
Shay I. Heizler, David A. Kessler
We study the high-velocity regime mode-I fracture instability using large scale simulations. At large driving displacements, the pattern of a single, steady-state crack that propagates in the midline of the sample breaks down, and small microbranches start to appear near the main crack. Some of the features of those microbranches have been reproduced qualitatively in […]
View View   Download Download (PDF)   
Benjamin Trefz, Peter Virnau
Large scale molecular dynamics simulations on graphic processing units (GPUs) are employed to study the scaling behavior of ring polymers with various topological constraints in melts. Typical sizes of rings containing $3_1$, $5_1$ knots and catenanes made up of two unknotted rings scale like $N^{1/3}$ in the limit of large ring sizes $N$. This is […]
View View   Download Download (PDF)   
Markus Manssen, Alexander K. Hartmann
We study the the non-equilibrium ageing behaviour of the +/-J Edwards-Anderson model in three dimensions for samples of size up to N=128^3 and for up to 10^8 Monte Carlo sweeps. In particular we are interested in the change of the ageing when crossing from the spin-glass phase to the ferromagnetic phase. The necessary long simulation […]
View View   Download Download (PDF)   
Matteo Lulli, Massimo Bernaschi, Giorgio Parisi
We present a highly optimized implementation of a Monte Carlo (MC) simulator for the three-dimensional Ising spin-glass model with bimodal disorder, i.e., the 3D Edwards-Anderson model running on CUDA enabled GPUs. Multi-GPU systems exchange data by means of the Message Passing Interface (MPI). The chosen MC dynamics is the classic Metropolis one, which is purely […]
View View   Download Download (PDF)   
I. A. Sadovskyy, A. E. Koshelev, C. L. Phillips, D. A. Karpeev, A. Glatz
Understanding the interaction of vortices with inclusions in type-II superconductors is a major outstanding challenge both for fundamental science and energy applications. At application-relevant scales, the long-range interactions between a dense configuration of vortices and the dependence of their behavior on external parameters, such as temperature and an applied magnetic field, are all important to […]
View View   Download Download (PDF)   
Ezequiel E. Ferrero, Alejandro B. Kolton, Matteo Palassini
We develop a parallel rejection algorithm to tackle the problem of low acceptance in Monte Carlo methods, and apply it to the simulation of the hopping conduction in Coulomb glasses using Graphics Processing Units, for which we also parallelize the update of local energies. In two dimensions, our parallel code achieves speedups of up to […]
Miguel Ibanez Berganza, Pietro Coletti, Alberto Petri
Langer theory of metastability provides a description of the lifetime and properties of the metastable phase of the Ising model field-driven transition, describing the magnetic field-driven transition in ferromagnets and the chemical potential-driven transition of fluids. An immediate further step is to apply it to the study of a transition driven by the temperature, as […]
View View   Download Download (PDF)   
Page 1 of 512345

* * *

* * *

Follow us on Twitter

HGPU group

1666 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

339 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: