Posts
May, 29
Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms
Hardware-aware design and optimization is crucial in exploiting emerging architectures for PDE-based computational fluid dynamics applications. In this work, we study optimizations aimed at acceleration of OpenFOAM-based applications on emerging hybrid heterogeneous platforms. OpenFOAM uses MPI to provide parallel multi-processor functionality, which scales well on homogeneous systems but does not fully utilize the potential per-node […]
May, 29
Lensed: a code for the forward reconstruction of lenses and sources from strong lensing observations
Robust modelling of strong lensing systems is fundamental to exploit the information they contain about the distribution of matter in galaxies and clusters. In this work, we present Lensed, a new code which performs forward parametric modelling of strong lenses. Lensed takes advantage of a massively parallel ray-tracing kernel to perform the necessary calculations on […]
May, 29
Simple sorting algorithm test based on CUDA
With the development of computing technology, CUDA has become a very important tool. In computer programming, sorting algorithm is widely used. There are many simple sorting algorithms such as enumeration sort, bubble sort and merge sort. In this paper, we test some simple sorting algorithm based on CUDA and draw some useful conclusions.
May, 29
International Conference on Biomedical Signal and Image Processing (ICBIP 2015), 2015
Submission Deadline: 2015-06-30 Topics: • Audio and acoustic signal processing • Bio-imaging and biomedical signal processing • Signal processing education systems • Speech processing • Industry technology tracks • Information forensics and security • Machine learning for signal processing • Localisation and tracking • Multimedia signal processing • Sensor array and multichannel signal • Design […]
May, 28
Particle-in-Cell Laser-Plasma Simulation on Xeon Phi Coprocessors
This paper concerns development of a high-performance implementation of the Particle-in-Cell method for plasma simulation on Intel Xeon Phi coprocessors. We discuss suitability of the method for Xeon Phi architecture and present our experience of porting and optimization of the existing parallel Particle-in-Cell code PICADOR. Direct porting with no code modification gives performance on Xeon […]
May, 28
Revisiting Actor Programming in C++
The actor model of computation has gained significant popularity over the last decade. Its high level of abstraction makes it appealing for concurrent applications in parallel and distributed systems. However, designing a real-world actor framework that subsumes full scalability, strong reliability, and high resource efficiency requires many conceptual and algorithmic additives to the original model. […]
May, 28
Towards Modeling Energy Consumption of Xeon Phi
In the push for exascale computing, energy efficiency is of utmost concern. System architectures often adopt accelerators to hasten application execution at the cost of power. The Intel Xeon Phi co-processor is unique accelerator that offers application designers high degrees of parallelism, energy-efficient cores, and various execution modes. To explore the vast number of available […]
May, 28
Implementing a Photorealistic Rendering System using GLSL
Ray tracing on GPUs is becoming quite common these days. There are many publicly available documents on how to implement basic ray tracing on GPUs for spheres and implicit surfaces. We even have some general frameworks for ray tracing on GPUs. We however hardly find details on how to implement more complex ray tracing algorithms […]
May, 28
Analysis of GPU Parallel Computing based on Matlab
Matlab is very widely used in scientific computing, but Matlab computational efficiency is lower than C language program. In order to improve the computing speed, some toolbox can use GPU to accelerate the computation. This paper describes GPU working principle, our experiments and results analysis of parallel computing by using GPU based on Matlab. Experimental […]
May, 28
Citrix Ready Technical Webinar with NVIDIA
Blow your designers minds by combining 3D apps with XenDesktop and NVIDIA GPUs Got key users with 3D apps? Already familiar with XenDesktop? Want to blow your designers minds by combining 3D apps with Desktop Virtualization? Your key users drive your company IP. Desktop Virtualization with NVIDIA graphics will drive your key users’ productivity. Learn […]
May, 25
DjiNN and Tonic: DNN as a Service and Its Implications for Future Warehouse Scale Computers
As applications such as Apple Siri, Google Now, Microsoft Cortana, and Amazon Echo continue to gain traction, webservice companies are adopting large deep neural networks (DNN) for machine learning challenges such as image processing, speech recognition, natural language processing, among others. A number of open questions arise as to the design of a server platform […]
May, 25
Nucleation of nanoparticles in a coarse grained fluid using OpenCL
In this thesis, the nucleation rate of almost hard spheres in a course-grained fluid is measured to study the effects of an explicit solvent on the nucleation rate. Previous measurements show a discrepancy between physical measurements and simulations, where the latter all used implicit solvents. In this thesis, the fluid is approximated using Stochastic Rotation […]