Posts
Sep, 5
On GPU-Accelerated Fast Direct Solvers and Their Applications in Image Denoising
This dissertation focuses on block cyclic reduction (BCR) type fast direct solvers, graphics processing unit (GPU) computation, and image denoising. The fast direct solvers are specialized methods for solving certain types of linear systems. They take into account specific characteristics of the system and are therefore able to solve the system much more efficiently than […]
Sep, 3
Advanced Simulation Library: Expanding software ecosystem for the DSP/FPGA/GPU market
Advanced Simulation Library is a free and open source multiphysics simulation software package and a tool for solving Partial Differential Equations. It has significant user base across many areas of engineering and science, from both industrial and academic organizations. ASL utilizes only the methods that allow efficient parallelization: Lattice Boltzmann Methods, Explicit Finite Difference, Matrix […]
Sep, 3
International Conference on Mechanical Engineering and Electrical Systems (ICMES 2015), 2015
All accepted and registered papers will be recommended to publish into the journals below: * Journal of Electronic Science and Technology (JEST, ISSN: 1674-862X) Abstracting/Indexing: SCOPUS; EI INSPECT; included in DOAJ, CA, CSCD, CNKI, Sciencepaper Online. * International Journal of Mechanical Engineering and Robotics Research (IJMERR, ISSN: 2278-0149, DOI: 10.18178) Abstracting/ Indexing: Index Corpernicus, ProQuest, […]
Sep, 3
International Conference on Computer Systems and Instrumentation (ICCSI), 2015
Submission Deadline: 2015.10.05 Submission Method: Please log in the Electronic Submission System (.pdf only) to submit your paper: http://www.easychair.org/conferences/?conf=iccsi2015 Registration: http://www.iccsi.org/reg.html (250-350USD) Topic: (http://www.iccsi.org/cfp.html) Systems theory and practice Systems engineering System Instrumentation Embedded systems and systems-on-the-chip Target-oriented systems [emulation, simulation, prediction, etc.] Specialized systems [sensor-based, mobile, multimedia, biometrics, etc.] Validation systems Security and protection systems […]
Sep, 3
OpenCL 2.0 for FPGAs using OCLAcc
Designing hardware is a time-consuming and complex process. Realization of both, embedded and high-performance applications can benefit from a design process on a higher level of abstraction. This helps to reduce development time and allows to iteratively test and optimize the hardware design during development, as common in software development. We present our tool, OCLAcc, […]
Sep, 3
Exploiting Hyper-Loop Parallelism in Vectorization to Improve Memory Performance on CUDA GPGPU
Memory performance is of great importance to achieve high performance on the Nvidia CUDA GPU. Previous work has proposed specific optimizations such as thread coarsening, caching data in shared memory, and global data layout transformation. We argue that vectorization based on hyper loop parallelism can be used as a unified technique to optimize the memory […]
Sep, 3
Fast GPU-based calculations in few-body quantum scattering
A principally novel approach towards solving the few-particle (many-dimensional) quantum scattering problems is described. The approach is based on a complete discretization of few-particle continuum and usage of massively parallel computations of integral kernels for scattering equations by means of GPU. The discretization for continuous spectrum of a few-particle Hamiltonian is realized with a projection […]
Sep, 3
EMMA: an AMR cosmological simulation code with radiative transfer
EMMA is a cosmological simulation code aimed at investigating the reionization epoch. It handles simultaneously collisionless and gas dynamics, as well as radiative transfer physics using a moment-based description with the M1 approximation. Field quantities are stored and computed on an adaptive 3D mesh and the spatial resolution can be dynamically modified based on physically-motivated […]
Sep, 3
A Real-time Coherent Dedispersion Pipeline for the Giant Metrewave Radio Telescope
A fully real-time coherent dedispersion system has been developed for the pulsar back-end at the Giant Metrewave Radio Telescope (GMRT). The dedispersion pipeline uses the single phased array voltage beam produced by the existing GMRT software back-end (GSB) to produce coherently dedispersed intensity output in real time, for the currently operational bandwidths of 16 MHz […]
Sep, 3
A Comparison of High-Level Design Tools for SoC-FPGA on Disparity Map Calculation Example
Modern SoC-FPGA that consists of FPGA with embedded ARM cores is being popularized as an embedded vision system platform. However, the design approach of SoC-FPGA applications still follows traditional hardware-software separate workflow, which becomes the barrier of rapid product design and iteration on SoC-FPGA. High-Level Synthesis (HLS) and OpenCL-based system-level design approaches provide programmers the […]
Aug, 31
A parallel algorithm for implicit depletant simulations
We present an algorithm to simulate the many-body depletion interaction between anisotropic colloids in an implicit way, integrating out the degrees of freedom of the depletants, which we treat as an ideal gas. Because the depletant particles are statistically independent and the depletion interaction is short-ranged, depletants are randomly inserted in parallel into the excluded […]
Aug, 31
An Asynchronous Event Communication Technique for Soft Real-Time GPGPU Applications
CONTEXT. Interactive GPGPU applications requires low response time feedback from events such as user input in order to provide a positive user experience. Communication of these events must be performed asynchronously as to not cause significant performance penalties. OBJECTIVES. In this study the usage of CPU/GPU shared virtual memory to perform asynchronous communication is explored. […]