14510

Posts

Sep, 3

International Conference on Mechanical Engineering and Electrical Systems (ICMES 2015), 2015

All accepted and registered papers will be recommended to publish into the journals below: * Journal of Electronic Science and Technology (JEST, ISSN: 1674-862X) Abstracting/Indexing: SCOPUS; EI INSPECT; included in DOAJ, CA, CSCD, CNKI, Sciencepaper Online. * International Journal of Mechanical Engineering and Robotics Research (IJMERR, ISSN: 2278-0149, DOI: 10.18178) Abstracting/ Indexing: Index Corpernicus, ProQuest, […]
Sep, 3

International Conference on Computer Systems and Instrumentation (ICCSI), 2015

Submission Deadline: 2015.10.05 Submission Method: Please log in the Electronic Submission System (.pdf only) to submit your paper: http://www.easychair.org/conferences/?conf=iccsi2015 Registration: http://www.iccsi.org/reg.html (250-350USD) Topic: (http://www.iccsi.org/cfp.html) Systems theory and practice Systems engineering System Instrumentation Embedded systems and systems-on-the-chip Target-oriented systems [emulation, simulation, prediction, etc.] Specialized systems [sensor-based, mobile, multimedia, biometrics, etc.] Validation systems Security and protection systems […]
Sep, 3

Fast GPU-based calculations in few-body quantum scattering

A principally novel approach towards solving the few-particle (many-dimensional) quantum scattering problems is described. The approach is based on a complete discretization of few-particle continuum and usage of massively parallel computations of integral kernels for scattering equations by means of GPU. The discretization for continuous spectrum of a few-particle Hamiltonian is realized with a projection […]
Sep, 3

EMMA: an AMR cosmological simulation code with radiative transfer

EMMA is a cosmological simulation code aimed at investigating the reionization epoch. It handles simultaneously collisionless and gas dynamics, as well as radiative transfer physics using a moment-based description with the M1 approximation. Field quantities are stored and computed on an adaptive 3D mesh and the spatial resolution can be dynamically modified based on physically-motivated […]
Sep, 3

A Real-time Coherent Dedispersion Pipeline for the Giant Metrewave Radio Telescope

A fully real-time coherent dedispersion system has been developed for the pulsar back-end at the Giant Metrewave Radio Telescope (GMRT). The dedispersion pipeline uses the single phased array voltage beam produced by the existing GMRT software back-end (GSB) to produce coherently dedispersed intensity output in real time, for the currently operational bandwidths of 16 MHz […]
Sep, 3

A Comparison of High-Level Design Tools for SoC-FPGA on Disparity Map Calculation Example

Modern SoC-FPGA that consists of FPGA with embedded ARM cores is being popularized as an embedded vision system platform. However, the design approach of SoC-FPGA applications still follows traditional hardware-software separate workflow, which becomes the barrier of rapid product design and iteration on SoC-FPGA. High-Level Synthesis (HLS) and OpenCL-based system-level design approaches provide programmers the […]
Sep, 3

OpenCL 2.0 for FPGAs using OCLAcc

Designing hardware is a time-consuming and complex process. Realization of both, embedded and high-performance applications can benefit from a design process on a higher level of abstraction. This helps to reduce development time and allows to iteratively test and optimize the hardware design during development, as common in software development. We present our tool, OCLAcc, […]
Sep, 3

Exploiting Hyper-Loop Parallelism in Vectorization to Improve Memory Performance on CUDA GPGPU

Memory performance is of great importance to achieve high performance on the Nvidia CUDA GPU. Previous work has proposed specific optimizations such as thread coarsening, caching data in shared memory, and global data layout transformation. We argue that vectorization based on hyper loop parallelism can be used as a unified technique to optimize the memory […]
Aug, 31

An Asynchronous Event Communication Technique for Soft Real-Time GPGPU Applications

CONTEXT. Interactive GPGPU applications requires low response time feedback from events such as user input in order to provide a positive user experience. Communication of these events must be performed asynchronously as to not cause significant performance penalties. OBJECTIVES. In this study the usage of CPU/GPU shared virtual memory to perform asynchronous communication is explored. […]
Aug, 31

A GPU-accelerated local search algorithm for the Correlation Clustering problem

The solution of the Correlation Clustering (CC) problem can be used as a criterion to measure the amount of balance in signed social networks, where positive (friendly) and negative (antagonistic) interactions take place. Metaheuristics have been used successfully for solving not only this problem, as well as other hard combinatorial optimization problems, since they can […]
Aug, 31

Dynamic Memory Allocation for OpenCL

Heterogeneous systems are computer systems that exploit multiple devices with different processor architectures to improve the computing efficiency by offloading workloads to the device that fits them best. OpenCL is a framework for building portable applications that run across different devices in heterogeneous systems. It has gained traction as a powerful tool for high-performance computing. […]
Aug, 31

Partitioning Large Scale Deep Belief Networks Using Dropout

Deep learning methods have shown great promise in many practical applications, ranging from speech recognition, visual object recognition, to text processing. However, most of the current deep learning methods suffer from scalability problems for large-scale applications, forcing researchers or users to focus on small-scale problems with fewer parameters. In this paper, we consider a well-known […]

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: