
Posts

Dec, 11

Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive Robotics Research

This paper presents Aquila, a novel open-source software toolkit developed as part of the iTalk and RobotDoC projects. The software provides a range of tools and biologically inspired systems that are useful for cognitive robotics research. Aquila addresses the need for high-performance robot control by adopting the latest parallel processing paradigm based on the NVIDIA CUDA […]
Dec, 11

Gyrokinetic Toroidal Simulations on Leading Multi- and Manycore HPC Systems

The gyrokinetic Particle-in-Cell (PIC) method is a critical computational tool enabling petascale fusion simulation research. In this work, we present novel multi- and manycore-centric optimizations to enhance performance of GTC, a PIC-based production code for studying plasma microturbulence in tokamak devices. Our optimizations encompass all six GTC sub-routines and include multi-level particle and grid decompositions […]
Dec, 11

Accelerating Swarm Intelligence Algorithms with GPU-Computing

Swarm intelligence describes the ability of groups of social animals and insects to exhibit highly organized and complex problem-solving behaviors that allow the group as a whole to accomplish tasks which are beyond the capabilities of any individual. This phenomenon found in nature is the inspiration for swarm intelligence algorithms — systems that utilize the […]
Dec, 11

Fast Face Detection Using Graphics Processor

Fast face detection is one of the key components of various computer vision applications. The Viola-Jones algorithm provides good, fast detection for low- and medium-resolution images. This paper proposes a new and fast approach to perform real-time face detection. The proposed method includes the enhanced Haar-like features and uses SVM for training […]
Dec, 11

A Dynamic Approach to Weighted Suffix Tree Construction Algorithm

At present, the weighted suffix tree is considered one of the most important existing data structures for analyzing molecular weighted sequences. Although a static-partitioning-based parallel algorithm exists for constructing weighted suffix trees, it takes a significant amount of time for very long weighted DNA sequences. However, in our […]
Dec, 11

Generalizing Execution of Vectorizable Computations by Generating Vector Oriented Byte Code

Computer simulations, which are widely used in both academia and industry, often work on very large data sets. This makes them well suited for harvesting the computing power of modern, highly parallel computing systems, such as GPUs, clusters and vector processors. The challenge lies in the fact that these systems must be programmed […]
Dec, 11

Data analysis and 3D evolution in High Energy Physics using graphic processor

One of the main challenges in High Energy Physics (HEP) is the fast analysis of large amounts of experimental and simulated data. For example, the amount of data generated at the Large Hadron Collider (LHC) is estimated to reach 1 petabyte per year. The time taken to analyze the data and to obtain fast results depends on […]
Dec, 11

ALICE HLT High Speed Tracking on GPU

The on-line event reconstruction in ALICE is performed by the High Level Trigger, which should process up to 2000 events per second in proton-proton collisions and up to 300 central events per second in heavy-ion collisions, corresponding to an input data stream of 30 GB/s. In order to fulfill the time requirements, a fast on-line […]
Dec, 11

Evaluating graph coloring on GPUs

This paper evaluates features of graph coloring algorithms implemented on graphics processing units (GPUs), comparing coloring heuristics and thread decompositions. As compared to prior work on graph coloring for other parallel architectures, we find that the large number of cores and relatively high global memory bandwidth of a GPU lead to different strategies for the […]
Dec, 10

Achieving High Throughput Sequencing with Graphics Processing Units

High throughput sequencing has become a powerful technique for genome analysis since the concept was introduced in recent years. Currently, there is a huge demand from patients with genetic diseases that cannot be met because of limited computational power. Though several software packages have been developed using the currently most efficient algorithm to deal with […]
Dec, 10

Particle Simulation on a GPU with PyCUDA

This report is on a small test problem within the context of a larger long-term research project. GPUs are increasingly popular for particle methods, due to the readily apparent parallelism inherent to N-Body problems. Particle-In-Cell is a popular scheme for exploring systems in plasma physics. We hope to explore a small sample problem in order […]
Dec, 10

Real-time intraoperative full-range complex FD-OCT guided cerebral blood vessel identification and brain tumor resection in neurosurgery

This work utilized an ultra-high-speed full-range complex-conjugate-free optical coherence tomography (FD-OCT) system to perform real-time intraoperative imaging to guide two common neurosurgical procedures: cerebral blood vessel identification and brain tumor resection. The cerebral blood vessel identification experiment is conducted ex vivo on a human cadaver specimen. Specific cerebral arteries and veins in different positions […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
