Posts
Dec, 27
HUGO: Hierarchical mUlti-reference Genome cOmpression for aligned reads
BACKGROUND AND OBJECTIVE: Short-read sequencing is becoming the standard of practice for the study of structural variants associated with disease. However, with the growth of sequence data largely surpassing reasonable storage capability, the biomedical community is challenged with the management, transfer, archiving, and storage of sequence data. METHODS: We developed Hierarchical mUlti-reference Genome cOmpression (HUGO), […]
Dec, 27
Finite Element Modelling of Prostate Deformation and Needle-Tissue Interactions
During brachytherapy and biopsy, significant prostate motion (including deformation) can occur, causing the target lesion to move during the procedures. One method to improve the accuracy of needle tip placement during these percutaneous procedures is to use a 3D Finite Element (FE) model to estimate the amount of needle deflection. This model is based on […]
Dec, 27
BbmTTP: Beat-based Parallel Simulated Annealing Algorithm on GPGPUs for the Mirrored Traveling Tournament Problem
The problem of scheduling sports leagues has received considerable attention in recent years, especially since mathematically optimized schedules often have a large impact both economically and environmentally. The Mirrored Traveling Tournament Problem (mTTP) is an optimization problem that represents certain types of sports scheduling where the main objective is to minimize the total distance traveled […]
Dec, 27
Multi-GPU Load Balancing for In-Situ Simulation and Visualization
Multiple-GPU systems have become ubiquitously available due to their support of massive parallel computing and more device memory for large scale problems. Such systems are ideal for In-Situ visualization applications, which require significant computational power for concurrent execution of simulation and visualization. While pipelining based parallel computing scheme overlaps the execution of simulation and rendering […]
Dec, 25
BIDMach: Large-scale Learning with Zero Memory Allocation
This paper describes recent work on the BIDMach toolkit for large-scale machine learning. BIDMach has demonstrated single-node performance that exceeds that of published cluster systems for many common machine-learning task. BIDMach makes full use of both CPU and GPU acceleration (through a sister library BIDMat), and requires only modest hardware (commodity GPUs). One of the […]
Dec, 25
Building Multiclass Nonlinear Classifiers with GPUs
The adoption of multiclass classification strategies that train independent binary classifiers becomes challenging when the goal is to retrieve nonlinear models from large datasets and the process requires several passes through the data. In such scenario, the combined use of a search and score algorithm and GPUs allows to obtain binary classifiers in a reduced […]
Dec, 25
Efficiency analysis of a physical problem: Different parallel computational approaches for a dynamical integrator evolution
A great challenge for scientists is to execute their computational applications efficiently. Nowadays, parallel programming has become a fundamental key to achieve this goal. High-performance computing provides a solution to exploit parallel architectures in order to get optimal performance. Both parallel programming model and the system architecture will maximize the benefits if both together are […]
Dec, 25
Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing
On the work sharing among GPUs and CPU cores on GPU equipped clusters, it is a critical issue to keep load balance among these heterogeneous computing resources. We have been developing a runtime system for this problem on PGAS language named XcalableMP-dev/StarPU [1]. Through the development, we found the necessity of adaptive load balancing for […]
Dec, 25
A Convex Relaxation Approach to Space Time Multi-view 3D Reconstruction
We propose a convex relaxation approach to space-time 3D reconstruction from multiple videos. Generalizing the works [16], [8] to the 4D setting, we cast the problem of reconstruction over time as a binary labeling problem in a 4D space. We propose a variational formulation which combines a photoconsistency based data term with a spatio-temporal total […]
Dec, 24
Graphics Processing Units in Acceleration of Bandwidth Selection for Kernel Density Estimation
The Probability Density Function (PDF) is a key concept in statistics. Constructing the most adequate PDF from the observed data is still an important and interesting scientific problem, especially for large datasets. PDFs are often estimated using nonparametric data-driven methods. One of the most popular nonparametric method is the Kernel Density Estimator (KDE). However, a […]
Dec, 24
Scene Boundary Detection Technique Based on Bottom-Up Attention System and OpenCL Parallel Implementation
This paper spotlights the maintaining of scene boundary detection system in video and process of porting it to the OpenCL. The scene boundary detection algorithm proposed by authors is based on bottom-up focus attention principle. The system builds Gaussian pyramids from input image, calculates map of saliency from the image and then detects the most […]
Dec, 24
Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics
A mechanism for transparent GPU-independent checkpoint-restart of 3D graphics is described. The approach is based on a record-prune-replay paradigm: all OpenGL calls relevant to the graphics driver state are recorded; calls not relevant to the internal driver state as of the last graphics frame prior to checkpoint are discarded; and and the remaining calls are […]

