Posts
May, 15
CUDA implementation of the solution of a system of linear equations arising in an hp-Finite Element code
The finite element method (FEM) has proven to be one of the most efficient methods for solving differential equations. FEM codes are designed to run on different computer architectures, and technological improvements over the years have enabled the fast solution of larger and larger problems. Among these improvements, we emphasize the development of the GPU (Graphics Processing Unit). Scientific programming in […]
May, 15
Perturbation Functions in Computer Graphics
The problem of real-time photorealistic imaging is discussed. New techniques for specifying free forms without approximating them by polygons are considered. Free forms based on perturbation functions have the advantage of spline surface representations, namely a high degree of smoothness, and the advantage of arbitrary form for a small number of perturbation […]
May, 15
The Lattice Boltzmann Equation Method for Complex Flows
The lattice Boltzmann equation (LBE) method is a promising technique for simulating fluid flows and modeling complex physics. Because the LBE model is based on microscopic models and mesoscopic kinetic equations, it offers many advantages for the study of multi-component or multiphase flows. However, there are still challenges encountered when dealing with thermal effects and […]
May, 15
Approximative inference for multivariate functional data on massively parallel processors
With continually increasing data sizes, the relevance of the big n problem of classical likelihood approaches is greater than ever. This paper considers functional data, and presents operator approximations, where observations are embedded in function space, and likelihood calculations are carried out in the functional domain. The resulting approximated problems are naturally parallel and can […]
May, 15
Fractal Video Compression in OpenCL: An Evaluation of CPUs, GPUs, and FPGAs as Acceleration Platforms
Fractal compression is an efficient technique for image and video encoding that uses the concept of self-referential codes. Although they offer compression quality that matches or exceeds traditional techniques with a simpler and faster decoding process, fractal techniques have not gained widespread acceptance due to the computationally intensive nature of their encoding algorithm. In this paper, […]
May, 13
Interaction and Visualization Techniques for Immersive Exploration and Perception of 3D datasets
The objective in this case is not only to be realistic, but also to provide new and intelligible ways of representing models. This raises new issues in data perception. The perception of complex data, especially regarding visual feedback, is an open question and the subject of this work. This PhD thesis […]
May, 13
A GPU based real-time video compression method for video conferencing
Recent years have seen a great increase in the everyday use of real-time video communication over the internet through video conferencing applications. Limitations on computational resources and network bandwidth require video encoding algorithms that provide acceptable quality at low bitrates and can support various resolutions within the same stream. In this work, the authors present […]
May, 13
Pedestrian Detection at Warp Speed: Exceeding 500 Detections per Second
Object detection, and in particular pedestrian detection, is a challenging task due to the wide variety of appearances. The application domain is extremely broad, ranging from surveillance to automotive safety systems. Many practical applications, however, require stringent real-time processing speeds combined with high accuracy. These demands are contradictory, and usually a […]
May, 13
Improving Synchronization and Data Access in Parallel Programming Models
Today, parallel architectures are the main vector for exploiting available die area. The shift from architectures tuned for sequential programming models to ones optimized for parallel processing follows from the inability to further enhance sequential performance due to power and memory walls. On the other hand, efficient exploitation of parallel computing units looks a hard […]
May, 13
Programming for scientific computing on peta-scale heterogeneous parallel systems
Peta-scale high-performance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to conduct computational experiments of historic significance, these systems are presently difficult to program. The users, who are domain experts rather than computer experts, prefer to use programming models closer […]
May, 11
A Distributed CPU-GPU Framework for Pairwise Alignments on Large-Scale Sequence Datasets
Several problems in computational biology require all-against-all pairwise comparison of tens of thousands of individual biological sequences. Each such comparison can be performed with the well-known Needleman-Wunsch alignment algorithm. However, with the rapid growth of biological databases, performing all possible comparisons with this algorithm in serial becomes extremely time-consuming. The massive computational power of […]
May, 11
Exploring Computer Vision and Image Processing Algorithms in Teaching Parallel Programming
Computer Vision (CV) is a rapidly growing field, intent on enabling computers to process, analyze, and understand the information in images to produce structured information and/or make decisions. In recent years, interest in computer vision has grown in part as a result of cheaper and more capable cameras, but also largely because of affordable […]