high performance computing on graphics processing units: hgpu.org

Sparse direct solvers

Intel Xeon Phi acceleration of Hybrid Total FETI solver

Michal Merta, Lubomir Riha, Ondrej Meca, Alexandros Markopoulos, Tomas Brzobohaty, Tomas Kozubek, Vit Vondrak

Tags: Algorithms, Computer science, Intel Xeon Phi, Laplace and Poisson equation, OpenMP, Package, Sparse direct solvers, Sparse matrix

May 24, 2017 by hgpu

Achieving high-performance with a sparse direct solver on Intel KNL

Emmanuel Agullo, Alfredo Buttari, Mikko Byckling, Abdou Guermouche, Ian Masliah

Tags: Computer science, Energy-efficient computing, Intel Xeon Phi, Sparse direct solvers

March 9, 2017 by hgpu

Reordering strategy for blocking optimization in sparse linear solvers

Gregoire Pichon, Mathieu Faverge, Pierre Ramet, Jean Roman

Tags: Algorithms, Computer science, CUDA, Factorization, Linear Algebra, nVidia, Sparse direct solvers, Tesla M2070

October 15, 2016 by hgpu

A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves

Weifeng Liu, Ang Li, Jonathan Hogg, Iain S. Duff, Brian Vinter

Tags: Algorithms, AMD Radeon R9 Fury X, ATI, Computer science, CUDA, Linear Algebra, nVidia GeForce GTX Titan X, OpenCL, Package, Sparse direct solvers, Tesla K40

June 28, 2016 by hgpu

Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts

Joshua Dennis Booth, Sivasankaran Rajamanickam, Heidi K. Thornquist

Tags: Algorithms, Computer science, Factorization, Intel Xeon Phi, Sparse direct solvers, Sparse matrix

January 22, 2016 by hgpu

Analysis of A Splitting Approach for the Parallel Solution of Linear Systems on GPU Cards

Ang Li, Radu Serban, Dan Negrut

Tags: Computer science, CUDA, Linear Algebra, nVidia, Package, Sparse direct solvers, Tesla K20

September 30, 2015 by hgpu

Composability of parallel codes on heterogeneous architectures

Andra-Ecaterina Hugo

Tags: Computer science, CUDA, Heterogeneous systems, nVidia, Sparse direct solvers, Tesla M2070, Thesis

June 26, 2015 by hgpu

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes

Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, George Bosilca

Tags: Algorithms, Computer science, CUDA, Factorization, Heterogeneous systems, nVidia, Sparse direct solvers, Tesla M2070

January 30, 2014 by hgpu

Scheduling a Parallel Sparse Direct Solver to Multiple GPUs

Kyungjoo Kim

Tags: Computer science, CUDA, Factorization, FEM, Finite element method, Heterogeneous systems, nVidia, Package, Sparse direct solvers, Task scheduling, Tesla M2070

February 21, 2013 by hgpu

A GPU-Based Transient Stability Simulation Using Runge-Kutta Integration Algorithm

Zhijun Qin, Yunhe Hou

Tags: Algorithms, Computer science, CUDA, Differential equations, nVidia, nVidia GeForce GTX 580, Sparse direct solvers

November 7, 2012 by hgpu

Sparse direct solvers with accelerators over DAG runtimes

Xavier Lacoste, Pierre Ramet, Mathieu Faverge, Yamazaki Ichitaro, Jack Dongarra

Tags: Algorithms, Computer science, CUDA, Factorization, Linear Algebra, nVidia, Sparse direct solvers, Tesla T20

May 24, 2012 by hgpu

Accelerating the ANSYS Direct Sparse Solver with GPUs

Geraud P. Krawezik, Gene Poole

Tags: Computer science, CUDA, Linear Algebra, Mixed precision, nVidia, Sparse direct solvers, Tesla C1060

February 22, 2011 by hgpu
