high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » A complete modular resultant algorithm targeted for realization on graphics hardware

A complete modular resultant algorithm targeted for realization on graphics hardware

Pavel Emeliyanenko

Max-Planck Institute for Informatics, Saarbrucken, Germany

Proceedings of the 4th International Workshop on Parallel and Symbolic Computation PASCO ’10

DOI:10.1145/1837210.1837219

@conference{emeliyanenko2010complete,

title={A complete modular resultant algorithm targeted for realization on graphics hardware},

author={Emeliyanenko, P.},

booktitle={Proceedings of the 4th International Workshop on Parallel and Symbolic Computation},

pages={35–43},

year={2010},

organization={ACM}

}

Download (PDF)

View

Source

2200

views

This paper presents a complete modular approach to computing bivariate polynomial resultants on Graphics Processing Units (GPU). Given two polynomials, the algorithm first maps them to a prime field for sufficiently many primes, and then processes each modular image individually. We evaluate each polynomial at several points and compute a set of univariate resultants for each prime in parallel on the GPU. The remaining “combine” stage of the algorithm comprising polynomial interpolation and Chinese remaindering is also executed on the graphics processor. The GPU algorithm returns coefficients of the resultant as a set of Mixed Radix (MR) digits. Finally, the large integer coefficients are recovered from the MR representation on the host machine. With the approach of displacement structure [16] and efficient modular arithmetic [8] we have been able to achieve more than 100x speed-up over a CPU-based resultant algorithm from Maple 13.

Tags: Algorithms, Computer science, CUDA, Linear Algebra, nVidia, nVidia GeForce GTX 280

January 5, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

A complete modular resultant algorithm targeted for realization on graphics hardware

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

A complete modular resultant algorithm targeted for realization on graphics hardware

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)