3485

ARKCoS: Artifact-Suppressed Accelerated Radial Kernel Convolution on the Sphere

Franz Elsner, Benjamin D. Wandelt
Institut d’Astrophysique de Paris, UMR 7095, CNRS – Universite Pierre et Marie Curie (Univ Paris 06), 98 bis blvd
arXiv:1104.0672 [astro-ph.IM] (4 Apr 2011)

@article{2011arXiv1104.0672E,

   author={Elsner}, F. and {Wandelt}, B.~D.},

   title={"{ARKCoS: Artifact-Suppressed Accelerated Radial Kernel Convolution on the Sphere}"},

   journal={ArXiv e-prints},

   archivePrefix={"arXiv"},

   eprint={1104.0672},

   primaryClass={"astro-ph.IM"},

   keywords={Astrophysics – Instrumentation and Methods for Astrophysics, Astrophysics – Cosmology and Extragalactic Astrophysics},

   year={2011},

   month={apr},

   adsurl={http://adsabs.harvard.edu/abs/2011arXiv1104.0672E},

   adsnote={Provided by the SAO/NASA Astrophysics Data System}

}

Download Download (PDF)   View View   Source Source   

1151

views

We describe a hybrid Fourier/direct space convolution algorithm for compact radial (azimuthally symmetric) kernels on the sphere. For high resolution maps covering a large fraction of the sky, our implementation takes advantage of the inexpensive massive parallelism afforded by consumer graphics processing units (GPUs). Applications involve modeling of instrumental beam shapes in terms of compact kernels, computation of fine-scale wavelet transformations, and optimal filtering for the detection of point sources. Our algorithm works for any pixelization where pixels are grouped into isolatitude rings. Even for kernels that are not bandwidth limited, ringing features are completely absent on an ECP grid. We demonstrate that they can be highly suppressed on the popular HEALPix pixelization, for which we develop a freely available implementation of the algorithm. As an example application, we show that running on a high-end consumer graphics card our method speeds up beam convolution for simulations of a characteristic Planck high frequency instrument channel by two orders of magnitude compared to the commonly used HEALPix implementation on one CPU core while maintaining at typical a fractional RMS accuracy of about 1 part in 10^5.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: