K. A. Hawick
Hypercubic storage layout and transforms in arbitrary dimensions using GPUs and CUDA
Hawick, K. A.; Playne, D. P.
Authors
D. P. Playne
Abstract
Many simulations in the physical sciences are expressed in terms of rectilinear arrays of variables. It is attractive to develop such simulations for use in 1-, 2-, 3- or arbitrary physical dimensions and also in a manner that supports exploitation of data-parallelism on fast modern processing devices. We report on data layouts and transformation algorithms that support both conventional and data-parallel memory layouts. We present our implementations expressed in both conventional serial C code as well as in NVIDIA's Compute Unified Device Architecture concurrent programming language for use on general purpose graphical processing units. We discuss: general memory layouts; specific optimizations possible for dimensions that are powers-of-two and common transformations, such as inverting, shifting and crinkling. We present performance data for some illustrative scientific applications of these layouts and transforms using several current GPU devices and discuss the code and speed scalability of this approach.
Citation
Hawick, K. A., & Playne, D. P. (2011). Hypercubic storage layout and transforms in arbitrary dimensions using GPUs and CUDA. Concurrency and Computation: Practice and Experience, 23(10), 1027-1050. https://doi.org/10.1002/cpe.1628
Journal Article Type | Article |
---|---|
Online Publication Date | Aug 28, 2010 |
Publication Date | 2011-07 |
Deposit Date | Nov 13, 2014 |
Journal | Concurrency And Computation-Practice & Experience |
Print ISSN | 1532-0626 |
Publisher | Wiley |
Peer Reviewed | Peer Reviewed |
Volume | 23 |
Issue | 10 |
Pages | 1027-1050 |
DOI | https://doi.org/10.1002/cpe.1628 |
Keywords | Theoretical Computer Science; Computer Networks and Communications; Computational Theory and Mathematics; Software; Computer Science Applications |
Public URL | https://hull-repository.worktribe.com/output/470690 |
Contract Date | Nov 13, 2014 |
You might also like
Computational methods for finding long simple cycles in complex networks
(2017)
Journal Article
Exploiting graphical processing units for data-parallel scientific applications
(2009)
Journal Article
Regular lattice and small-world spin model simulations using CUDA and GPUs
(2010)
Journal Article
Parallel graph component labelling with GPUs and CUDA
(2010)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search