Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml
and set future: false
.
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Consortium for the Advancement of Scientific Software
Advancing scientific discovery through software stewardship
Kokkos C++ Performance Portability Programming Ecosystem
Bridging the gap: Portable performance in HPC
xSDK: Extreme-scale Scientific Software Development Kit
Working toward a Community Software Ecosystem
publications
ArborX: A Performance Portable Geometric Search Library
Published in ACM Transactions on Mathematical Software (TOMS), 2020
This paper is about fixing template issue #693.
Recommended citation: Lebrun-Grandie et al. (2020). "ArborX: A Performance Portable Geometric Search Library." ACM Transactions on Mathematical Software. 47(1).
Download Paper
Enabling particle applications for exascale computing platforms
Published in The International Journal of High Performance Computing Applications (IJHPCA), 2021
This paper is about fixing template issue #693.
Recommended citation: Mniszewski SM, Belak J, et al. (2021). "Enabling particle applications for exascale computing platforms." The International Journal of High Performance Computing Applications. 35(6).
Download Paper
The Kokkos Ecosystem: Comprehensive Performance Portability for High Performance Computing
Published in IEEE Computing in Science & Engineering (CiSE), 2021
This paper is about fixing template issue #693.
Recommended citation: Trott et al. (2021). "The Kokkos Ecosystem: Comprehensive Performance Portability for High Performance Computing." Computing in Science & Engineering. 23(5).
Download Paper
Kokkos 3: Programming Model Extensions for the Exascale Era
Published in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022
This paper is about fixing template issue #693.
Recommended citation: Trott CR, Lebrun-Grandie DT, et al. (2022). "Kokkos 3: Programming Model Extensions for the Exascale Era." IEEE Transactions on Parallel and Distributed Systems. 33(4).
Download Paper
Cabana: A Performance Portable Library for Particle-Based Simulations
Published in The Journal of Open Source Software (JOSS), 2022
This paper is about fixing template issue #693.
Recommended citation: Slattery SR, Reeve ST, et al. (2022). "Cabana: A Performance Portable Library for Particle-Based Simulations." Journal of Open Source Software. 7(72).
Download Paper
A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs
Published in ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Bordeaux, France, 2023
This paper is about fixing template issue #693.
Recommended citation: Prokopenko A, Sao P, Lebrun-Grandie D (2022). "A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs." ICPP '22: Proceedings of the 51st International Conference on Parallel Processing.
Download Paper
Application Experiences on a GPU-Accelerated Arm-based HPC Testbed
Published in Proceedings of the HPC Asia 2023 Workshops, 2023
This paper is about fixing template issue #693.
Recommended citation: Elwasif et al. (2023). "Application Experiences on a GPU-Accelerated Arm-based HPC Testbed." Proceedings of the HPC Asia 2023 Workshops.
Download Paper
The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned
Published in IWOMP 2023 : 19th International Workshop on OpenMP, 2023
This paper is about fixing template issue #693.
Recommended citation: Gayatri et al. (2023). "The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned." IWOMP 2023 : 19th International Workshop on OpenMP.
Download Paper
Fast tree-based algorithms for DBSCAN for low-dimensional data on GPUs
Published in ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing, Salt Lake City, Utah, USA, 2023
This paper is about fixing template issue #693.
Recommended citation: Prokopenko A, Lebrun-Grandie D, Arndt A (2023). "Fast tree-based algorithms for DBSCAN for low-dimensional data on GPUs." ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing.
Download Paper
PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU
Published in ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing, Gotland, Sweden, 2023
This paper is about fixing template issue #693.
Recommended citation: Sao P, Prokopenko A, Lebrun-Grandie D (2024). "PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU." ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing.
Download Paper
Experiences with implementing Kokkos’ SYCL backend
Published in IWOCL '24: Proceedings of the 12th International Workshop on OpenCL and SYCL, 2024
This paper is about fixing template issue #693.
Recommended citation: Arndt A, Lebrun-Grandie D, Trott C (2024). "Experiences with implementing Kokkos’ SYCL backend." IWOCL '24: Proceedings of the 12th International Workshop on OpenCL and SYCL.
Download Paper
Advances in ArborX to support exascale applications
Published in The International Journal of High Performance Computing Applications (IJHPCA), 2024
This paper is about fixing template issue #693.
Recommended citation: Prokopenko A, Lebrun-Grandie D, et al. (2024). "Advances in ArborX to support exascale applications." The International Journal of High Performance Computing Applications. 39(1).
Download Paper
talks
Kokkos tutorial at ATPESC 2022
Published:
I was invited to present a Kokkos tutorial in the “Programming Models and Languages” track at the Argonne Training Program on Extreme-Scale Computing (ATPESC 2022). The tutorial was attended by over 70 participants and parked engaging questions and discussion, including a lively panel session. Recordings are available on YouTube: Part 1, Part 2, Pannel.
Paving the way to exascale for scientific applications
Published:
I gave the opening keynote at the 2022 IEEE/ACM International Workshop on Hierarchical Parallelism for Exascale Computing, during SC22.
Sustainability through standardization
Published:
CSE 6230: HPC Tools and Applications – Spring ’23
Published:
I delivered a guest lecture on the Kokkos programming model for high-performance computing to students in Georgia Tech’s CSE 6230: HPC Tools and Applications during Spring 2023.
Overcoming Today’s Limitations of Standard C++ with Kokkos
Published:
I was invited speaker at SOS-25, a prestigious workshop on distributed supercomputing. I shared insights on how scientists are leveraging Kokkos to overcome the limitations of standard C++ and harness the power of modern supercomputers
Kokkos case study: ArborX
Published:
I was invited to share my experience with Kokkos and its application in ArborX for performance portable geometric algorithms at the STFC Hartree Centre (UK), as one of a select group of US representatives.
Kokkos tutorial at EuroTUG 2023 in Delft, Netherlands
Published:
I taught a introductory tutorial on Kokkos and Kokkos Kernels with Luc (@lucbv) at the European Trilinos User Group Meeting hosted by TU Delft.
CExA Kokkos training days in Saclay, France
Published:
Christian (@crtrott) and I were invited to CEA Saclay, France, for the CExA project kickoff, where we delivered a presentation and conducted a three-day training session to bootstrap the new Kokkos developer team and application representatives.
Welcome to C++ 23, the “pandemic” edition and C++ NEXT in 2026
Published:
At SC23, I was invited to join the ‘C++ 23, the “Pandemic” Edition and C++ NEXT in 2026’ panel, where we explored the significant features of C++23 and the roadmap for C++26, with a focus on its impact on HPC.
The Kokkos ecosystem - Sustaining performance portability at the exascale era
Published:
Thrilled to share the Kokkos team’s vision for exascale computing at SC23! Christian and I presented at the DOE Booth, discussing how Kokkos is empowering applications to reach new heights of performance.
CSE 6230: HPC Tools and Applications – Spring ’24
Published:
I was invited to return to Georgia Tech for a guest lecture in Spring 2024.
C++ 23/26: What’s Next?
Published:
At ISC 2024, I chaired the ‘C++ for HPC’ panel, which focused on the significant new features in C++23 and the ongoing development of C++26 for high-performance computing environments.
Device Agnostic Programming Models Training Series - Kokkos
Published:
I conducted Kokkos training for NCI (Australia), focusing on enabling performance portability across diverse hardware architectures through practical, hands-on experience.
CExA Kokkos training days in Saclay, France
Published:
Luc (@lucbv) and I traveled to CEA Saclay (France) to deliver a 3-day training session with the CExA team.
CASS BoF Days: Kokkos Ecosystem - State of the Union
Published:
I organized and led a virtual Kokkos Birds-of-a-Feather session for CASS, the Consortium for the Advancement of Scientific Software.
PPPL Center for Advanced Computation Tutorials: A short Kokkos introduction
Published:
I presented remotely a short introduction to Kokkos at PPPL’s Center for Advanced Computation Tutorials. PPPL is renowned for its work on XGC, a gyrokinetic particle-in-cell code crucial for simulating the edge region of magnetically confined thermonuclear fusion plasmas. XGC utilizes Kokkos and Cabana to achieve high performance on leadership-class supercomputers.
Kokkos in Japan: A Tutorial at Kyushu University
Published:
Christian and I had the pleasure of presenting a one-day Kokkos tutorial at Kyushu University in Japan. The workshop was organized by the HPC OSS1 group of the PC Cluster Consortium (PCCC). You can find more details about the workshop on the PCCC website: https://www.pccluster.org/ja/event/2025/01/250127-hpc-oss-ws.html
High-Performance Computing (HPC) Open-Source Software (OSS) ↩
Sustainable HPC Software: Lessons from the Trenches (A Maintainer’s Perspective)
Published:
I was invited to present at the prestigious, invitation-only SOS27 workshop, themed “Fostering Innovation at Scale Beyond the Flops.” My presentation, “Sustainable HPC Software: Lessons from the Trenches (A Maintainer’s Perspective),” contributed to the session on “HPC Software Sustainability and Stewardship.” I shared firsthand experiences and insights gained from maintaining and leading the Kokkos project for over five years, addressing the critical challenges and solutions in ensuring the longevity and effectiveness of HPC software in today’s rapidly evolving landscape.
Kokkos Best Practices: Integrating Kokkos into Your Project
Published:
I was pleased to present a session on “Kokkos Best Practices” as part of the CExA Kokkos Tea-Time talk series. This talk focused on practical strategies for effectively integrating Kokkos into projects using CMake. I discussed key considerations such as managing Kokkos through external installations versus in-project builds, and provided detailed guidance on the recommended initialization and finalization procedures for Kokkos within libraries and applications. This session aimed to equip developers with the tools and knowledge necessary to leverage Kokkos efficiently and effectively in their work.
wg21
P0009: MDSPAN
Published:
2022-07 p0009r18
P1673: A free function linear algebra interface based on the BLAS
Published:
2023-12 p1673r13
P1684: mdarray: An Owning Multidimensional Array Analog of mdspan
Published:
2023-01 p1684r4
P2137: Goals and priorities for C++
Published:
2020 p2137r0
P2553: Make mdspan size_type controllable
Published:
2022-03 p2553r1
P2554: C-Array Interoperability of MDSpan
Published:
2022-02 p2554r0
P2630: Submdspan
Published:
2023-05 p2630r4
P2642: Padded mdspan layouts
Published:
2024-07 p2642r6
P2689: atomic_accessor
Published:
2024-05 p2689r3
P2763: layout_stride static extents default constructor fix
Published:
2023-01 p2763r0
P2798: Fix layout mappings all static extent default constructor
Published:
2023-02 p2798
P3242: Copy and fill for mdspan
Published:
2024-04 p3242r0
P3306: Atomic Read-Modify-Write Improvements
Published:
2024-05 p3306r0
P3330: User-defined Atomic Read-Modify-Write Operations
Published:
2024-07 p3330r0