Home News Enterprise Computing High Performance High-Performance Optimizing GPU Performance by Mapping Memory Hierarchy

Optimizing GPU Performance by Mapping Memory Hierarchy

Patrick Viry writes that the Ateji PX approach to GPU programming makes it easy to map the GPU memory hierarchy, which is essential for achieving good performance on GPUs or any hardware with non-uniform memory accesses (NUMA).

What is important here is that memory hierarchy is expressed in terms of lexical scope. A variable from a different level in the memory hierarchy is accessible if and only if it is visible in the lexical scope. In contrast, languages such as OpenCL use specific declaration modifiers to locate variables in different memory areas. With this approach, you can for instance a variable declared in the global lexical scope, but labeled with the __private modifier. It looks like there is only one variable, while actually each kernel has its own copy. Such modifiers make it very hard to understand the logic of the code.

Read the Full Story.

Read more at insideHPC


Subscribe to Comments Feed

Upcoming Linux Foundation Courses

  1. LFS220 Linux System Administration
    16 Mar » 19 Mar - Chicago +VIRTUAL
  2. LFD405 Embedded Linux Development with Yocto Project
    23 Mar » 26 Mar - Toronto - ON
  3. LFS520 OpenStack Cloud Architecture and Deployment
    23 Mar » 26 Mar - Atlanta + VIRTUAL

View All Upcoming Courses

Become an Individual Member
Check out the Friday Funnies

Sign Up For the Newsletter

Who we are ?

The Linux Foundation is a non-profit consortium dedicated to the growth of Linux.

More About the foundation...

Frequent Questions

Join / Linux Training / Board