.. _libc_gpu_testing:

=========================
Testing the GPU C library
=========================

.. note::
   Running GPU tests with high parallelism is likely to cause spurious failures,
   out-of-resource errors, or indefinite hangs. Limiting the number of threads
   used while testing with ``LIBC_GPU_TEST_JOBS=<N>`` is highly recommended.

.. contents:: Table of Contents
  :depth: 4
  :local:

Testing infrastructure
======================

The LLVM C library supports different kinds of :ref:`tests <build_and_test>`
depending on the build configuration. The GPU target is considered a full build
and therefore provides all of its own utilities to build and run the generated
tests. Currently the GPU supports two kinds of tests.

#. **Hermetic tests** - These are unit tests built with a test suite similar to
   Google's ``gtest`` infrastructure. They use the same infrastructure as unit
   tests except that the entire environment is self-hosted, which allows us to
   run them on the GPU using our custom utilities. These are used to test the
   majority of functional implementations.

#. **Integration tests** - These are lightweight tests that simply call a
   ``main`` function and check if it returns non-zero.
   These are primarily used
   to test interfaces that are sensitive to threading.

The GPU uses the same testing infrastructure as the other supported ``libc``
targets. We do this by treating the GPU as a standard hosted environment capable
of launching a ``main`` function. Effectively, this means building our own
startup libraries and loader.

Testing utilities
=================

We provide two utilities to execute arbitrary programs on the GPU: the
``loader`` and the startup object.

Startup object
--------------

This object mimics the standard startup object used by existing C library
implementations. Its job is to perform the necessary setup prior to calling the
``main`` function. In the GPU case, this means exporting GPU kernels that will
perform the necessary operations. Here we use ``_begin`` and ``_end`` to handle
calling the global constructors and destructors, while ``_start`` begins the
standard execution. The following code block shows the implementation for AMDGPU
architectures.

.. code-block:: c++

  extern "C" [[gnu::visibility("protected"), clang::amdgpu_kernel]] void
  _begin(int argc, char **argv, char **env) {
    LIBC_NAMESPACE::atexit(&LIBC_NAMESPACE::call_fini_array_callbacks);
    LIBC_NAMESPACE::call_init_array_callbacks(argc, argv, env);
  }

  extern "C" [[gnu::visibility("protected"), clang::amdgpu_kernel]] void
  _start(int argc, char **argv, char **envp, int *ret) {
    __atomic_fetch_or(ret, main(argc, argv, envp), __ATOMIC_RELAXED);
  }

  extern "C" [[gnu::visibility("protected"), clang::amdgpu_kernel]] void
  _end(int retval) {
    LIBC_NAMESPACE::exit(retval);
  }

Loader runtime
--------------

The startup object provides a GPU executable with callable kernels for the
respective runtime. We can then define a minimal runtime that will launch these
kernels on the given device. Currently we provide the ``amdhsa-loader`` and
``nvptx-loader``, targeting the AMD HSA runtime and the CUDA driver runtime
respectively. By default these will launch with a single thread on the GPU.

.. code-block:: sh

  $> clang++ crt1.o test.cpp --target=amdgcn-amd-amdhsa -mcpu=native -flto
  $> amdhsa_loader --threads 1 --blocks 1 ./a.out
  Test Passed!
The loader utility will forward any arguments passed after the executable image
to the program on the GPU, as well as any environment variables that are set.
The number of threads and blocks can be controlled with ``--threads`` and
``--blocks``. These also accept additional ``x``, ``y``, and ``z`` variants for
multidimensional grids.

Running tests
=============

Tests will only be built and run if a GPU target architecture is set and the
corresponding loader utility was built. These can be overridden with the
``LIBC_GPU_TEST_ARCHITECTURE`` and ``LIBC_GPU_LOADER_EXECUTABLE`` :ref:`CMake
options <gpu_cmake_options>`. Once built, they can be run like any other tests.
The CMake target depends on how the library was built.

#. **Cross build** - If the C library was built using ``LLVM_ENABLE_PROJECTS``
   or a runtimes cross build, then the standard targets will be present in the
   base CMake build directory.

   #. All tests - You can run all supported tests with the command:

      .. code-block:: sh

        $> ninja check-libc

   #. Hermetic tests - You can run hermetic tests with the command:

      .. code-block:: sh

        $> ninja libc-hermetic-tests

   #. Integration tests - You can run integration tests with the command:

      .. code-block:: sh

        $> ninja libc-integration-tests

#. **Runtimes build** - If the library was built using ``LLVM_ENABLE_RUNTIMES``
   then the actual ``libc`` build will be in a separate directory.

   #. All tests - You can run all supported tests with the command:

      .. code-block:: sh

        $> ninja check-libc-amdgcn-amd-amdhsa
        $> ninja check-libc-nvptx64-nvidia-cuda

   #. Specific tests - You can use the same targets as above by entering the
      runtimes build directory.

      .. code-block:: sh

        $> ninja -C runtimes/runtimes-amdgcn-amd-amdhsa-bins check-libc
        $> ninja -C runtimes/runtimes-nvptx64-nvidia-cuda-bins check-libc
        $> cd runtimes/runtimes-amdgcn-amd-amdhsa-bins && ninja check-libc
        $> cd runtimes/runtimes-nvptx64-nvidia-cuda-bins && ninja check-libc

Tests can also be built and run manually using the respective loader utility.