gpudev.rst - OpenGrok history log for /dpdk/doc/guides/prog

Revision	Date	Author	Comments
# 41dd9a6b	08-Dec-2023	David Young <dave@youngcopy.com>	doc: reorganize prog guide Create categories in the index of the programmer's guide, sort chapters and rewrite some titles for consistency. Subdirectories are created for ethdev and eventdev for gr doc: reorganize prog guide Create categories in the index of the programmer's guide, sort chapters and rewrite some titles for consistency. Subdirectories are created for ethdev and eventdev for grouping the files together. Useless link anchors at the beginning of files are removed, the corresponding :ref: are replaced with :doc: links. Signed-off-by: David Young <dave@youngcopy.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> show more ...
# 0f91f952	22-Sep-2022	Thomas Monjalon <thomas@monjalon.net>	replace Mellanox with NVIDIA NVIDIA acquired Mellanox Technologies in 2020. The DPDK documentation and code might still include instances of or references to Mellanox trademarks (like BlueField and replace Mellanox with NVIDIA NVIDIA acquired Mellanox Technologies in 2020. The DPDK documentation and code might still include instances of or references to Mellanox trademarks (like BlueField and ConnectX) that are now NVIDIA trademarks. The PCI IDs and copyrights are unchanged. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Gal Cohen <galco@nvidia.com> show more ...
# 9b8cae4d	22-Feb-2022	Elena Agostini <eagostini@nvidia.com>	gpudev: use CPU mapping in communication list rte_gpu_mem_cpu_map() exposes a GPU memory area to the CPU. In gpudev communication list this is useful to store the status flag. A communication list gpudev: use CPU mapping in communication list rte_gpu_mem_cpu_map() exposes a GPU memory area to the CPU. In gpudev communication list this is useful to store the status flag. A communication list status flag allocated on GPU memory and mapped for CPU visibility can be updated by CPU and polled by a GPU workload. The polling operation is more frequent than the CPU update operation. Having the status flag in GPU memory reduces the GPU workload polling latency. If CPU mapping feature is not enabled, status flag resides in CPU memory registered so it's visible from the GPU. To facilitate the interaction with the status flag, this patch provides also the set/get functions for it. Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# d69bb47d	27-Jan-2022	Elena Agostini <eagostini@nvidia.com>	gpudev: expose GPU memory to CPU Enable the possibility to expose a GPU memory area and make it accessible from the CPU. GPU memory has to be allocated via rte_gpu_mem_alloc(). This patch allows t gpudev: expose GPU memory to CPU Enable the possibility to expose a GPU memory area and make it accessible from the CPU. GPU memory has to be allocated via rte_gpu_mem_alloc(). This patch allows the gpudev library to map (and unmap), through the GPU driver, a chunk of GPU memory and to return a memory pointer usable by the CPU to access the GPU memory area. Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# f64b299c	17-Nov-2021	Elena Agostini <eagostini@nvidia.com>	build: make gpudev optional This library can be made optional. drivers/gpu and app/test-gpudev depend on this library, so they are automatically disabled if the lib is disabled. Signed-off-by: Elen build: make gpudev optional This library can be made optional. drivers/gpu and app/test-gpudev depend on this library, so they are automatically disabled if the lib is disabled. Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# 3a994644	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	doc: add CUDA example in GPU guide Add a pseudo-code example to show how to use gpudev API with a CUDA application. Signed-off-by: Elena Agostini <eagostini@nvidia.com>
# c7ebd65c	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	gpudev: add communication list In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with tas gpudev: add communication list In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with task processing there may be the need to put in communication the CPU with the device in order to synchronize operations. An example could be a receive-and-process application where CPU is responsible for receiving packets in multiple mbufs and the GPU is responsible for processing the content of those packets. The purpose of this list is to provide a buffer in CPU memory visible from the GPU that can be treated as a circular buffer to let the CPU provide fondamental info of received packets to the GPU. A possible use-case is described below. CPU: - Trigger some task on the GPU - in a loop: - receive a number of packets - provide packets info to the GPU GPU: - Do some pre-processing - Wait to receive a new set of packet to be processed Layout of a communication list would be: ------- \| 0 \| => pkt_list \| status \| \| #pkts \| ------- \| 1 \| => pkt_list \| status \| \| #pkts \| ------- \| 2 \| => pkt_list \| status \| \| #pkts \| ------- \| .... \| => pkt_list ------- Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# f56160a2	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	gpudev: add communication flag In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with tas gpudev: add communication flag In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with task processing there may be the need to put in communication the CPU with the device in order to synchronize operations. The purpose of this flag is to allow the CPU and the GPU to exchange ACKs. A possible use-case is described below. CPU: - Trigger some task on the GPU - Prepare some data - Signal to the GPU the data is ready updating the communication flag GPU: - Do some pre-processing - Wait for more data from the CPU polling on the communication flag - Consume the data prepared by the CPU Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# 2d61b429	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	gpudev: add memory barrier Add a function for the application to ensure the coherency of the writes executed by another device into the GPU memory. Signed-off-by: Elena Agostini <eagostini@nvidia.c gpudev: add memory barrier Add a function for the application to ensure the coherency of the writes executed by another device into the GPU memory. Signed-off-by: Elena Agostini <eagostini@nvidia.com> show more ...
# e818c4e2	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	gpudev: add memory API In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. Such workload distribution can be achieved by gpudev: add memory API In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. Such workload distribution can be achieved by sharing some memory. As a first step, the features are focused on memory management. A function allows to allocate memory inside the device, or in the main (CPU) memory while making it visible for the device. This memory may be used to save packets or for synchronization data. The next step should focus on GPU processing task control. Signed-off-by: Elena Agostini <eagostini@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> show more ...
# 82e5f6b6	08-Nov-2021	Thomas Monjalon <thomas@monjalon.net>	gpudev: add child device representing a device context The computing device may operate in some isolated contexts. Memory and processing are isolated in a silo represented by a child device. The con gpudev: add child device representing a device context The computing device may operate in some isolated contexts. Memory and processing are isolated in a silo represented by a child device. The context is provided as an opaque by the caller of rte_gpu_add_child(). Signed-off-by: Thomas Monjalon <thomas@monjalon.net> show more ...
# 8b8036a6	08-Nov-2021	Elena Agostini <eagostini@nvidia.com>	gpudev: introduce GPU device class library In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. The new library gpudev i gpudev: introduce GPU device class library In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. The new library gpudev is for dealing with GPGPU computing devices from a DPDK application running on the CPU. The infrastructure is prepared to welcome drivers in drivers/gpu/. Signed-off-by: Elena Agostini <eagostini@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> show more ...