1d0dff9baSBernard Iremonger.. BSD LICENSE 2d0dff9baSBernard Iremonger Copyright(c) 2010-2014 Intel Corporation. All rights reserved. 3d0dff9baSBernard Iremonger All rights reserved. 4d0dff9baSBernard Iremonger 5d0dff9baSBernard Iremonger Redistribution and use in source and binary forms, with or without 6d0dff9baSBernard Iremonger modification, are permitted provided that the following conditions 7d0dff9baSBernard Iremonger are met: 8d0dff9baSBernard Iremonger 9d0dff9baSBernard Iremonger * Redistributions of source code must retain the above copyright 10d0dff9baSBernard Iremonger notice, this list of conditions and the following disclaimer. 11d0dff9baSBernard Iremonger * Redistributions in binary form must reproduce the above copyright 12d0dff9baSBernard Iremonger notice, this list of conditions and the following disclaimer in 13d0dff9baSBernard Iremonger the documentation and/or other materials provided with the 14d0dff9baSBernard Iremonger distribution. 15d0dff9baSBernard Iremonger * Neither the name of Intel Corporation nor the names of its 16d0dff9baSBernard Iremonger contributors may be used to endorse or promote products derived 17d0dff9baSBernard Iremonger from this software without specific prior written permission. 18d0dff9baSBernard Iremonger 19d0dff9baSBernard Iremonger THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS 20d0dff9baSBernard Iremonger "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT 21d0dff9baSBernard Iremonger LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR 22d0dff9baSBernard Iremonger A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT 23d0dff9baSBernard Iremonger OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, 24d0dff9baSBernard Iremonger SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT 25d0dff9baSBernard Iremonger LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, 26d0dff9baSBernard Iremonger DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY 27d0dff9baSBernard Iremonger THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT 28d0dff9baSBernard Iremonger (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE 29d0dff9baSBernard Iremonger OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 30d0dff9baSBernard Iremonger 31d0dff9baSBernard IremongerLink Status Interrupt Sample Application 32d0dff9baSBernard Iremonger======================================== 33d0dff9baSBernard Iremonger 34d0dff9baSBernard IremongerThe Link Status Interrupt sample application is a simple example of packet processing using 35*e0c7c473SSiobhan Butlerthe Data Plane Development Kit (DPDK) that 36d0dff9baSBernard Iremongerdemonstrates how network link status changes for a network port can be captured and 37*e0c7c473SSiobhan Butlerused by a DPDK application. 38d0dff9baSBernard Iremonger 39d0dff9baSBernard IremongerOverview 40d0dff9baSBernard Iremonger-------- 41d0dff9baSBernard Iremonger 42d0dff9baSBernard IremongerThe Link Status Interrupt sample application registers a user space callback for the link status interrupt of each port 43d0dff9baSBernard Iremongerand performs L2 forwarding for each packet that is received on an RX_PORT. 44d0dff9baSBernard IremongerThe following operations are performed: 45d0dff9baSBernard Iremonger 46d0dff9baSBernard Iremonger* RX_PORT and TX_PORT are paired with available ports one-by-one according to the core mask 47d0dff9baSBernard Iremonger 48d0dff9baSBernard Iremonger* The source MAC address is replaced by the TX_PORT MAC address 49d0dff9baSBernard Iremonger 50d0dff9baSBernard Iremonger* The destination MAC address is replaced by 02:00:00:00:00:TX_PORT_ID 51d0dff9baSBernard Iremonger 52d0dff9baSBernard IremongerThis application can be used to demonstrate the usage of link status interrupt and its user space callbacks 53d0dff9baSBernard Iremongerand the behavior of L2 forwarding each time the link status changes. 54d0dff9baSBernard Iremonger 55d0dff9baSBernard IremongerCompiling the Application 56d0dff9baSBernard Iremonger------------------------- 57d0dff9baSBernard Iremonger 58d0dff9baSBernard Iremonger#. Go to the example directory: 59d0dff9baSBernard Iremonger 60d0dff9baSBernard Iremonger .. code-block:: console 61d0dff9baSBernard Iremonger 62d0dff9baSBernard Iremonger export RTE_SDK=/path/to/rte_sdk 63d0dff9baSBernard Iremonger cd ${RTE_SDK}/examples/link_status_interrupt 64d0dff9baSBernard Iremonger 65d0dff9baSBernard Iremonger#. Set the target (a default target is used if not specified). For example: 66d0dff9baSBernard Iremonger 67d0dff9baSBernard Iremonger .. code-block:: console 68d0dff9baSBernard Iremonger 69d0dff9baSBernard Iremonger export RTE_TARGET=x86_64-native-linuxapp-gcc 70d0dff9baSBernard Iremonger 71*e0c7c473SSiobhan Butler See the *DPDK Getting Started Guide* for possible RTE_TARGET values. 72d0dff9baSBernard Iremonger 73d0dff9baSBernard Iremonger#. Build the application: 74d0dff9baSBernard Iremonger 75d0dff9baSBernard Iremonger .. code-block:: console 76d0dff9baSBernard Iremonger 77d0dff9baSBernard Iremonger make 78d0dff9baSBernard Iremonger 79d0dff9baSBernard Iremonger.. note:: 80d0dff9baSBernard Iremonger 81d0dff9baSBernard Iremonger The compiled application is written to the build subdirectory. 82d0dff9baSBernard Iremonger To have the application written to a different location, 83d0dff9baSBernard Iremonger the O=/path/to/build/directory option may be specified on the make command line. 84d0dff9baSBernard Iremonger 85d0dff9baSBernard IremongerRunning the Application 86d0dff9baSBernard Iremonger----------------------- 87d0dff9baSBernard Iremonger 88d0dff9baSBernard IremongerThe application requires a number of command line options: 89d0dff9baSBernard Iremonger 90d0dff9baSBernard Iremonger.. code-block:: console 91d0dff9baSBernard Iremonger 92d0dff9baSBernard Iremonger ./build/link_status_interrupt [EAL options] -- -p PORTMASK [-q NQ][-T PERIOD] 93d0dff9baSBernard Iremonger 94d0dff9baSBernard Iremongerwhere, 95d0dff9baSBernard Iremonger 96d0dff9baSBernard Iremonger* -p PORTMASK: A hexadecimal bitmask of the ports to configure 97d0dff9baSBernard Iremonger 98d0dff9baSBernard Iremonger* -q NQ: A number of queues (=ports) per lcore (default is 1) 99d0dff9baSBernard Iremonger 100d0dff9baSBernard Iremonger* -T PERIOD: statistics will be refreshed each PERIOD seconds (0 to disable, 10 default) 101d0dff9baSBernard Iremonger 102d0dff9baSBernard IremongerTo run the application in a linuxapp environment with 4 lcores, 4 memory channels, 16 ports and 8 RX queues per lcore, 103d0dff9baSBernard Iremongerissue the command: 104d0dff9baSBernard Iremonger 105d0dff9baSBernard Iremonger.. code-block:: console 106d0dff9baSBernard Iremonger 107d0dff9baSBernard Iremonger $ ./build/link_status_interrupt -c f -n 4-- -q 8 -p ffff 108d0dff9baSBernard Iremonger 109*e0c7c473SSiobhan ButlerRefer to the *DPDK Getting Started Guide* for general information on running applications 110d0dff9baSBernard Iremongerand the Environment Abstraction Layer (EAL) options. 111d0dff9baSBernard Iremonger 112d0dff9baSBernard IremongerExplanation 113d0dff9baSBernard Iremonger----------- 114d0dff9baSBernard Iremonger 115d0dff9baSBernard IremongerThe following sections provide some explanation of the code. 116d0dff9baSBernard Iremonger 117d0dff9baSBernard IremongerCommand Line Arguments 118d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~ 119d0dff9baSBernard Iremonger 120d0dff9baSBernard IremongerThe Link Status Interrupt sample application takes specific parameters, 121d0dff9baSBernard Iremongerin addition to Environment Abstraction Layer (EAL) arguments (see Section 13.3). 122d0dff9baSBernard Iremonger 123d0dff9baSBernard IremongerCommand line parsing is done in the same way as it is done in the L2 Forwarding Sample Application. 124d0dff9baSBernard IremongerSee Section 9.4.1, "Command Line Arguments" for more information. 125d0dff9baSBernard Iremonger 126d0dff9baSBernard IremongerMbuf Pool Initialization 127d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~~~ 128d0dff9baSBernard Iremonger 129d0dff9baSBernard IremongerMbuf pool initialization is done in the same way as it is done in the L2 Forwarding Sample Application. 130d0dff9baSBernard IremongerSee Section 9.4.2, "Mbuf Pool Initialization" for more information. 131d0dff9baSBernard Iremonger 132d0dff9baSBernard IremongerDriver Initialization 133d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~ 134d0dff9baSBernard Iremonger 135d0dff9baSBernard IremongerThe main part of the code in the main() function relates to the initialization of the driver. 136d0dff9baSBernard IremongerTo fully understand this code, it is recommended to study the chapters that related to the Poll Mode Driver in the 137*e0c7c473SSiobhan Butler*DPDK Programmer's Guide and the DPDK API Reference*. 138d0dff9baSBernard Iremonger 139d0dff9baSBernard Iremonger.. code-block:: c 140d0dff9baSBernard Iremonger 141d0dff9baSBernard Iremonger if (rte_eal_pci_probe() < 0) 142d0dff9baSBernard Iremonger rte_exit(EXIT_FAILURE, "Cannot probe PCI\n"); 143d0dff9baSBernard Iremonger 144d0dff9baSBernard Iremonger nb_ports = rte_eth_dev_count(); 145d0dff9baSBernard Iremonger if (nb_ports == 0) 146d0dff9baSBernard Iremonger rte_exit(EXIT_FAILURE, "No Ethernet ports - bye\n"); 147d0dff9baSBernard Iremonger 148d0dff9baSBernard Iremonger if (nb_ports > RTE_MAX_ETHPORTS) 149d0dff9baSBernard Iremonger nb_ports = RTE_MAX_ETHPORTS; 150d0dff9baSBernard Iremonger 151d0dff9baSBernard Iremonger /* 152d0dff9baSBernard Iremonger * Each logical core is assigned a dedicated TX queue on each port. 153d0dff9baSBernard Iremonger */ 154d0dff9baSBernard Iremonger 155d0dff9baSBernard Iremonger for (portid = 0; portid < nb_ports; portid++) { 156d0dff9baSBernard Iremonger /* skip ports that are not enabled */ 157d0dff9baSBernard Iremonger 158d0dff9baSBernard Iremonger if ((lsi_enabled_port_mask & (1 << portid)) == 0) 159d0dff9baSBernard Iremonger continue; 160d0dff9baSBernard Iremonger 161d0dff9baSBernard Iremonger /* save the destination port id */ 162d0dff9baSBernard Iremonger 163d0dff9baSBernard Iremonger if (nb_ports_in_mask % 2) { 164d0dff9baSBernard Iremonger lsi_dst_ports[portid] = portid_last; 165d0dff9baSBernard Iremonger lsi_dst_ports[portid_last] = portid; 166d0dff9baSBernard Iremonger } 167d0dff9baSBernard Iremonger else 168d0dff9baSBernard Iremonger portid_last = portid; 169d0dff9baSBernard Iremonger 170d0dff9baSBernard Iremonger nb_ports_in_mask++; 171d0dff9baSBernard Iremonger 172d0dff9baSBernard Iremonger rte_eth_dev_info_get((uint8_t) portid, &dev_info); 173d0dff9baSBernard Iremonger } 174d0dff9baSBernard Iremonger 175d0dff9baSBernard IremongerObserve that: 176d0dff9baSBernard Iremonger 177d0dff9baSBernard Iremonger* rte_eal_pci_probe() parses the devices on the PCI bus and initializes recognized devices. 178d0dff9baSBernard Iremonger 179d0dff9baSBernard IremongerThe next step is to configure the RX and TX queues. 180d0dff9baSBernard IremongerFor each port, there is only one RX queue (only one lcore is able to poll a given port). 181d0dff9baSBernard IremongerThe number of TX queues depends on the number of available lcores. 182d0dff9baSBernard IremongerThe rte_eth_dev_configure() function is used to configure the number of queues for a port: 183d0dff9baSBernard Iremonger 184d0dff9baSBernard Iremonger.. code-block:: c 185d0dff9baSBernard Iremonger 186d0dff9baSBernard Iremonger ret = rte_eth_dev_configure((uint8_t) portid, 1, 1, &port_conf); 187d0dff9baSBernard Iremonger if (ret < 0) 188d0dff9baSBernard Iremonger rte_exit(EXIT_FAILURE, "Cannot configure device: err=%d, port=%u\n", ret, portid); 189d0dff9baSBernard Iremonger 190d0dff9baSBernard IremongerThe global configuration is stored in a static structure: 191d0dff9baSBernard Iremonger 192d0dff9baSBernard Iremonger.. code-block:: c 193d0dff9baSBernard Iremonger 194d0dff9baSBernard Iremonger static const struct rte_eth_conf port_conf = { 195d0dff9baSBernard Iremonger .rxmode = { 196d0dff9baSBernard Iremonger .split_hdr_size = 0, 197d0dff9baSBernard Iremonger .header_split = 0, /**< Header Split disabled */ 198d0dff9baSBernard Iremonger .hw_ip_checksum = 0, /**< IP checksum offload disabled */ 199d0dff9baSBernard Iremonger .hw_vlan_filter = 0, /**< VLAN filtering disabled */ 200d0dff9baSBernard Iremonger .hw_strip_crc= 0, /**< CRC stripped by hardware */ 201d0dff9baSBernard Iremonger }, 202d0dff9baSBernard Iremonger .txmode = {}, 203d0dff9baSBernard Iremonger .intr_conf = { 204d0dff9baSBernard Iremonger .lsc = 1, /**< link status interrupt feature enabled */ 205d0dff9baSBernard Iremonger }, 206d0dff9baSBernard Iremonger }; 207d0dff9baSBernard Iremonger 208d0dff9baSBernard IremongerConfiguring lsc to 0 (the default) disables the generation of any link status change interrupts in kernel space 209d0dff9baSBernard Iremongerand no user space interrupt event is received. 210d0dff9baSBernard IremongerThe public interface rte_eth_link_get() accesses the NIC registers directly to update the link status. 211d0dff9baSBernard IremongerConfiguring lsc to non-zero enables the generation of link status change interrupts in kernel space 212d0dff9baSBernard Iremongerwhen a link status change is present and calls the user space callbacks registered by the application. 213d0dff9baSBernard IremongerThe public interface rte_eth_link_get() just reads the link status in a global structure 214d0dff9baSBernard Iremongerthat would be updated in the interrupt host thread only. 215d0dff9baSBernard Iremonger 216d0dff9baSBernard IremongerInterrupt Callback Registration 217d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218d0dff9baSBernard Iremonger 219d0dff9baSBernard IremongerThe application can register one or more callbacks to a specific port and interrupt event. 220d0dff9baSBernard IremongerAn example callback function that has been written as indicated below. 221d0dff9baSBernard Iremonger 222d0dff9baSBernard Iremonger.. code-block:: c 223d0dff9baSBernard Iremonger 224d0dff9baSBernard Iremonger static void 225d0dff9baSBernard Iremonger lsi_event_callback(uint8_t port_id, enum rte_eth_event_type type, void *param) 226d0dff9baSBernard Iremonger { 227d0dff9baSBernard Iremonger struct rte_eth_link link; 228d0dff9baSBernard Iremonger 229d0dff9baSBernard Iremonger RTE_SET_USED(param); 230d0dff9baSBernard Iremonger 231d0dff9baSBernard Iremonger printf("\n\nIn registered callback...\n"); 232d0dff9baSBernard Iremonger 233d0dff9baSBernard Iremonger printf("Event type: %s\n", type == RTE_ETH_EVENT_INTR_LSC ? "LSC interrupt" : "unknown event"); 234d0dff9baSBernard Iremonger 235d0dff9baSBernard Iremonger rte_eth_link_get_nowait(port_id, &link); 236d0dff9baSBernard Iremonger 237d0dff9baSBernard Iremonger if (link.link_status) { 238d0dff9baSBernard Iremonger printf("Port %d Link Up - speed %u Mbps - %s\n\n", port_id, (unsigned)link.link_speed, 239d0dff9baSBernard Iremonger (link.link_duplex == ETH_LINK_FULL_DUPLEX) ? ("full-duplex") : ("half-duplex")); 240d0dff9baSBernard Iremonger } else 241d0dff9baSBernard Iremonger printf("Port %d Link Down\n\n", port_id); 242d0dff9baSBernard Iremonger } 243d0dff9baSBernard Iremonger 244d0dff9baSBernard IremongerThis function is called when a link status interrupt is present for the right port. 245d0dff9baSBernard IremongerThe port_id indicates which port the interrupt applies to. 246d0dff9baSBernard IremongerThe type parameter identifies the interrupt event type, 247d0dff9baSBernard Iremongerwhich currently can be RTE_ETH_EVENT_INTR_LSC only, but other types can be added in the future. 248d0dff9baSBernard IremongerThe param parameter is the address of the parameter for the callback. 249d0dff9baSBernard IremongerThis function should be implemented with care since it will be called in the interrupt host thread, 250d0dff9baSBernard Iremongerwhich is different from the main thread of its caller. 251d0dff9baSBernard Iremonger 252d0dff9baSBernard IremongerThe application registers the lsi_event_callback and a NULL parameter to the link status interrupt event on each port: 253d0dff9baSBernard Iremonger 254d0dff9baSBernard Iremonger.. code-block:: c 255d0dff9baSBernard Iremonger 256d0dff9baSBernard Iremonger rte_eth_dev_callback_register((uint8_t)portid, RTE_ETH_EVENT_INTR_LSC, lsi_event_callback, NULL); 257d0dff9baSBernard Iremonger 258d0dff9baSBernard IremongerThis registration can be done only after calling the rte_eth_dev_configure() function and before calling any other function. 259d0dff9baSBernard IremongerIf lsc is initialized with 0, the callback is never called since no interrupt event would ever be present. 260d0dff9baSBernard Iremonger 261d0dff9baSBernard IremongerRX Queue Initialization 262d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~~ 263d0dff9baSBernard Iremonger 264d0dff9baSBernard IremongerThe application uses one lcore to poll one or several ports, depending on the -q option, 265d0dff9baSBernard Iremongerwhich specifies the number of queues per lcore. 266d0dff9baSBernard Iremonger 267d0dff9baSBernard IremongerFor example, if the user specifies -q 4, the application is able to poll four ports with one lcore. 268d0dff9baSBernard IremongerIf there are 16 ports on the target (and if the portmask argument is -p ffff), 269d0dff9baSBernard Iremongerthe application will need four lcores to poll all the ports. 270d0dff9baSBernard Iremonger 271d0dff9baSBernard Iremonger.. code-block:: c 272d0dff9baSBernard Iremonger 273d0dff9baSBernard Iremonger ret = rte_eth_rx_queue_setup((uint8_t) portid, 0, nb_rxd, SOCKET0, &rx_conf, lsi_pktmbuf_pool); 274d0dff9baSBernard Iremonger if (ret < 0) 275d0dff9baSBernard Iremonger rte_exit(EXIT_FAILURE, "rte_eth_rx_queue_setup: err=%d, port=%u\n", ret, portid); 276d0dff9baSBernard Iremonger 277d0dff9baSBernard IremongerThe list of queues that must be polled for a given lcore is stored in a private structure called struct lcore_queue_conf. 278d0dff9baSBernard Iremonger 279d0dff9baSBernard Iremonger.. code-block:: c 280d0dff9baSBernard Iremonger 281d0dff9baSBernard Iremonger struct lcore_queue_conf { 282d0dff9baSBernard Iremonger unsigned n_rx_port; 283d0dff9baSBernard Iremonger unsigned rx_port_list[MAX_RX_QUEUE_PER_LCORE]; unsigned tx_queue_id; 284d0dff9baSBernard Iremonger struct mbuf_table tx_mbufs[LSI_MAX_PORTS]; 285d0dff9baSBernard Iremonger } rte_cache_aligned; 286d0dff9baSBernard Iremonger 287d0dff9baSBernard Iremonger struct lcore_queue_conf lcore_queue_conf[RTE_MAX_LCORE]; 288d0dff9baSBernard Iremonger 289d0dff9baSBernard IremongerThe n_rx_port and rx_port_list[] fields are used in the main packet processing loop 290d0dff9baSBernard Iremonger(see Section 13.4.7, "Receive, Process and Transmit Packets" later in this chapter). 291d0dff9baSBernard Iremonger 292d0dff9baSBernard IremongerThe global configuration for the RX queues is stored in a static structure: 293d0dff9baSBernard Iremonger 294d0dff9baSBernard Iremonger.. code-block:: c 295d0dff9baSBernard Iremonger 296d0dff9baSBernard Iremonger static const struct rte_eth_rxconf rx_conf = { 297d0dff9baSBernard Iremonger .rx_thresh = { 298d0dff9baSBernard Iremonger .pthresh = RX_PTHRESH, 299d0dff9baSBernard Iremonger .hthresh = RX_HTHRESH, 300d0dff9baSBernard Iremonger .wthresh = RX_WTHRESH, 301d0dff9baSBernard Iremonger }, 302d0dff9baSBernard Iremonger }; 303d0dff9baSBernard Iremonger 304d0dff9baSBernard IremongerTX Queue Initialization 305d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~~ 306d0dff9baSBernard Iremonger 307d0dff9baSBernard IremongerEach lcore should be able to transmit on any port. 308d0dff9baSBernard IremongerFor every port, a single TX queue is initialized. 309d0dff9baSBernard Iremonger 310d0dff9baSBernard Iremonger.. code-block:: c 311d0dff9baSBernard Iremonger 312d0dff9baSBernard Iremonger /* init one TX queue logical core on each port */ 313d0dff9baSBernard Iremonger 314d0dff9baSBernard Iremonger fflush(stdout); 315d0dff9baSBernard Iremonger 316d0dff9baSBernard Iremonger ret = rte_eth_tx_queue_setup(portid, 0, nb_txd, rte_eth_dev_socket_id(portid), &tx_conf); 317d0dff9baSBernard Iremonger if (ret < 0) 318d0dff9baSBernard Iremonger rte_exit(EXIT_FAILURE, "rte_eth_tx_queue_setup: err=%d,port=%u\n", ret, (unsigned) portid); 319d0dff9baSBernard Iremonger 320d0dff9baSBernard IremongerThe global configuration for TX queues is stored in a static structure: 321d0dff9baSBernard Iremonger 322d0dff9baSBernard Iremonger.. code-block:: c 323d0dff9baSBernard Iremonger 324d0dff9baSBernard Iremonger static const struct rte_eth_txconf tx_conf = { 325d0dff9baSBernard Iremonger .tx_thresh = { 326d0dff9baSBernard Iremonger .pthresh = TX_PTHRESH, 327d0dff9baSBernard Iremonger .hthresh = TX_HTHRESH, 328d0dff9baSBernard Iremonger .wthresh = TX_WTHRESH, 329d0dff9baSBernard Iremonger }, 330d0dff9baSBernard Iremonger .tx_free_thresh = RTE_TEST_TX_DESC_DEFAULT + 1, /* disable feature */ 331d0dff9baSBernard Iremonger }; 332d0dff9baSBernard Iremonger 333d0dff9baSBernard IremongerReceive, Process and Transmit Packets 334d0dff9baSBernard Iremonger~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335d0dff9baSBernard Iremonger 336d0dff9baSBernard IremongerIn the lsi_main_loop() function, the main task is to read ingress packets from the RX queues. 337d0dff9baSBernard IremongerThis is done using the following code: 338d0dff9baSBernard Iremonger 339d0dff9baSBernard Iremonger.. code-block:: c 340d0dff9baSBernard Iremonger 341d0dff9baSBernard Iremonger /* 342d0dff9baSBernard Iremonger * Read packet from RX queues 343d0dff9baSBernard Iremonger */ 344d0dff9baSBernard Iremonger 345d0dff9baSBernard Iremonger for (i = 0; i < qconf->n_rx_port; i++) { 346d0dff9baSBernard Iremonger portid = qconf->rx_port_list[i]; 347d0dff9baSBernard Iremonger nb_rx = rte_eth_rx_burst((uint8_t) portid, 0, pkts_burst, MAX_PKT_BURST); 348d0dff9baSBernard Iremonger port_statistics[portid].rx += nb_rx; 349d0dff9baSBernard Iremonger 350d0dff9baSBernard Iremonger for (j = 0; j < nb_rx; j++) { 351d0dff9baSBernard Iremonger m = pkts_burst[j]; 352d0dff9baSBernard Iremonger rte_prefetch0(rte_pktmbuf_mtod(m, void *)); 353d0dff9baSBernard Iremonger lsi_simple_forward(m, portid); 354d0dff9baSBernard Iremonger } 355d0dff9baSBernard Iremonger } 356d0dff9baSBernard Iremonger 357d0dff9baSBernard IremongerPackets are read in a burst of size MAX_PKT_BURST. 358d0dff9baSBernard IremongerThe rte_eth_rx_burst() function writes the mbuf pointers in a local table and returns the number of available mbufs in the table. 359d0dff9baSBernard Iremonger 360d0dff9baSBernard IremongerThen, each mbuf in the table is processed by the lsi_simple_forward() function. 361d0dff9baSBernard IremongerThe processing is very simple: processes the TX port from the RX port and then replaces the source and destination MAC addresses. 362d0dff9baSBernard Iremonger 363d0dff9baSBernard Iremonger.. note:: 364d0dff9baSBernard Iremonger 365d0dff9baSBernard Iremonger In the following code, the two lines for calculating the output port require some explanation. 366d0dff9baSBernard Iremonger If portId is even, the first line does nothing (as portid & 1 will be 0), and the second line adds 1. 367d0dff9baSBernard Iremonger If portId is odd, the first line subtracts one and the second line does nothing. 368d0dff9baSBernard Iremonger Therefore, 0 goes to 1, and 1 to 0, 2 goes to 3 and 3 to 2, and so on. 369d0dff9baSBernard Iremonger 370d0dff9baSBernard Iremonger.. code-block:: c 371d0dff9baSBernard Iremonger 372d0dff9baSBernard Iremonger static void 373d0dff9baSBernard Iremonger lsi_simple_forward(struct rte_mbuf *m, unsigned portid) 374d0dff9baSBernard Iremonger { 375d0dff9baSBernard Iremonger struct ether_hdr *eth; 376d0dff9baSBernard Iremonger void *tmp; 377d0dff9baSBernard Iremonger unsigned dst_port = lsi_dst_ports[portid]; 378d0dff9baSBernard Iremonger 379d0dff9baSBernard Iremonger eth = rte_pktmbuf_mtod(m, struct ether_hdr *); 380d0dff9baSBernard Iremonger 381d0dff9baSBernard Iremonger /* 02:00:00:00:00:xx */ 382d0dff9baSBernard Iremonger 383d0dff9baSBernard Iremonger tmp = ð->d_addr.addr_bytes[0]; 384d0dff9baSBernard Iremonger 385d0dff9baSBernard Iremonger *((uint64_t *)tmp) = 0x000000000002 + (dst_port << 40); 386d0dff9baSBernard Iremonger 387d0dff9baSBernard Iremonger /* src addr */ 388d0dff9baSBernard Iremonger ether_addr_copy(&lsi_ports_eth_addr[dst_port], ð->s_addr); 389d0dff9baSBernard Iremonger 390d0dff9baSBernard Iremonger lsi_send_packet(m, dst_port); 391d0dff9baSBernard Iremonger } 392d0dff9baSBernard Iremonger 393d0dff9baSBernard IremongerThen, the packet is sent using the lsi_send_packet(m, dst_port) function. 394d0dff9baSBernard IremongerFor this test application, the processing is exactly the same for all packets arriving on the same RX port. 395d0dff9baSBernard IremongerTherefore, it would have been possible to call the lsi_send_burst() function directly from the main loop 396d0dff9baSBernard Iremongerto send all the received packets on the same TX port using 397d0dff9baSBernard Iremongerthe burst-oriented send function, which is more efficient. 398d0dff9baSBernard Iremonger 399d0dff9baSBernard IremongerHowever, in real-life applications (such as, L3 routing), 400d0dff9baSBernard Iremongerpacket N is not necessarily forwarded on the same port as packet N-1. 401d0dff9baSBernard IremongerThe application is implemented to illustrate that so the same approach can be reused in a more complex application. 402d0dff9baSBernard Iremonger 403d0dff9baSBernard IremongerThe lsi_send_packet() function stores the packet in a per-lcore and per-txport table. 404d0dff9baSBernard IremongerIf the table is full, the whole packets table is transmitted using the lsi_send_burst() function: 405d0dff9baSBernard Iremonger 406d0dff9baSBernard Iremonger.. code-block:: c 407d0dff9baSBernard Iremonger 408d0dff9baSBernard Iremonger /* Send the packet on an output interface */ 409d0dff9baSBernard Iremonger 410d0dff9baSBernard Iremonger static int 411d0dff9baSBernard Iremonger lsi_send_packet(struct rte_mbuf *m, uint8_t port) 412d0dff9baSBernard Iremonger { 413d0dff9baSBernard Iremonger unsigned lcore_id, len; 414d0dff9baSBernard Iremonger struct lcore_queue_conf *qconf; 415d0dff9baSBernard Iremonger 416d0dff9baSBernard Iremonger lcore_id = rte_lcore_id(); 417d0dff9baSBernard Iremonger qconf = &lcore_queue_conf[lcore_id]; 418d0dff9baSBernard Iremonger len = qconf->tx_mbufs[port].len; 419d0dff9baSBernard Iremonger qconf->tx_mbufs[port].m_table[len] = m; 420d0dff9baSBernard Iremonger len++; 421d0dff9baSBernard Iremonger 422d0dff9baSBernard Iremonger /* enough pkts to be sent */ 423d0dff9baSBernard Iremonger 424d0dff9baSBernard Iremonger if (unlikely(len == MAX_PKT_BURST)) { 425d0dff9baSBernard Iremonger lsi_send_burst(qconf, MAX_PKT_BURST, port); 426d0dff9baSBernard Iremonger len = 0; 427d0dff9baSBernard Iremonger } 428d0dff9baSBernard Iremonger qconf->tx_mbufs[port].len = len; 429d0dff9baSBernard Iremonger 430d0dff9baSBernard Iremonger return 0; 431d0dff9baSBernard Iremonger } 432d0dff9baSBernard Iremonger 433d0dff9baSBernard IremongerTo ensure that no packets remain in the tables, each lcore does a draining of the TX queue in its main loop. 434d0dff9baSBernard IremongerThis technique introduces some latency when there are not many packets to send. 435d0dff9baSBernard IremongerHowever, it improves performance: 436d0dff9baSBernard Iremonger 437d0dff9baSBernard Iremonger.. code-block:: c 438d0dff9baSBernard Iremonger 439d0dff9baSBernard Iremonger cur_tsc = rte_rdtsc(); 440d0dff9baSBernard Iremonger 441d0dff9baSBernard Iremonger /* 442d0dff9baSBernard Iremonger * TX burst queue drain 443d0dff9baSBernard Iremonger */ 444d0dff9baSBernard Iremonger 445d0dff9baSBernard Iremonger diff_tsc = cur_tsc - prev_tsc; 446d0dff9baSBernard Iremonger 447d0dff9baSBernard Iremonger if (unlikely(diff_tsc > drain_tsc)) { 448d0dff9baSBernard Iremonger /* this could be optimized (use queueid instead of * portid), but it is not called so often */ 449d0dff9baSBernard Iremonger 450d0dff9baSBernard Iremonger for (portid = 0; portid < RTE_MAX_ETHPORTS; portid++) { 451d0dff9baSBernard Iremonger if (qconf->tx_mbufs[portid].len == 0) 452d0dff9baSBernard Iremonger continue; 453d0dff9baSBernard Iremonger 454d0dff9baSBernard Iremonger lsi_send_burst(&lcore_queue_conf[lcore_id], 455d0dff9baSBernard Iremonger qconf->tx_mbufs[portid].len, (uint8_t) portid); 456d0dff9baSBernard Iremonger qconf->tx_mbufs[portid].len = 0; 457d0dff9baSBernard Iremonger } 458d0dff9baSBernard Iremonger 459d0dff9baSBernard Iremonger /* if timer is enabled */ 460d0dff9baSBernard Iremonger 461d0dff9baSBernard Iremonger if (timer_period > 0) { 462d0dff9baSBernard Iremonger /* advance the timer */ 463d0dff9baSBernard Iremonger 464d0dff9baSBernard Iremonger timer_tsc += diff_tsc; 465d0dff9baSBernard Iremonger 466d0dff9baSBernard Iremonger /* if timer has reached its timeout */ 467d0dff9baSBernard Iremonger 468d0dff9baSBernard Iremonger if (unlikely(timer_tsc >= (uint64_t) timer_period)) { 469d0dff9baSBernard Iremonger /* do this only on master core */ 470d0dff9baSBernard Iremonger 471d0dff9baSBernard Iremonger if (lcore_id == rte_get_master_lcore()) { 472d0dff9baSBernard Iremonger print_stats(); 473d0dff9baSBernard Iremonger 474d0dff9baSBernard Iremonger /* reset the timer */ 475d0dff9baSBernard Iremonger timer_tsc = 0; 476d0dff9baSBernard Iremonger } 477d0dff9baSBernard Iremonger } 478d0dff9baSBernard Iremonger } 479d0dff9baSBernard Iremonger prev_tsc = cur_tsc; 480d0dff9baSBernard Iremonger } 481