User Guide

This section describes the details on controlling and configuring the QDMA IP

Running the DPDK software test application

The below steps describe the step by step procedure to run the DPDK QDMA test application and to interact with the QDMA PCIe device.

  1. Navigate to examples/qdma_testapp directory.
# cd <server_dir>/<dpdk_test_area>/dpdk-stable/examples/qdma_testapp
  1. Run the ‘lspci’ command on the console and verify that the PFs are detected as shown below. Here, ‘81’ is the PCIe bus number on which Xilinx QDMA device is installed.
# lspci | grep Xilinx
81:00.0 Memory controller: Xilinx Corporation Device 903f
81:00.1 Memory controller: Xilinx Corporation Device 913f
81:00.2 Memory controller: Xilinx Corporation Device 923f
81:00.3 Memory controller: Xilinx Corporation Device 933f
  1. Execute the following commands required for running the DPDK application
# mkdir /mnt/huge
# mount -t hugetlbfs nodev /mnt/huge
# modprobe uio
# insmod ../../dpdk-kmods/linux/igb_uio/igb_uio.ko
  1. Bind PF ports to the igb_uio module as shown below
# ../../usertools/dpdk-devbind.py -b igb_uio 81:00.0
# ../../usertools/dpdk-devbind.py -b igb_uio 81:00.1
# ../../usertools/dpdk-devbind.py -b igb_uio 81:00.2
# ../../usertools/dpdk-devbind.py -b igb_uio 81:00.3

5. The execution of steps 3 and 4 creates a max_vfs file entry in /sys/bus/pci/devices/0000:<bus>:<device>.<function>. Enable VFs for each PF by writing the number of VFs to enable to this file as shown below. This example creates 1 VF for each PF.

# echo 1 > /sys/bus/pci/devices/0000\:81\:00.0/max_vfs
# echo 1 > /sys/bus/pci/devices/0000\:81\:00.1/max_vfs
# echo 1 > /sys/bus/pci/devices/0000\:81\:00.2/max_vfs
# echo 1 > /sys/bus/pci/devices/0000\:81\:00.3/max_vfs
  1. Run the lspci command on the console and verify that the VFs are listed in the output as shown below
# lspci | grep Xilinx
81:00.0 Memory controller: Xilinx Corporation Device 903f
81:00.1 Memory controller: Xilinx Corporation Device 913f
81:00.2 Memory controller: Xilinx Corporation Device 923f
81:00.3 Memory controller: Xilinx Corporation Device 933f
81:00.4 Memory controller: Xilinx Corporation Device a03f
81:08.4 Memory controller: Xilinx Corporation Device a13f
81:10.0 Memory controller: Xilinx Corporation Device a23f
81:17.4 Memory controller: Xilinx Corporation Device a33f

In total, 8 ports are serially arranged as shown above, where 81:00.0 represents port 0, 81:00.1 represents port 1 and so on. 81:17.4 being the last one, represents port 7.

  1. Execute the following commands to bind the VF ports to igb_uio module
# ../../usertools/dpdk-devbind.py -b igb_uio 81:00.4
# ../../usertools/dpdk-devbind.py -b igb_uio 81:08.4
# ../../usertools/dpdk-devbind.py -b igb_uio 81:10.0
# ../../usertools/dpdk-devbind.py -b igb_uio 81:17.4
  1. Run the qdma_testapp using the following command
#./build/qdma_testapp -c 0xf -n 4
  • -c represents processor mask
  • -n represents number of memory channels

Binding PCIe functions to VFIO

Above section binds PFs and VFs with igb_uio driver. However, one can also bind to vfio-pci driver, which provides more secure and IOMMU protected access to device.

Boot argument changes for VFIO

To bind with VFIO, enable IOMMU in the grub file as below

Update /etc/default/grub file as below.

GRUB_CMDLINE_LINUX="default_hugepagesz=1GB hugepagesz=1G hugepages=20 iommu=on intel_iommu=on"

Execute the following command to modify the /boot/grub/grub.cfg with the configuration set in the above steps and permanently add them to the kernel command line.

update-grub

Reboot host system after making the above modifications.

Driver binding with vfio-pci

For linux kernel versions prior to 5.0, vfio-pci doesn’t provide sysfs interface to enable VFs. Hence, we first bind PFs with igb_uio and enable VFs and then unbind from igb_uio to bind with vfio-pci.

Execute steps 1 to 6 of Running the DPDK software test application to bind PFs with igb_uio and enable VFs.

Now, unbind PFs from igb_uio driver and bind PFs and VFs to vfio-pci driver.

# ../../usertools/dpdk-devbind.py -u 81:00.0
# ../../usertools/dpdk-devbind.py -u 81:00.1
# ../../usertools/dpdk-devbind.py -u 81:00.2
# ../../usertools/dpdk-devbind.py -u 81:00.3

# modprobe vfio-pci
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:00.0
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:00.1
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:00.2
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:00.3
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:00.4
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:08.4
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:10.0
# ../../usertools/dpdk-devbind.py -b vfio-pci 81:17.4

Starting from Linux kernel version 5.7 and later, the vfio-pci module has been enhanced to fully support VFs. This includes the capability to create VFs seamlessly via the sysfs interface after binding the PF to the vfio-pci module. The following steps provide guidance on VF creation.

  1. Generate the VF token by uuid command
14d63f20-8445-11ea-8900-1f9ce7d5650c
  1. Load the vfio-pci module with enable_sriov parameter set
sudo modprobe vfio-pci enable_sriov=1
  1. Alternatively, pass the enable_sriov parameter through the sysfs if the module is already loaded or is built-in:
echo 1 | sudo tee /sys/module/vfio_pci/parameters/enable_sriov
  1. Bind the PCI devices to vfio-pci driver
../../usertools/dpdk-devbind.py -b vfio-pci 0000:81:00.0
  1. Create the desired number of VF devices
echo 2 > /sys/bus/pci/devices/0000:81:00.0/sriov_numvfs

Please note that between Linux kernel versions 5.0 and 5.7, there have been observed issues when unbinding PF after creating VFs using igb_uio. This process may leave SR-IOV enabled, potentially causing compatibility challenges with newer kernel versions that incorporate vfio-pci as a built-in module. To mitigate this concern, we recommend considering an upgrade to kernel version 5.7 or a more recent release, as these versions have addressed these issues and offer improved compatibility.

Controlling and Configuring the QDMA IP

Supported Device arguments (module parameters)

Device specific parameters can be passed to a device by using the ‘-a’ EAL option. Xilinx supports following device arguments to configure PCIe device.

  1. config_bar

    This parameter specifies the PCIe BAR number where QDMA configuration register space is mapped. Valid values are 0 to 5. Default is set to 0 i.e. BAR 0 in the driver.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,config_bar=2 -a 81:00.1,config_bar=4
    

    This example configures BAR 2 as QDMA configuration BAR for device “81:00.0” and BAR 4 as QDMA configuration BAR for device “81:00.1”.

  2. desc_prefetch

    This parameter enables or disables descriptor prefetch on C2H streaming queues. Default is prefetch disabled.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,desc_prefetch=1
    

    This example enables descriptor prefetch on all the streaming C2H queues of the device “81:00.0”.

  3. cmpt_desc_len

    This parameter sets the completion entry length of the completion queue. Valid lengths are 8, 16, 32 and 64 bytes. Default length is 8 bytes.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,cmpt_desc_len=32
    

    This example sets completion entry length to 32 bytes on all the completion queues of the device “81:00.0”.

  4. trigger_mode

    This parameter sets the trigger mode for completion. Possible values for trigger_mode are:

    0 - DISABLE

    1 – Trigger on EVERY event

    2 – Trigger when USER_COUNT threshold is reached

    3 – Trigger when USER defined event is reached

    4 - Trigger when USER_TIMER threshold is reached

    5 - Trigger when either of USER_TIMER or COUNT is reached.

    Default value configured in the driver is 4.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,trigger_mode=1
    

    This example sets the trigger mode to every event for all the completion queues of the device “81:00.0”.

  5. c2h_byp_mode

    This parameter sets the C2H stream mode. Valid values are

    0 - Bypass disabled

    1 - Cache bypass mode

    2 - Simple bypass mode

    Default is internal mode i.e. bypass disabled.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,c2h_byp_mode=2
    

    This example sets simple bypass mode on all the C2H queues belonging to the PCIe device “81:00.0”.

  6. h2c_byp_mode

    This parameter sets the H2C bypass mode. Valid values are

    0 - Bypass disabled

    1 - Bypass enabled

    Default is Bypass disabled.

    Example usage:

    ./build/qdma_testapp -c 0x1f -n 4 -a 81:00.0,h2c_byp_mode=1
    

    This example sets bypass mode on all the H2C queues belonging to the PCIe device “81:00.0”.

CLI support in qdma_testapp

Sample log of the qdma_testapp execution is given below. After running the qdma_testapp, command line prompt appears on the console as shown below.

#./build/qdma_testapp -c 0xf -n 4
QDMA testapp rte eal init...
EAL: Detected 8 lcore(s)
EAL: Probing VFIO support...
EAL: PCI device 0000:81:00.0 on NUMA socket -1
EAL: probe driver: 10ee:903f net_qdma
EAL: PCI device 0000:81:00.1 on NUMA socket -1
EAL: probe driver: 10ee:913f net_qdma
EAL: PCI device 0000:81:00.2 on NUMA socket -1
EAL: probe driver: 10ee:923f net_qdma
EAL: PCI device 0000:81:00.3 on NUMA socket -1
EAL: probe driver: 10ee:933f net_qdma
EAL: PCI device 0000:81:00.4 on NUMA socket -1
EAL: probe driver: 10ee:a03f net_qdma
EAL: PCI device 0000:81:08.4 on NUMA socket -1
EAL: probe driver: 10ee:a13f net_qdma
EAL: PCI device 0000:81:10.0 on NUMA socket -1
EAL: probe driver: 10ee:a23f net_qdma
EAL: PCI device 0000:81:17.4 on NUMA socket -1
EAL: probe driver: 10ee:a33f net_qdma
Ethernet Device Count: 8
Logical Core Count: 4
xilinx-app>

Commands supported by the qdma_testapp CLI

  1. port_init

    This command assigns queues to the port, sets up required resources for the queues, and prepares queues for data processing. Format for this commad is:

    port_init <port-id> <num-queues> <num-st-queues> <ring-depth> <pkt-buff-size>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    num-queues represents the total number of queues to be assigned to the port

    num-st-queues represents the total number of queues to be configured in streaming mode. This implies that the (num-queues - num-st-queues) number of queues has to be configured in memory mapped mode.

    ring-depth represents the number of entries in C2H and H2C descriptor rings of each queue of the port

    pkt-buff-size represents the size of the data that a single C2H or H2C descriptor can support

    Example usage:

    port_init 1 32 16 1024 4096
    

    This example initializes Port 1 with first 16 queues in streaming mode and remaining 16 queues in memory mapped mode. Number of C2H and H2C descriptor ring depth is set to 1024 and data buffer of 4KB supported by each descriptor.

  2. port_close

    This command frees up all the allocated resources and removes the queues associated with the port. Format for this commad is:

    port_close <port-id>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    Example usage:

    port_close 0
    

    This example closes the port 0. Port 0 can again be re-initialized with port_init command after port_close command.

  3. port_reset

    This command resets the DPDK port. This command is supported for VF port only. This command closes the port and re-initializes it with the values provided in this command. Format for this commad is:

    port_reset <port-id> <num-queues> <num-st-queues> <ring-depth> <pkt-buff-size>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    num-queues represents the total number of queues to be assigned to the port

    num-st-queues represents the total number of queues to be configured in streaming mode. This implies that the (num-queues - num-st-queues) number of queues has to be configured in memory mapped mode.

    ring-depth represents the number of entries in C2H and H2C descriptor rings of each queue of the port

    pkt-buff-size represents the size of the data that a single C2H or H2C descriptor can support

    Example usage:

    port_reset 4 32 16 1024 4096
    

    This example command resets the port 4 and re-initializes it with first 16 queues in streaming mode and remaining 16 queues in memory mapped mode. Number of C2H and H2C descriptor ring depth is set to 1024 and data buffer of 4KB supported by each descriptor.

  4. port_remove

    This command frees up all the resources allocated for the port and removes the port from application use. User will need to restart the application to use the port again. Format for this commad is:

    port_remove <port-id>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    Example usage:

    port_remove 4
    

    This example removes the port 4. Restart the application to use port 4 again.

  5. dma_to_device

    This command is used to DMA the data from host to card. Format for this commad is:

    dma_to_device <port-id> <num-queues> <input-filename> <dst-addr> <size> <iterations>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    num-queues represents the total number of queues to use for transmitting the data

    input-filename represents the path to a valid binary data file, contents of which needs to be DMA’ed

    dst-addr represents the destination address (offset) in the card to where DMA should be done in memory mapped mode. This field is ignored for streaming mode queues.

    size represents the amount of data in bytes that needs to be transmitted to the card from the given input file. Data will be segmented across queues such that the total data transferred to the card is size amount

    iterations represents the number of loops to repeat the same DMA transfer

    Example usage:

    dma_to_device 0 2048 mm_datafile_1MB.bin 0 524288 0
    

    This example segments the (524288) bytes from the mm_datafile_1MB.bin file equally to 2048 queues and transmits the segmented data on each queue starting at destination BRAM offset 0 for 1st queue, offset (1*524288)/2048 for 2nd queue, and so on.

  6. dma_from_device

    This command is used to DMA the data from card to host. Format for this commad is:

    dma_from_device <port-id> <num-queues> <output-filename> <src-addr> <size> <iterations>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    num-queues represents the total number of queues to use to receive the data

    output-filename represents the path to output file to dump the received data

    src-addr represents the source address (offset) in the card from where DMA should be done in memory mapped mode. This field is ignored for streaming mode queues.

    size represents the amount of data in bytes that needs to be received from the card. Data will be segmented across queues such that the total data transferred from the card is size amount

    iterations represents the number of loops to repeat the same DMA transfer

    Example usage:

    dma_from_device 0 2048 port0_qcount2048_size524288.bin 0 524288 0
    

    This example receives 524288 bytes from 2048 queues and writes to port0_qcount2048_size524288.bin file. 1st queue receives (524288/2048) bytes of data from BRAM offset 0, 2nd queue receives (524288/2048) bytes of data from BRAM offset (1*524288)/2048, and so on.

  7. reg_read

    This command is used to read the specified register. Format for this commad is:

    reg_read <port-id> <bar-num> <address>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    bar-num represents the PCIe BAR where the register is located and this input value should be a decimal value in the range 0-5

    address represents offset of the register in the PCIe BAR bar-num

  8. reg_write

    This command is used to write a 32-bit value to the specified register. Format for this commad is:

    reg_write <port-id> <bar-num> <address> <value>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    bar-num represents the PCIe BAR where the register is located and this input value should be a decimal value in the range 0-5

    address represents offset of the register in the PCIe BAR bar-num

    value represents the value to be written at the register offset address

  9. reg_dump

    This command dumps important QDMA registers values of the given port on console. Format for this commad is:

    reg_dump <port-id>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

  10. reg_info_read

    This command reads the field info for the specified number of registers of the given port on console. Format for this commad is:

    reg_info_read <port-id> <reg-addr> <num-regs>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    reg-addr represents offset of the register in the PCIe BAR bar-num

    rum-regs represents number of registers to read

  11. queue_dump

    This command dumps the queue context of the specified queue number of the given port. Format for this commad is:

    queue_dump <port-id> <queue-id>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    queue-id represents the queue number relative to the port, whose context information needs to be logged

  12. desc_dump

    This command dumps the descriptors of the C2H and H2C rings of the specified queue number of the given port. Format for this commad is:

    desc_dump <port-id> <queue-id>
    

    port-id represents a logical numbering for PCIe functions in the order they are bind to igb_uio driver. The first PCIe function that is bound has port id as 0.

    queue-id represents the queue number relative to the port, whose C2H and H2C ring descriptors needs to be dumped

  13. load_cmds

    This command executes the list of CLI commands from the given file. Format for this commad is:

    load_cmds <file_name>
    

    file_name represents path to a valid file containing list of above described CLI commands to be executed in sequence.

  14. help

    This command dumps the help menu with supported commands and their format. Format for this commad is:

    help
    
  15. ctrl+d

    The keyboard keys Ctrl and D when pressed together quits the application.

Virtual Machine (VM) execution and test

Assuming that the VM image has been created with the settings outlined in Table Guest System Configuration, follow below steps to execute qdma_testapp on VM.

  1. On host system, update /etc/default/grub file as below to enable IOMMU.

    GRUB_CMDLINE_LINUX="default_hugepagesz=1GB hugepagesz=1G hugepages=20 iommu=on intel_iommu=pt"
    

    Execute the following command to modify the /boot/grub/grub.cfg with the configuration set in the above step and permanently add them to the kernel command line.

    update-grub
    

    Reboot host system after making the above modifications.

  2. On the host system, insert igb_uio driver and bind the PFs with the igb_uio driver

  3. Enable VF(s) on host system by writing the number of VF(s) to enable in max_vfs file under /sys/bus/pci/devices/0000:<bus>:<device>.<function>.

    # echo 1 > /sys/bus/pci/devices/0000\:81\:00.0/max_vfs
    

    lspci should now show new entry for VF

    81:00.4 Memory controller: Xilinx Corporation Device a03f
    
  4. Start qdma_testapp application on host system

  5. Start the VM using below command by attaching the VF (81:00.4 in this example)

    qemu-system-x86_64 -cpu host -enable-kvm -m 15360 -smp sockets=1,cores=10 -hda <vm_image.qcow2>
    -device e1000,netdev=net0 -netdev user,id=net0,hostfwd=tcp::5556-:22 -device vfio-pci,host=81:00.4 -vnc :5556 &
    
  6. Copy the DPDK source code in VM by executing below command from VM.

    scp -r <username>@<host_ip>:<dpdk_sources> <dpdk_copy_path_in_vm>
    
  7. Once the VM is launched, execute steps as outlined in section Building QDMA DPDK Software to build DPDK on VM

  8. Bind the VF device in VM to igb_uio driver and execute qdma_testapp in VM as outlined in section Running the DPDK software test application.

Executing with vfio-pci driver in VM using QEMU

Follow below steps on Host side before attaching VFs to QEMU.

1. Find the Vendor Id (e.g. 10ee) and Device Id (e.g. a03f, a13f) of the VFs being attached to VM using lspci command. Add the Vendor Id and Device Id to vfio-pci and bind the VFs to vfio-pci as below

echo "10ee a03f" > /sys/bus/pci/drivers/vfio-pci/new_id
echo 0000:81:00.4 > /sys/bus/pci/devices/0000:81:00.4/driver/unbind
echo 0000:81:00.4 > /sys/bus/pci/drivers/vfio-pci/bind

echo "10ee a13f" > /sys/bus/pci/drivers/vfio-pci/new_id
echo 0000:81:08.4 > /sys/bus/pci/devices/0000:81:08.4/driver/unbind
echo 0000:81:08.4 > /sys/bus/pci/drivers/vfio-pci/bind
  1. Attach the VFs (here, 81:00.4 and 81:08.4) to VM via vfio-pci using QEMU with below command.
qemu-system-x86_64 -cpu host -enable-kvm -m 15360 -smp sockets=1,cores=10 -hda <vm_image.qcow2>
-device e1000,netdev=net0 -netdev user,id=net0,hostfwd=tcp::5556-:22 -device vfio-pci,host=81:00.4, -device vfio-pci,host=81:08.4 -vnc :5556 &

Notes:

  1. If Qemu fails to launch VM with the above command, with “VF token required for access” error, Update Qemu to version 8.2.4. This version supports passing vf-token to VM. Run below steps to register VF token.
  2. VF token can be passed to DPDK by using EAL parameter - -vfio-vf-token. Run below command to pass vf-token across all PF and VF ports.(Use the uuid generated earlier as outlined under “Driver binding with vfio-pci” section)
./build/qdma_testapp -c 0xf --vfio-vf-token=<uuid> -n 4

After running the above command, exit from the application as VFs need to be attached to VM. vf-token should be same for PF’s and VF’s.

  1. Now launch VM using following command with vf-token parameter
qemu-system-x86_64 -cpu host -enable-kvm -m 15360 -smp sockets=1,cores=10 -hda <vm_image.qcow2> -device e1000,netdev=net0 -netdev user,id=net0,hostfwd=tcp::5556-:22
-device vfio-pci,host=0000:81:00.4,vf-token=<uuid>, -device vfio-pci,host=0000:81:08.4,vf-token=<uuid> -vnc :5556 &

Before running any dpdk application in VM, ensure to run qdma_testapp on host system with vf-token as shown in step 2, to handle mailbox events.

Follow below steps inside VM to bind the functions with vfio-pci

  1. Once the VM is launched, execute steps as outlined in section Building QDMA DPDK Software to build DPDK on VM
  2. Enable noiommu mode to support vfio-pci driver in VM as there won’t be physical IOMMU device present inside VM
modprobe vfio-pci
echo 1 > /sys/module/vfio/parameters/enable_unsafe_noiommu_mode
  1. Bind the VF device in VM to vfio-pci driver and execute qdma_testapp in VM as outlined in section Running the DPDK software test application.