Combine_Bandwidth_Hostmemory¶
This is a simple design that verifies if the platform has basic functionalities It also tests the possible bandwidth between Kernel and Global Memory and validates direct host memory access from kernel.
This example contains verify test, bandwidth test and host memory test kernels to validate FPGA.
In the verify test we have extremely simple HLS C Kernel to verify that the platform has basic functionality. It will make a call to the kernel with an empty global buffer. The kernel will then write the string of characters “Hello World” into the buffer and return. The host will copy this buffer locally and then print out the resulting buffer contents.
In the bandwidth test we try to get the maximum possible bandwidth between Kernel and Global Memory. Extracting Memory Information and generate cfg file:
Platforminfo -j (path to xpfm) > platform_info.json
From the platform_info.json file we pick the memory info
Generate the meta data related to Memory Banks(DDR/HBM/HOST) to platform_bandwidth.cfg file
Using the sp
option in the platform_bandwidth.cfg file AXI-Master Port is connected to the IP.
sp=bandwidth_1.input:DDR[0]
sp=bandwidth_1.output:DDR[0]
hostmemory test is to validate direct host memory access from kernel using slave bridge.
The host allocates a buffer into specific host-only buffer using XCL_MEM_EXT_HOST_ONLY
. The cl_mem_ext_ptr
object needs to be used in cases where memory assignment is done by user explicitly:
cl_mem_ext_ptr_t input_buffer_ext;
input_buffer_ext.flags = XCL_MEM_EXT_HOST_ONLY;
input_buffer_ext.obj = nullptr;
input_buffer_ext.param = 0;
OCL_CHECK(err, input_buffer[i] = new cl::Buffer(context, CL_MEM_READ_WRITE | CL_MEM_EXT_PTR_XILINX, vector_size_bytes,
&input_buffer_ext, &err));
Using the sp
option in the platform_hostmemory.cfg file, AXI-Master Port is connected to the Slave-Bridge IP:
sp=hostmemory.input:HOST[0]
sp=hostmemory.output:HOST[0]
EXCLUDED PLATFORMS:
Alveo U25 SmartNIC
Alveo U30
Alveo U50lv
Alveo U50 gen3x4
All Embedded Zynq Platforms, i.e zc702, zcu102 etc
All Versal Platforms, i.e vck190 etc
AWS VU9P F1
All Platforms with 2019 Version
All Platforms with 2018 Version
Samsung SmartSSD Computation Storage Drive
Samsung U.2 SmartSSD
DESIGN FILES¶
Application code is located in the src directory. Accelerator binary files will be compiled to the xclbin directory. The xclbin directory is required by the Makefile and its contents will be filled during compilation. A listing of all the files in this example is shown below
src/bandwidth.cpp
src/host.cpp
src/hostmemory.cpp
src/verify.cpp
Access these files in the github repo by clicking here.
COMMAND LINE ARGUMENTS¶
Once the environment has been configured, the application can be executed by
./combine_bw_hm.exe platform_test_path