Multiple Devices¶
This example show how to take advantage of multiple FPGAs on a system. It will show how to initialized an OpenCL context, allocate memory on the two devices and execute a kernel on each FPGA.
KEY CONCEPTS: OpenCL Host APIs, Multi-FPGA Execution, Event Handling
KEYWORDS: xcl::get_xil_devices
This example demonstrates how multiple FPGA devices can be configured on a system.
OpenCL context and queues are initialized for each FPGA.
contexts[d] =cl::Context(devices[d], props, nullptr, nullptr, &err);
queues[d] = cl::CommandQueue(contexts[d], devices[d], CL_QUEUE_PROFILING_ENABLE, &err);
Each FPGA device needs to be loaded and programmed with binary file and a kernel each is created for FPGAs on the system.
fileBuf[d] = xcl::read_binary_file(binaryFile, fileBufSize);
bins[d].push_back({fileBuf[d], fileBufSize});
programs[d] = load_cl2_binary(bins[d], devices[d], contexts[d]);
kernels[d] = cl::Kernel(programs[d], "vadd", &err);
Buffers are also created for each FPGA seperately.
buffer_a[d] = cl::Buffer(contexts[d], CL_MEM_USE_HOST_PTR | CL_MEM_READ_ONLY, size_per_device, &A[offset], &err);
Following table summarizes the observations while running the design on 1 and 2 U50 platforms:
Device Count |
Time Taken(s) |
---|---|
1 |
47.41 |
2 |
23.71 |
Speedup |
1.99 |
EXCLUDED PLATFORMS:
All NoDMA Platforms, i.e u50 nodma etc
DESIGN FILES¶
Application code is located in the src directory. Accelerator binary files will be compiled to the xclbin directory. The xclbin directory is required by the Makefile and its contents will be filled during compilation. A listing of all the files in this example is shown below
src/host.cpp
src/vector_addition.cpp
Access these files in the github repo by clicking here.
COMMAND LINE ARGUMENTS¶
Once the environment has been configured, the application can be executed by
./multiple_devices <vector_addition XCLBIN> <vector_addition XCLBIN>