XOCL (PCIe User Physical Function) Driver Interfaces¶
A GEM style driver for Xilinx PCIe based accelerators
File xocl_ioctl.h defines ioctl command codes and associated structures for interacting with xocl PCI driver for Xilinx FPGA platforms.
Device memory allocation is modeled as buffer objects (bo). For each bo driver tracks the host pointer backed by scatter gather list – which provides backing storage on host – and the corresponding device side allocation of contiguous buffer in one of the memory mapped DDRs/BRAMs, etc.
Exection model is asynchronous where execute commands are submitted using command buffers and POSIX poll is used to wait for finished commands. Commands for a compute unit can only be submitted after an explicit context has been opened by the client.
“xocl” driver allows user land to perform mmap on multiple entities distinguished by offset:
- page offset == 0: whole user BAR is mapped
- page offset > 0 and <= 128: one CU reg space is mapped, offset is used as CU index
- page offset >= (4G >> PAGE_SHIFT): one BO is mapped, offset should be obtained from drm_xocl_map_bo()
xocl driver functionality is described in the following table. All the APIs are multi-threading and multi-process safe.
# |
Functionality |
ioctl request code |
data format |
---|---|---|---|
1 |
Allocate buffer on device |
DRM_IOCTL_XOCL_CREATE_BO |
drm_xocl_create_bo |
2 |
Allocate buffer on device with userptr |
DRM_IOCTL_XOCL_USERPTR_BO |
drm_xocl_userptr_bo |
3 |
Prepare bo for mapping into user’s address space |
DRM_IOCTL_XOCL_MAP_BO |
drm_xocl_map_bo |
4 |
Synchronize (DMA) buffer contents in requested direction |
DRM_IOCTL_XOCL_SYNC_BO |
drm_xocl_sync_bo |
5 |
Obtain information about buffer object |
DRM_IOCTL_XOCL_INFO_BO |
drm_xocl_info_bo |
6 |
Update bo backing storage with user’s data |
DRM_IOCTL_XOCL_PWRITE_BO |
drm_xocl_pwrite_bo |
7 |
Read back data in bo backing storage |
DRM_IOCTL_XOCL_PREAD_BO |
drm_xocl_pread_bo |
8 |
Open/close a context on a compute unit on the device |
DRM_IOCTL_XOCL_CTX |
drm_xocl_ctx |
9 |
Create a hw context on a slot for a xclbin on the device |
DRM_IOCTL_XOCL_CREATE_HW_CTX |
drm_xocl_create_hw_ctx |
10 |
Destroy a hw context on a slot for a xclbin on the device |
DRM_IOCTL_XOCL_DESTROY_HW_CTX |
drm_xocl_destroy_hw_ctx |
11 |
Open a context on a compute unit on the device |
DRM_IOCTL_XOCL_OPEN_CU_CTX |
drm_xocl_open_cu_ctx |
12 |
Close a context on a compute unit on the device |
DRM_IOCTL_XOCL_CLOSE_CU_CTX |
drm_xocl_close_cu_ctx |
13 |
Unprotected write to device memory |
DRM_IOCTL_XOCL_PWRITE_UNMGD |
drm_xocl_pwrite_unmgd |
14 |
Unprotected read from device memory |
DRM_IOCTL_XOCL_PREAD_UNMGD |
drm_xocl_pread_unmgd |
15 |
Send an execute job to a compute unit |
DRM_IOCTL_XOCL_EXECBUF |
drm_xocl_execbuf |
16 |
Send an execute job to a compute unit |
DRM_IOCTL_XOCL_HW_CTX_EXECBUF |
drm_xocl_hw_ctx_execbuf |
17 |
Register eventfd handle for MSIX interrupt |
DRM_IOCTL_XOCL_USER_INTR |
drm_xocl_user_intr |
18 |
Update device view with a specific xclbin image |
DRM_IOCTL_XOCL_READ_AXLF |
drm_xocl_axlf |
19 |
Obtain info of bo |
DRM_IOCTL_XOCL_INFO |
drm_xocl_info_bo |
20 |
Obtain bo related statistics |
DRM_IOCTL_XOCL_OCL_USAGE_STAT |
drm_xocl_usage_stat |
21 |
Perform hot reset |
DRM_IOCTL_XOCL_HOT_RESET |
N/A |
22 |
Perform clock scaling |
DRM_IOCTL_XOCL_RECLOCK |
drm_xocl_reclock_info |
23 |
Allocate buffer on host memory |
DRM_IOCTL_XOCL_ALLOC_CMA |
drm_xocl_alloc_cma_info |
24 |
Free host memory buffer |
DRM_IOCTL_XOCL_FREE_CMA |
N/A |
25 |
Copy bo buffers |
DRM_IOCTL_XOCL_COPY_BO |
drm_xocl_copy_bo |
- struct drm_xocl_create_bo
Create buffer object used with DRM_IOCTL_XOCL_CREATE_BO ioctl
Definition
struct drm_xocl_create_bo {
uint64_t size;
uint32_t handle;
uint32_t flags;
uint32_t type;
};
Members
size
Requested size of the buffer object
handle
bo handle returned by the driver
flags
DRM_XOCL_BO_XXX flags
type
The type of bo
- struct drm_xocl_userptr_bo
Create buffer object with user’s pointer used with DRM_IOCTL_XOCL_USERPTR_BO ioctl
Definition
struct drm_xocl_userptr_bo {
uint64_t addr;
uint64_t size;
uint32_t handle;
uint32_t flags;
uint32_t type;
};
Members
addr
Address of buffer allocated by user
size
Requested size of the buffer object
handle
bo handle returned by the driver
flags
DRM_XOCL_BO_XXX flags
type
The type of bo
- struct drm_xocl_map_kern_mem
Map a buffer object to linux kernel memory (sgl or virtual address) used with DRM_IOCTL_XOCL_MAP_KERN_MEM ioctl. WARNING: INTERNAL USE ONLY. NOT FOR PUBLIC CONSUMPTION. For use with Linux kernel space specific IOCTLs.
Definition
struct drm_xocl_map_kern_mem {
uint32_t handle;
uint64_t addr;
uint64_t size;
uint32_t flags;
};
Members
handle
bo handle returned by the driver
addr
Address of sgl or kernel buffer allocated by user
size
Requested size of the buffer object
flags
DRM_XOCL_BO_XXX flags
- struct drm_xocl_map_bo
Prepare a buffer object for mmap used with DRM_IOCTL_XOCL_MAP_BO ioctl
Definition
struct drm_xocl_map_bo {
uint32_t handle;
uint32_t pad;
uint64_t offset;
};
Members
handle
bo handle
pad
Unused
offset
‘Fake’ offset returned by the driver which can be used with POSIX mmap
- struct drm_xocl_sync_bo
Synchronize the buffer in the requested direction between device and host used with DRM_IOCTL_XOCL_SYNC_BO ioctl
Definition
struct drm_xocl_sync_bo {
uint32_t handle;
uint32_t flags;
uint64_t size;
uint64_t offset;
enum drm_xocl_sync_bo_dir dir;
};
Members
handle
bo handle
flags
Unused
size
Number of bytes to synchronize
offset
Offset into the object to synchronize
dir
DRM_XOCL_SYNC_DIR_XXX
- struct drm_xocl_sync_bo_cb
Synchronize the buffer in the requested direction between device and host used with DRM_IOCTL_XOCL_SYNC_BO_CB ioctl (linux kernel only) WARNING: INTERNAL USE ONLY. NOT FOR PUBLIC CONSUMPTION. For use with Linux kernel space specific IOCTLs.
Definition
struct drm_xocl_sync_bo_cb {
uint32_t handle;
uint32_t flags;
uint64_t size;
uint64_t offset;
enum drm_xocl_sync_bo_dir dir;
uint64_t cb_func;
uint64_t cb_data;
};
Members
handle
bo handle
flags
Unused
size
Number of bytes to synchronize
offset
Offset into the object to synchronize
dir
DRM_XOCL_SYNC_DIR_XXX
cb_func
Pointer to callback function(void(*fn)(long,int))
cb_data
Pointer to context that callback needs to be invoked with
- struct drm_xocl_info_bo
Obtain information about an allocated buffer obbject used with DRM_IOCTL_XOCL_INFO_BO IOCTL
Definition
struct drm_xocl_info_bo {
uint32_t handle;
uint32_t flags;
uint64_t size;
uint64_t paddr;
};
Members
handle
bo handle
flags
Flags passed while creating BO (out)
size
Size of buffer object (out)
paddr
Physical address (out)
- struct drm_xocl_copy_bo
Device memory to memory copy bo used with DRM_IOCTL_XOCL_COPY_BO IOCTL
Definition
struct drm_xocl_copy_bo {
uint32_t dst_handle;
uint32_t src_handle;
uint64_t size;
uint64_t dst_offset;
uint64_t src_offset;
};
Members
dst_handle
dst bo handle
src_handle
src bo handle
size
bo size in bytes
dst_offset
dst offset
src_offset
src offset
- struct drm_xocl_set_cu_range
Set CU range used with DRM_IOCTL_XOCL_SET_CU_READONLY_RANGE
Definition
struct drm_xocl_set_cu_range {
uint32_t cu_index;
uint32_t start;
uint32_t size;
};
Members
cu_index
Index of the compute unit in the device inage for which
start
Start offset of the range
size
Size of the range
- struct argument_info
Kernel argument information
Definition
struct argument_info {
char name;
uint32_t offset;
uint32_t size;
uint32_t dir;
};
Members
name
argument name
offset
argument offset in CU
size
argument size in bytes
dir
input or output argument for a CU
- struct kernel_info
Kernel information
Definition
struct kernel_info {
char name;
uint32_t range;
int anums;
int features;
struct argument_info args;
};
Members
name
kernel name
range
kernel register range
anums
number of argument
features
number of argument
args
argument array
- struct drm_xocl_kinfo_bo
Get a buffer object’s kernel virtual address used with DRM_IOCTL_XOCL_KINFO_BO ioctl. WARNING: INTERNAL USE ONLY. NOT FOR PUBLIC CONSUMPTION. For use with Linux kernel space specific IOCTLs.
Definition
struct drm_xocl_kinfo_bo {
uint32_t handle;
uint32_t flags;
uint64_t size;
uint64_t paddr;
uint64_t vaddr;
};
Members
handle
bo handle of BO whose info is required
flags
Unused
size
Size of buffer object
paddr
Physical address (BO’s Device address)
vaddr
Kernel Virtual address of BO
- struct drm_xocl_kds
KDS user configuration
Definition
struct drm_xocl_kds {
uint32_t slot_size;
uint32_t ert:1;
uint32_t polling:1;
uint32_t cu_dma:1;
uint32_t cu_isr:1;
uint32_t cq_int:1;
uint32_t dataflow:1;
uint32_t rw_shared:1;
};
Members
slot_size
CQ slot size
ert
enable embedded HW scheduler
polling
poll for command completion
cu_dma
enable CUDMA custom module for HW scheduler
cu_isr
enable CUISR custom module for HW scheduler
cq_int
enable interrupt from host to HW scheduler
dataflow
enable dataflow mode
rw_shared
allow xclRegWrite/xclRegRead access shared CU
- struct drm_xocl_axlf
load xclbin (AXLF) device image used with DRM_IOCTL_XOCL_READ_AXLF ioctl
Definition
struct drm_xocl_axlf {
struct axlf * xclbin;
int ksize;
char * kernels;
struct drm_xocl_kds kds_cfg;
uint32_t flags;
};
Members
xclbin
Pointer to user’s xclbin structure in memory
ksize
size of kernels in bytes
kernels
pointer of argument array
kds_cfg
kds configuration
flags
flags passed while programming xclbin
NOTE
This ioctl will be removed in next release
- struct drm_xocl_pwrite_bo
Update bo with user’s data used with DRM_IOCTL_XOCL_PWRITE_BO ioctl
Definition
struct drm_xocl_pwrite_bo {
uint32_t handle;
uint32_t pad;
uint64_t offset;
uint64_t size;
uint64_t data_ptr;
};
Members
handle
bo handle
pad
Unused
offset
Offset into the buffer object to write to
size
Length of data to write
data_ptr
User’s pointer to read the data from
- struct drm_xocl_pread_bo
Read data from bo used with DRM_IOCTL_XOCL_PREAD_BO ioctl
Definition
struct drm_xocl_pread_bo {
uint32_t handle;
uint32_t pad;
uint64_t offset;
uint64_t size;
uint64_t data_ptr;
};
Members
handle
bo handle
pad
Unused
offset
Offset into the buffer object to read from
size
Length of data to read
data_ptr
User’s pointer to write the data into
- struct drm_xocl_ctx
Open or close a context on a compute unit on device used with DRM_XOCL_CTX ioctl
Definition
struct drm_xocl_ctx {
enum drm_xocl_ctx_code op;
xuid_t xclbin_id;
uint32_t cu_index;
uint32_t flags;
uint32_t handle;
};
Members
op
Alloc or free a context (XOCL_CTX_OP_ALLOC_CTX/XOCL_CTX_OP_FREE_CTX)
xclbin_id
UUID of the device image (xclbin)
cu_index
Index of the compute unit in the device inage for which the request is being made
flags
Shared or exclusive context (XOCL_CTX_SHARED/XOCL_CTX_EXCLUSIVE)
handle
Unused
- struct drm_xocl_create_hw_ctx
Create a hw context on a slot on device used with DRM_XOCL_CREATE_HW_CTX ioctl
Definition
struct drm_xocl_create_hw_ctx {
struct drm_xocl_axlf * axlf_ptr;
uint32_t qos;
uint32_t hw_context;
};
Members
axlf_ptr
axlf pointer which need to download
qos
QOS information
hw_context
Returns Context handle
- struct drm_xocl_destroy_hw_ctx
Close/Destroy a hw context on a slot on device used with DRM_XOCL_DESTROY_HW_CTX ioctl
Definition
struct drm_xocl_destroy_hw_ctx {
uint32_t hw_context;
};
Members
hw_context
Context handle which need to close
- struct drm_xocl_open_cu_ctx
Open a cu context under a hw context on device used with DRM_XOCL_OPEN_CU_CTX ioctl
Definition
struct drm_xocl_open_cu_ctx {
uint32_t hw_context;
char cu_name;
uint32_t flags;
uint32_t cu_index;
};
Members
hw_context
Open CU under this hw Context handle
cu_name
Name of the compute unit in the device image for which the open request is being made
flags
Shared or exclusive context (XOCL_CTX_SHARED/XOCL_CTX_EXCLUSIVE)
cu_index
Return the acquired CU index. This will require for close cu context
- struct drm_xocl_close_cu_ctx
Open a cu context under a hw context on device used with DRM_XOCL_CLOSE_CU_CTX ioctl
Definition
struct drm_xocl_close_cu_ctx {
uint32_t hw_context;
uint32_t cu_index;
};
Members
hw_context
Open CU under this hw Context handle
cu_index
Index of the compute unit in the device image for which the close request is being made
- struct drm_xocl_pwrite_unmgd
unprotected write to device memory used with DRM_IOCTL_XOCL_PWRITE_UNMGD ioctl
Definition
struct drm_xocl_pwrite_unmgd {
uint32_t address_space;
uint32_t pad;
uint64_t paddr;
uint64_t size;
uint64_t data_ptr;
};
Members
address_space
Address space in the DSA; currently only 0 is suported
pad
Unused
paddr
Physical address in the specified address space
size
Length of data to write
data_ptr
User’s pointer to read the data from
- struct drm_xocl_pread_unmgd
unprotected read from device memory used with DRM_IOCTL_XOCL_PREAD_UNMGD ioctl
Definition
struct drm_xocl_pread_unmgd {
uint32_t address_space;
uint32_t pad;
uint64_t paddr;
uint64_t size;
uint64_t data_ptr;
};
Members
address_space
Address space in the DSA; currently only 0 is valid
pad
Unused
paddr
Physical address in the specified address space
size
Length of data to write
data_ptr
User’s pointer to write the data to
- struct drm_xocl_usage_stat
obtain device memory usage and DMA statistics used with DRM_IOCTL_XOCL_USAGE_STAT ioctl
Definition
struct drm_xocl_usage_stat {
unsigned dma_channel_count;
unsigned mm_channel_count;
uint64_t h2c;
uint64_t c2h;
struct drm_xocl_mm_stat mm;
};
Members
dma_channel_count
How many DMA channels are present
mm_channel_count
How many storage banks (DDR) are present
h2c
Total data transferred from host to device by a DMA channel
c2h
Total data transferred from device to host by a DMA channel
mm
BO statistics for a storage bank (DDR)
- struct drm_xocl_execbuf
Submit a command buffer for execution on a compute unit used with DRM_IOCTL_XOCL_EXECBUF ioctl
Definition
struct drm_xocl_execbuf {
uint32_t ctx_id;
uint32_t exec_bo_handle;
uint32_t deps;
};
Members
ctx_id
Pass 0
exec_bo_handle
BO handle of command buffer formatted as ERT command
deps
Upto MAX_DEPENT_CMD_BO dependency command BO handles this command is dependent on for automatic event dependency handling by ERT
- struct drm_xocl_hw_ctx_execbuf
Submit a command buffer for execution on a compute unit used with DRM_IOCTL_XOCL_HW_CTX_EXECBUF ioctl with new hw context parameters
Definition
struct drm_xocl_hw_ctx_execbuf {
uint32_t hw_ctx_id;
uint32_t exec_bo_handle;
uint32_t deps;
};
Members
hw_ctx_id
Pass the HW Context id
exec_bo_handle
BO handle of command buffer formatted as ERT command
deps
Upto MAX_DEPENT_CMD_BO dependency command BO handles this command is dependent on for automatic event dependency handling by ERT
- struct drm_xocl_execbuf_cb
Submit a command buffer for execution on a compute unit used with DRM_IOCTL_XOCL_EXECBUF_CB ioctl with a callback (linux kernel only) WARNING: INTERNAL USE ONLY. NOT FOR PUBLIC CONSUMPTION. For use with Linux kernel space specific IOCTLs.
Definition
struct drm_xocl_execbuf_cb {
uint32_t ctx_id;
uint32_t exec_bo_handle;
uint32_t deps;
uint64_t cb_func;
uint64_t cb_data;
};
Members
ctx_id
Pass 0
exec_bo_handle
BO handle of command buffer formatted as ERT command
deps
Upto 8 dependency command BO handles this command is dependent on for automatic event dependency handling by ERT
cb_func
Pointer to callback function(void (*fn)(long,int)) upon exec completion
cb_data
Pointer to context that callback needs to be invoked with
- struct drm_xocl_user_intr
Register user’s eventfd for MSIX interrupt used with DRM_IOCTL_XOCL_USER_INTR ioctl
Definition
struct drm_xocl_user_intr {
uint32_t ctx_id;
int fd;
int msix;
};
Members
ctx_id
Pass 0
fd
File descriptor created with eventfd system call
msix
User interrupt number (0 to 15)
- struct drm_xocl_reclock_info
perform clock scaling
Definition
struct drm_xocl_reclock_info {
unsigned region;
unsigned short ocl_target_freq;
};
Members
region
Region
ocl_target_freq
clock scacling request array
- struct drm_xocl_alloc_cma_info
Alloc buffer on host memory
Definition
struct drm_xocl_alloc_cma_info {
uint64_t total_size;
uint64_t entry_num;
uint64_t * user_addr;
};
Members
total_size
total size
entry_num
number of entries
user_addr
user space address