v2020.2 Release Notes¶
Vitis HPC Library¶
FPGA-accelerated library for HPC workloads. Initial release focuses on Seismic Imaging & Geophysics Simulation use-cases.
- Reverse Time Migration (RTM) – Seismic imaging technique for accurate representation of subsurface
- High-precision Multi-layer Perceptron (MLP) - Reconstruction of subsurface properties using seismic reflection data (Seismic Inversion)
- Optimized for single precision floating point data types (FP32) which is a key requirement within HPC applications
- Version 1 of the library offers the following:
- L1 Stencil primitive, L1 MLP activation functions including Sigmoid, Relu, and Prelu
- L2 2D RTM forward kernel, 2D RTM backward kernels, and 3D RTM forward kernel
- L3 2D RTM APIs for supporting shot parallelism
Vitis Vision Library¶
- 2020.2 ISP Pipeline Example Design Enhancements
- Supports pixel depth up to 16 bits
- HDR Merge in Bayer Domain (without deghosting support)
- Local tone mapping
- Auto Exposure Correction
- Quantization & Dithering
- Color Correction Matrix
- Black Level Correction
- Lens Shading correction
- 3D Look Up Table
- New Library Functions
- Feature Matching - Brute Force Feature Matching
- Smoothing Functions - Mode Filter
- DNN Pre Processing Functions - blobFromImage
- Edge Detection Function - Laplacian Operator
- Image Segmentation Function - Distance Transform
- Library Infrastructure & Other Enhancements
- All library functions support Alveo U50 platform
- Color Conversion: supporting RGBX or fourth channel support
- Line Stride support in Data Converters
Vitis Data Analytics Library¶
Text Processing APIs.
Two major APIs included - the regular expression match and geo-IP lookup. The former API can be used to extract content from unstructured data like logs, while the latter is often used in processing web logs, to annotate with geographic information by IP address. A demo tool that converts Apache HTTP server log in batch into JSON file is provided with the library.DataFrame APIs for in-memory Data Abstraction
DataFrame is widely popular for in-memory data abstraction in data analytics domain, the DataFrame write and read APIs should enable data analytics kernel developers to store temporal data or interact with open-source software using Apache Arrow DataFrame more easily.
Vitis Graph Library¶
- Single-Source Shortest Path API (singleSourceShortestPath): 2020.2 version now supports the Alveo U50 platform and provides a new output ‘pred32’ for the shortest path information.
- Page Rank APIs: 2020.2 version now supports Alveo U50 platform and including two APIs both named ‘pageRankTop’ - One to leverage a single memory channel and the other to utilize multi-bank memories.
- Similarity APIs: 3 new APIs to cover different applications: .‘denseSimilarityKernel’ is for dense graph applications, ‘sparseSimilarityKernel’ for Sparse graph applications and ‘generalSimilarityKernel’ for both types of applications with single kernel.
- The following APIs now support Alveo U50 platform:
- Breadth-First search bfs API (bfs)
- Degree calculation API (calcuDegree)
- Connected component API (connectedComponents)
- Converting format from CSC to CSR API (convertCsrCsc)
- Label propagation API (labelPropagation)
- Strongly connected component API (stronglyConnectedComponents)
- Triangle count API (triangleCount)
Vitis BLAS Library¶
- New L2 GEMM Kernel
- For FP32 data types, the L3 GEMM performance has been improved from 280 GFLOPS to 340 GFLOPS
Vitis Sparse Library¶
- Introduced FP32 L2 CSCMV kernel (sparse matrix vector multiplication for CSC - Compressed Sparse Column - format matrices) that utilizes 16 HBM channel support on the Alveo U280 accelerator card.
Vitis Database Library¶
- The 2020.2 release brings a major enhancements and updates to the General Query Engine (GQE) kernel design, and brand-new Level 3 APIs for JOIN and GROUP-BY AGGREGATE.
- Columns as Input Buffers: The GQE kernels treat each column as an input buffer, simplifying the data preparation in the host code.
Additionally, allocating multiple buffers on host side will reduce out-of-memory issues compared to big contiguous memory allocations, especially when the server is under heavy load. - Command Classes for generating Configuration bits : The L2 layer now provides command classes to generate the configuration bits for GQE kernels.
Developers no longer have to dive into the bitmap table to understand which bit(s) to toggle to enable or disable a function in GQE pipeline. Thus, the host code can be more sustainable and less error-prone. - New Level-3 APIs: New experimental L3 APIs for JOIN and GROUP-BY AGGREGATE are built to scale the problem size that GQE can handle.
They can breakdown the tables into parts based on hash and call the GQE kernels multiple rounds in a well-schedule fashion. The strategy of execution is separated from execution, so database gurus can fine-tune the execution based on table statistics, without messing with the OpenCL execution part.
- Columns as Input Buffers: The GQE kernels treat each column as an input buffer, simplifying the data preparation in the host code.
Vitis Compression Library¶
- LIBZ Library Acceleration using Alveo U50
- Seamless acceleration of libz standard APIs : deflate, compress2 and uncompress
- Ready-to-use libz.so library to accelerate any host code without any code change
- xzlib standalone executable for both gzip/zlib compress & decompress
- ZSTD Decompression : New implementation of Facebook ZSTD algorithm available
- Snappy Dual Core Kernel : New implementation of Google snappy Dual Core decompression algorithm achieves 2x throughput improvement for single file decompress.
- GZIP Compress Kernel: New GZIP Quad Core Compress Kernel (in-built , LZ77 , TreeGen, Huffman encoder) implementation available. More than 20% reduction in overall resources and 50% reduction in DDR bandwidth requirement.
- GZIP Compress Streaming Kernel: Fully standard compliance GZIP(include header & footer) implementation available, streaming free running kernels.
- GZIP/ZLIB L3 Application on Alveo U50: GZIP/ZLIB Application available as an L3 API , optimized for Alveo U50 (HBM) and Alveo U250 cards. Single FPGA binary (xclbin) supports both zlib & gzip format for compress and uncompress
- Support for to Alveo U50 : Library functions (LZ4, Snappy, GZIP, ZLIB) ported to support the Alveo U50 platform.
- Low Latency GZIP/ZLIB Decompress : Initial decompression latency reduced from 5K to 2.5K for 4KB/8KB/16KB block sizes
Vitis DSP Library¶
- APIs revised to fully support Vitis HLS compiler
Vitis Security Library¶
- New Signature Generation and Verification Algorithms: DSA, ECC, ECDSA(secp256k1) and EdDSA(ed25519)
- New Checksum Algorithms: Adler32 and CRC32.
- Verifiable delay function (VDF) evaluation and verification: Pietrzak's VDF and Wesolowski's VDF.
- Commercial Cryptography constituted by CAS: SM2, SM3 and SM4.
- Stream Cipher: XChacha20.
- Optimization on RSA, GMAC, AES-GCM and SHA3 to improve their performance and resource utilization.
Vitis Utilities Library¶
- Argument parser (Beta): Parses the options and flags passed from command line and offers automatic help information generation enabling developers to create unified experience on test cases and user applications.
- FIFO multiplexer: This module wraps around a FIFO (implemented through hls::stream in kernel code ) to enable passing data of different type through the same hardware resource. When the data is too wide, it will automatically be transferred using multiple cycles. This module is expected to make the dataflow code more compact and readable.