Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Unreleased

Added

  • HTTP/REST C++ client (@cbf33b8)

  • gRPC API based on KServe v2 API (@37a6aad and others)

  • TensorFlow/PyTorch + ZenDNN backend (#17 and #21)

  • ‘ServerMetadata’ endpoint to the API (@7747911)

  • ‘modelList’ endpoint to the API (@7477b7d)

  • Parse JSON data as string in HTTP body (@694800e)

  • Directory monitoring for model loading (@6459797)

  • ‘ModelMetadata’ endpoint to the API (@22b9d1a)

  • MIGraphX backend (#34)

  • Pre-commit for style verification (@048bdd7)

Changed

  • Use Pybind11 to create Python API (#20)

  • Two logs are now created: one for the server and one for the client

  • Logging macro is now PROTEUS_LOG_*

  • Loading workers is now case-insensitive (@14ed4ef and @90a51ae)

  • Build AKS from source (@e04890f)

  • Use consistent custom exceptions (#30)

  • Update Docker build commands to opt-in to all backends (#43)

Fixed

  • Get the right request size in the batcher when enqueuing with the C++ API (@d1ad81d)

  • Construct responses correctly in the XModel worker if there are multiple input buffers (@d1ad81d)

  • Populate the right number of offsets in the hard batcher (@6666142)

  • Calculate offset values correctly during batching (@8c7534b)

  • Get correct library dependencies for production container (@14ed4ef)

  • Correctly throw an exception if a worker gets an error during initialization (#29)

  • Detect errors in HTTP client during loading (@99ffc33)

  • Construct batches with the right sizes (#57)

0.1.0 - 2022-02-08

Added

  • Core inference server functionality

  • Batching support

  • Support for running multiple workers simultaneously

  • Support for different batcher and buffer implementations

  • XModel support

  • Logging, metrics and tracing support

  • REST API based on KServe v2 API

  • C++ API

  • Python library for REST

  • Documentation, examples, and some tests

  • Experimental GUI