Roadmap

The AMD Inference Server is in active development and this is the tentative and non-exhaustive roadmap of features we would like to add. Of course, this is subject to change based on our own assessment and on feedback from the community, both of which may affect which features take priority over others. More detailed information about the work that’s ongoing and/or completed can be found in the change log and the Github roadmap.

2022 Q1

  • gRPC support (series of commits starting in @37a6aad)

2022 Q2

  • ZenDNN CPU support (#17 and #21)

  • Official integration with KServe (KServe website #179)

2022 Q3

  • GPU support (#34)

Future

  • Refactor memory model

  • Expanded testing with models in Vitis AI model zoo

  • Benchmarking with MLPerf