PLDM Telemetry¶
Alveo™ supports PLDM Over MCTP Over SMBus at slave address 0x30 (8-bit). The latest sensor information is stored locally in SC FW and is reported to server BMC via PLDM Type-2 commands. Sensor information is reported to the BMC via the platform descriptor record (PDR).
PLDM Over MCTP Over SMBus Protocol
Alveo™ PLDM implementation adheres to the following DMTF specifications:
- PLDM Base Specification ([DSP0240])
- PLDM for Platform Monitoring and Control Specification ([DSP0248])
List of supported PLDM Type-2 commands
Numeric Sensor commands
- SetNumericSensorEnable
- GetSensorReading
- GetSensorThresholds
Effecter commands
- SetNumericEffecterEnable
- SetNumericEffecterValue
- GetNumericEffecterValue
NOTE: Effecter commands are supported only in Alveo™ MA35D product
PDR repository commands
- GetPDRRepositoryInfo
- GetPDR
- GetPDRRepositorySignature
PLDM PDR types
Table: List of supported PDR types
PDR Type Number | PDR Type Name | Notes |
---|---|---|
1 (0x01) | Terminus Locator PDR | |
2 (0x02) | Numeric Sensor PDR | |
6 (0x06) | Sensor Auxiliary Names PDR | |
9 (0x09) | Numeric Effecter PDR | Supported only in MA35D |
13 (0x0D) | Effecter Auxiliary Names PDR | Supported only in MA35D |
PLDM sensors
Table: List of Numeric sensors supported in Alveo™ products
Applicable to Alveo™ U200, U250, U280, U50, U50LV, U30, U55C, U55N, UL3xxx & V70 products.
Sensor ID | Sensor Name | Description |
---|---|---|
Numeric sensor - Thermal | ||
1 (0x01) | FPGA Temperature | Consolidated FPGA die/junction temperature |
2 (0x02) | Board Temperature | Max of (Inlet, outlet, VRM) temperature |
3 (0x03) | QSFP0 Temperature | QSFP0 temperature when present (N/A for U30, U55C, UL3xxx & V70) |
4 (0x04) | QSFP1 Temperature | QSFP1 temperature when present ((N/A for U30, U55C, UL3xxx & V70) |
Table: List of Numeric sensors supported in MA35D product
Sensor ID | Sensor Name | Description |
---|---|---|
Numeric sensor - Thermal | ||
1 (0x01) | ASIC1 Temperature | ASIC device temperature retrieved from ASIC1 |
2 (0x02) | ASIC2 Temperature | ASIC device temperature retrieved from ASIC2 |
3 (0x03) | Inlet Temperature | Board Inlet temperature sensor (placed near IO bracket) |
4 (0x04) | Normalized VRM Temperature | Normalized temperature sensor value from the Voltage Regulators (VRM) |
Numeric sensor - Electrical | ||
5 (0x05) | 3v3 Aux Voltage | |
6 (0x06) | 3v3 Pex Voltage | |
7 (0x07) | 12v Pex Voltage | |
8 (0x08) | ASIC1 Voltage | ASIC voltage retrieved from ASIC1 |
9 (0x09) | ASIC2 Voltage | ASIC voltage retrieved from ASIC2 |
10 (0x0A) | 3v3 Aux Current | |
11 (0x0B) | 3v3 Pex Current | |
12 (0x0C) | 12v Pex Current | |
13 (0x0D) | Total Power | Total Power consumption of the MA35D board |
Numeric sensor - Firmware Versions | ||
14 (0x0E) | SC FW Version | |
15 (0x0F) | ASIC1 ZSP FW Version | |
16 (0x10) | ASIC2 ZSP FW Version | |
27 (0x1B) | ASIC1 eSecure FW Version | |
28 (0x1C) | ASIC2 eSecure FW Version | |
Numeric sensor - ASIC Info - Heartbeat, Errors & PCIe | ||
17 (0x11) | ASIC1 Heartbeat sensor | |
18 (0x12) | ASIC2 Heartbeat sensor | |
19 (0x13) | ASIC1 PCIe Correctable Errors | |
20 (0x14) | ASIC2 PCIe Correctable Errors | |
21 (0x15) | ASIC1 PCIe Uncorrectable Errors | |
22 (0x16) | ASIC2 PCIe Uncorrectable Errors | |
23 (0x17) | ASIC1 ECC Correctable Errors | |
24 (0x18) | ASIC2 ECC Correctable Errors | |
25 (0x19) | ASIC1 ECC Uncorrectable Errors | |
26 (0x1A) | ASIC2 ECC Uncorrectable Errors | |
29 (0x1D) | ASIC1 PCIe Link Speed | |
30 (0x1E) | ASIC2 PCIe Link Speed | |
31 (0x1F) | ASIC1 PCIe Link Width | |
32 (0x20) | ASIC2 PCIe Link Width |
Table: List of Effecters sensors supported in MA35D product
Sensor ID | Sensor Name | Description |
---|---|---|
Effecters - Resets and SPI flash WP config | ||
1 (0x01) | ASIC1 reset | Effector to reset ASIC1; SC resets ASIC1 via GPIO signal |
2 (0x02) | ASIC2 reset | Effector to reset ASIC2; SC resets ASIC1 via GPIO signal |
3 (0x03) | SC reset | Effector to soft reset MSP432 MCU |
4 (0x04) | ASIC1 SPI flash WP set | Effector to enable/disable ASIC1 flash Write Protect; Default: WP enable |
5 (0x05) | ASIC2 SPI flash WP set | Effector to enable/disable ASIC2 flash Write Protect; Default: WP enable |
NOTE: Effecter values range between 0x00 and 0x01. For ASIC/SC resets, sending 0x01 results in reset. For flash Write Protects, sending 0x00 disables WP and 0x01 enables it back.
AMD Support
For support resources such as answers, documentation, downloads, and forums, see the Alveo Accelerator Cards AMD/Xilinx Community Forum.
License
Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License.
You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0
All images and documentation, including all debug and support documentation, are licensed under the Creative Commons (CC) Attribution 4.0 International License (the “CC-BY-4.0 License”); you may not use this file except in compliance with the CC-BY-4.0 License.
You may obtain a copy of the CC-BY-4.0 License at https://creativecommons.org/licenses/by/4.0/
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an “AS IS” BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
XD038 | © Copyright 2023, Advanced Micro Devices Inc.