Entity Resolution Use Case Overview
In this use case, two data sources are used. The first data source is a reference CSV file with a list of last names. The second data source is a new CSV file with a new list of last names that slightly differ from the reference. The table below shows some examples:
| Reference Name | New Name | Levenshtein Edit Distance |
|---|---|---|
| HIGGINBOTHAM | HJGGJNCOTHAM | 3 |
| STRINGFELLOW | STRINGFELMOW | 1 |
| VANLANDINGHAM | VAOLBNDINGHAM | 2 |
The reference names are loaded onto the Xilinx Alveo card first. The list of new names is then sent to the card and matched against the reference names using a user-specified threshold. If the Levenshtein edit distance between a new name and a reference name is within the threshold, a match is found.
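
The matching rule can be illustrated with a small CPU-only sketch in Python. This is not the Alveo implementation or its API; the function names `levenshtein` and `find_matches` are hypothetical, and treating "within the threshold" as "less than or equal to the threshold" is an assumption.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the Levenshtein edit distance between a and b.

    Classic dynamic-programming formulation that keeps only two rows.
    """
    # Keep the shorter string in the inner loop to minimise row size.
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution (or match)
            ))
        prev = curr
    return prev[-1]


def find_matches(new_name, reference_names, threshold):
    """Return (reference, distance) pairs with distance <= threshold.

    Assumption: the threshold is inclusive.
    """
    matches = []
    for ref in reference_names:
        distance = levenshtein(new_name, ref)
        if distance <= threshold:
            matches.append((ref, distance))
    return matches


# Reference names taken from the table above.
reference = ["HIGGINBOTHAM", "STRINGFELLOW", "VANLANDINGHAM"]

for name in ["HJGGJNCOTHAM", "STRINGFELMOW", "VAOLBNDINGHAM"]:
    print(name, find_matches(name, reference, threshold=2))
```

With a threshold of 2 in this sketch, `STRINGFELMOW` and `VAOLBNDINGHAM` match their reference names, while `HJGGJNCOTHAM` (distance 3) does not. Raising the threshold to 3 would admit it as well.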