Benchmark/QoR

This section provides L2 performance benchmarks and QoR (Quality of Results) figures for the AIE DSP library elements in various configurations. The results are extracted from hardware-emulation-based simulations run with the Makefile flow described in: Compiling and Simulating Using the Makefile.

QoR is reported using the following metrics:

  • cycleCountAvg - average number of cycles taken to execute the kernel function (not including kernel/window buffer overheads).
  • throughputAvg - input throughput calculated from cycleCountAvg, taking into account the input window size.
  • initiationInterval - time that must pass between the starts of two consecutive executions of a given function, including overheads, i.e., the time between a function start and its previous start.
  • throughpuInitIntAvg - input throughput calculated from initiationInterval, taking into account the input window size.
  • NUM_BANKS - number of memory banks used by the design
  • NUM_AIE - number of AIE tiles used by the design
  • DATA_MEMORY - total data memory in Bytes used by the design
  • PROGRAM_MEMORY - total program memory in Bytes used by the design
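The relationship between the cycle-based metrics and the throughput metrics can be sketched as follows. This is a minimal illustration, not the library's own reporting script: the function name `throughput_msa` is hypothetical, and the 1000 MHz clock and the truncation to whole MSa/s are assumptions inferred from the figures in the FIR table, not values stated in this section.

```python
def throughput_msa(window_vsize: int, cycles: int, clock_mhz: int = 1000) -> int:
    """Return input throughput in MSa/s, truncated to an integer.

    window_vsize: INPUT_WINDOW_VSIZE, the number of input samples per iteration.
    cycles:       cycleCountAvg or initiationInterval for that iteration.
    clock_mhz:    assumed AIE clock in MHz (1000 is inferred from the table,
                  not a documented value).
    """
    if cycles <= 0:
        raise ValueError("cycles must be positive")
    # samples per cycle * cycles per microsecond = mega-samples per second
    return window_vsize * clock_mhz // cycles


# fir_decimate_asym, cfloat/cfloat, FIR_LEN=21, INPUT_WINDOW_VSIZE=384
print(throughput_msa(384, 9358))  # throughputAvg, from cycleCountAvg
print(throughput_msa(384, 9440))  # throughpuInitIntAvg, from initiationInterval
```

Under these assumptions the two calls give 41 and 40, matching the 41 MSa/s and 40 MSa/s listed for that configuration in the FIR table.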

In addition, QoR metrics are harvested for each kernel the design consists of. For example, a FIR configured to be implemented on two tiles (CASC_LEN=2) will have two sets of figures displayed in the table below (space-delimited).
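Because per-kernel figures share a cell, a row for a cascaded design carries more values than the header has columns. The sketch below shows one way such a row could be split back into fields. The helper `parse_fir_row` and the key `LIBRARY_ELEMENT` are hypothetical names, and the layout it assumes (CASC_LEN values for each cycle, throughput, and PROGRAM_MEMORY column, with single totals for NUM_BANKS, NUM_AIE, and DATA_MEMORY) is inferred from the rows of the FIR table rather than documented here.

```python
FIXED_COLUMNS = [
    "LIBRARY_ELEMENT", "DATA_TYPE", "COEFF_TYPE", "FIR_LEN",
    "INTERPOLATE_FACTOR", "DECIMATE_FACTOR", "INPUT_WINDOW_VSIZE",
    "CASC_LEN", "DUAL_IP", "USE_COEFF_RELOAD",
]


def parse_fir_row(line: str) -> dict:
    """Split one space-delimited FIR benchmark row into named fields."""
    tokens = line.split()
    row = dict(zip(FIXED_COLUMNS, tokens[:len(FIXED_COLUMNS)]))
    n = int(row["CASC_LEN"])  # number of kernels, one set of figures each

    # Drop the unit markers; only the numeric tokens remain.
    values = [int(t) for t in tokens[len(FIXED_COLUMNS):] if t != "MSa/s"]
    if len(values) != 5 * n + 3:
        raise ValueError(f"expected {5 * n + 3} numeric fields, got {len(values)}")

    pos = 0
    for name in ("cycleCountAvg", "throughputAvg",
                 "initiationInterval", "throughpuInitIntAvg"):
        row[name] = values[pos:pos + n]  # one value per kernel
        pos += n
    row["NUM_BANKS"], row["NUM_AIE"], row["DATA_MEMORY"] = values[pos:pos + 3]
    row["PROGRAM_MEMORY"] = values[pos + 3:]  # one value per kernel
    return row


example = ("fir_decimate_asym cint16 int16 99 1 3 384 2 0 0 557 544 "
           "689 705 MSa/s 686 683 559 562 MSa/s 7 2 13184 2526 2308")
parsed = parse_fir_row(example)
print(parsed["cycleCountAvg"], parsed["PROGRAM_MEMORY"])
```

The length check rejects rows that do not fit the assumed layout instead of silently misaligning the fields.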

Filters

The following table gives results for the FIR filters with a wide variety of supported parameters, which are defined in: L2 FIR configuration parameters.

fir_benchmark.csv

FIR benchmark
Library Element DATA_TYPE COEFF_TYPE FIR_LEN INTERPOLATE_FACTOR DECIMATE_FACTOR INPUT_WINDOW_VSIZE CASC_LEN DUAL_IP USE_COEFF_RELOAD cycleCountAvg throughputAvg initiationInterval throughpuInitIntAvg NUM_BANKS NUM_AIE DATA_MEMORY PROGRAM_MEMORY
fir_decimate_asym cfloat cfloat 21 1 3 384 1 0 0 9358 41 MSa/s 9440 40 MSa/s 5 1 11361 4956
fir_decimate_asym cint32 cint32 30 1 3 384 1 0 0 2167 177 MSa/s 2251 170 MSa/s 5 1 11484 3014
fir_decimate_asym cint32 cint16 99 1 3 384 1 0 0 3706 103 MSa/s 3815 100 MSa/s 5 1 13206 4040
fir_decimate_asym cint32 cint16 9 1 3 384 1 0 0 451 851 MSa/s 777 494 MSa/s 5 1 10382 1748
fir_decimate_asym cint32 cint16 30 1 3 384 1 0 0 1125 341 MSa/s 1208 317 MSa/s 5 1 10978 2902
fir_decimate_asym cint32 cint16 21 1 3 384 1 0 0 839 457 MSa/s 927 414 MSa/s 5 1 10718 2184
fir_decimate_asym cint32 cint32 9 1 3 384 1 0 0 741 518 MSa/s 817 470 MSa/s 5 1 10516 2336
fir_decimate_asym cint16 int16 99 1 3 384 2 0 0 557 544 689 705 MSa/s 686 683 559 562 MSa/s 7 2 13184 2526 2308
fir_decimate_asym cint16 int16 99 1 3 384 1 0 0 935 410 MSa/s 1025 374 MSa/s 5 1 7568 2840
fir_decimate_asym cint16 int16 9 1 3 384 1 0 1 150 2560 MSa/s 390 984 MSa/s 8 1 6120 2878
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2210
fir_decimate_asym cint16 int16 9 1 3 192 1 0 0 94 2042 MSa/s 197 974 MSa/s 5 1 4060 2210
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 402 955 MSa/s 5 1 6108 2210
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2226
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2226
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2226
fir_decimate_asym cint16 int16 99 1 3 384 1 0 1 1066 360 MSa/s 1183 324 MSa/s 8 1 7748 3198
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2210
fir_decimate_asym cint32 cint32 99 1 3 384 1 0 0 7271 52 MSa/s 7380 52 MSa/s 5 1 14756 4286
fir_decimate_asym cint32 int16 30 1 3 384 1 0 0 674 569 MSa/s 777 494 MSa/s 5 1 10726 2256
fir_decimate_asym int32 int32 99 1 3 384 1 0 0 1770 216 MSa/s 1860 206 MSa/s 5 1 8342 3798
fir_decimate_asym int32 int32 9 1 3 384 1 0 0 214 1794 MSa/s 391 982 MSa/s 5 1 6222 2698
fir_decimate_asym int32 int32 30 1 3 384 1 0 0 642 598 MSa/s 719 534 MSa/s 5 1 6626 2406
fir_decimate_asym int32 int32 21 1 3 384 1 0 0 485 791 MSa/s 565 679 MSa/s 5 1 6430 1850
fir_decimate_asym int32 int16 99 1 3 384 1 0 0 3086 124 MSa/s 3177 120 MSa/s 5 1 7568 4294
fir_decimate_asym int32 int16 9 1 3 384 1 0 0 238 1613 MSa/s 391 982 MSa/s 5 1 6108 2126
fir_decimate_asym int32 int16 30 1 3 384 1 0 0 1010 380 MSa/s 1089 352 MSa/s 5 1 6374 2996
fir_decimate_asym int32 int16 21 1 3 384 1 0 0 803 478 MSa/s 884 434 MSa/s 5 1 6292 2612
fir_decimate_asym cint32 int16 21 1 3 384 1 0 0 545 704 MSa/s 778 493 MSa/s 5 1 10580 1916
fir_decimate_asym float float 99 1 3 384 1 0 0 10829 35 MSa/s 10920 35 MSa/s 5 1 8599 7274
fir_decimate_asym float float 30 1 3 384 1 0 0 3715 103 MSa/s 3793 101 MSa/s 5 1 6883 3712
fir_decimate_asym float float 21 1 3 384 1 0 0 2625 146 MSa/s 2701 142 MSa/s 5 1 6687 3474
fir_decimate_asym cint32 int32 99 1 3 384 1 0 0 3706 103 MSa/s 3815 100 MSa/s 5 1 13206 4040
fir_decimate_asym cint32 int32 9 1 3 384 1 0 0 451 851 MSa/s 777 494 MSa/s 5 1 10382 1748
fir_decimate_asym cint32 int32 30 1 3 384 1 0 0 1125 341 MSa/s 1208 317 MSa/s 5 1 10978 2902
fir_decimate_asym cint32 int32 21 1 3 384 1 0 0 839 457 MSa/s 927 414 MSa/s 5 1 10718 2184
fir_decimate_asym cint32 int16 99 1 3 384 1 0 0 1806 212 MSa/s 1914 200 MSa/s 5 1 12432 3856
fir_decimate_asym cint32 int16 9 1 3 384 1 0 0 354 1084 MSa/s 776 494 MSa/s 5 1 10268 1676
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2210
fir_decimate_asym cint32 cint32 21 1 3 384 1 0 0 1510 254 MSa/s 1592 241 MSa/s 5 1 11092 3174
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2210
fir_decimate_asym cint16 int16 144 1 2 432 1 0 0 2154 200 MSa/s 2251 191 MSa/s 5 1 9258 3256
fir_decimate_asym cint16 int16 140 1 7 420 1 0 0 1678 250 MSa/s 1776 236 MSa/s 5 1 7906 3226
fir_decimate_asym cint16 int16 140 1 5 420 1 0 0 888 472 MSa/s 985 426 MSa/s 5 1 8098 3232
fir_decimate_asym cint16 int16 12 1 4 384 1 0 0 135 2844 MSa/s 390 984 MSa/s 5 1 5858 1872
fir_decimate_asym cint16 int16 12 1 3 384 1 0 1 192 2000 MSa/s 391 982 MSa/s 8 1 6134 2914
fir_decimate_asym cint16 int16 12 1 3 384 1 0 0 151 2543 MSa/s 390 984 MSa/s 5 1 6114 2230
fir_decimate_asym cint16 int16 12 1 2 384 1 0 0 183 2098 MSa/s 390 984 MSa/s 5 1 6626 1878
fir_decimate_asym cint16 int16 12 1 3 384 1 0 0 151 2543 MSa/s 390 984 MSa/s 5 1 6114 2230
fir_decimate_asym cint16 cint16 99 1 3 384 1 0 0 1691 227 MSa/s 1782 215 MSa/s 5 1 8342 3810
fir_decimate_asym cint16 cint16 9 1 3 384 1 0 0 214 1794 MSa/s 391 982 MSa/s 5 1 6222 2698
fir_decimate_asym cint16 cint16 30 1 3 384 1 0 0 642 598 MSa/s 719 534 MSa/s 5 1 6626 2390
fir_decimate_asym cfloat float 99 1 3 384 1 0 0 38309 10 MSa/s 38418 9 MSa/s 5 1 13477 11006
fir_decimate_asym cfloat float 9 1 3 384 1 0 0 3786 101 MSa/s 3869 99 MSa/s 5 1 10653 2738
fir_decimate_asym cfloat float 30 1 3 384 1 0 0 12139 31 MSa/s 12222 31 MSa/s 5 1 11249 4784
fir_decimate_asym cfloat float 21 1 3 384 1 0 0 8714 44 MSa/s 8795 43 MSa/s 5 1 10989 4162
fir_decimate_asym cfloat cfloat 99 1 3 384 1 0 0 44340 8 MSa/s 44449 8 MSa/s 5 1 15025 12474
fir_decimate_asym cint16 int16 9 1 3 384 1 0 0 150 2560 MSa/s 390 984 MSa/s 5 1 6108 2226
fir_decimate_asym cfloat cfloat 30 1 3 384 1 0 0 12341 31 MSa/s 12426 30 MSa/s 5 1 11753 5274
fir_decimate_asym cint16 int16 144 1 4 432 1 0 0 1128 382 MSa/s 1225 352 MSa/s 5 1 8394 3240
fir_decimate_asym cint16 int16 144 1 6 432 1 0 0 1970 219 MSa/s 2068 208 MSa/s 5 1 8106 3114
fir_decimate_asym cfloat cfloat 9 1 3 384 1 0 0 4589 83 MSa/s 4666 82 MSa/s 5 1 10785 3350
fir_decimate_asym cint16 int16 15 1 3 384 1 0 1 224 1714 MSa/s 392 979 MSa/s 8 1 6156 3142
fir_decimate_asym cint16 int16 15 1 3 384 1 0 0 183 2098 MSa/s 390 984 MSa/s 5 1 6120 2394
fir_decimate_asym cint16 int16 63 1 3 384 1 0 1 742 517 MSa/s 853 450 MSa/s 8 1 7020 2704
fir_decimate_asym cint16 int16 60 1 6 384 1 0 0 523 734 MSa/s 605 634 MSa/s 5 1 6370 2748
fir_decimate_asym cint16 int16 60 1 5 320 1 0 0 354 903 MSa/s 437 732 MSa/s 5 1 5858 2440
fir_decimate_asym cint16 int16 60 1 4 512 1 0 0 674 759 MSa/s 757 676 MSa/s 5 1 7906 2440
fir_decimate_asym cint16 int16 60 1 3 384 1 0 0 611 628 MSa/s 694 553 MSa/s 5 1 6882 2432
fir_decimate_asym cint16 int16 33 1 3 384 1 0 0 421 912 MSa/s 500 768 MSa/s 5 1 6540 2216
fir_decimate_asym cint16 int16 28 1 7 448 1 0 0 310 1445 MSa/s 453 988 MSa/s 5 1 6370 2168
fir_decimate_asym cint16 int16 63 1 7 448 1 0 0 570 785 MSa/s 654 685 MSa/s 5 1 6888 2832
fir_decimate_asym cint16 int16 24 1 6 384 1 0 0 262 1465 MSa/s 391 982 MSa/s 5 1 5786 1806
fir_decimate_asym cint16 int16 237 1 3 768 1 0 0 4169 184 MSa/s 4274 179 MSa/s 5 1 13796 4128
fir_decimate_asym cint16 int16 237 1 3 384 1 0 1 2396 160 MSa/s 2529 151 MSa/s 8 1 10176 4602
fir_decimate_asym cint16 int16 237 1 3 384 1 0 0 2121 181 MSa/s 2226 172 MSa/s 5 1 9700 4128
fir_decimate_asym cint16 int16 21 1 3 384 1 0 1 387 992 MSa/s 494 777 MSa/s 8 1 6320 2648
fir_decimate_asym cint16 int16 18 1 3 384 1 0 1 323 1188 MSa/s 428 897 MSa/s 8 1 6298 2608
fir_decimate_hb cint32 cint32 239 1 2 256 1 0 0 6423 39 MSa/s 6556 39 MSa/s 5 1 15196 4352
fir_decimate_hb cint32 cint32 7 1 2 256 1 0 0 261 980 MSa/s 519 493 MSa/s 5 1 8188 2426
fir_decimate_hb cint32 int16 239 1 2 256 1 0 0 21162 12 MSa/s 21295 12 MSa/s 5 1 12610 7252
fir_decimate_hb cint32 int16 11 1 2 256 1 0 0 1576 162 MSa/s 1655 154 MSa/s 5 1 8186 2446
fir_decimate_hb cint32 int16 15 1 2 256 1 0 0 2343 109 MSa/s 2423 105 MSa/s 5 1 8258 2740
fir_decimate_hb cint16 int16 99 1 2 128 4 0 0 230 196 120 124 556 653 1066 1032 MSa/s 273 270 271 272 468 474 472 470 MSa/s 11 4 16328 2174 1768 1768 1998
fir_decimate_hb cint32 cint32 99 1 2 256 1 0 0 2616 97 MSa/s 2714 94 MSa/s 5 1 10972 4170
fir_decimate_hb cint32 cint32 15 1 2 256 1 0 0 549 466 MSa/s 628 407 MSa/s 5 1 8476 2130
fir_decimate_hb cint32 cint16 15 1 2 256 1 0 0 518 494 MSa/s 598 428 MSa/s 5 1 8320 1808
fir_decimate_hb cint32 cint16 99 1 2 256 1 0 0 2184 117 MSa/s 2282 112 MSa/s 5 1 10288 3314
fir_decimate_hb cint32 cint16 7 1 2 256 1 0 0 167 1532 MSa/s 518 494 MSa/s 5 1 8128 2110
fir_decimate_hb cint32 cint16 239 1 2 256 1 0 0 5047 50 MSa/s 5180 49 MSa/s 5 1 13472 4360
fir_decimate_hb cint32 cint16 11 1 2 256 1 0 0 454 563 MSa/s 533 480 MSa/s 5 1 8208 1948
fir_decimate_hb cint16 int16 99 1 2 256 1 0 1 552 463 MSa/s 713 359 MSa/s 8 1 6102 4230
fir_decimate_hb cint16 int16 99 1 2 256 1 0 0 552 463 MSa/s 639 400 MSa/s 5 1 6058 2374
fir_decimate_hb cint16 int16 99 1 2 128 5 0 0 123 127 230 196 120 1040 1007 556 653 1066 MSa/s 271 273 274 269 271 472 468 467 475 472 MSa/s 13 5 21170 2174 1768 1842 1842 2030
fir_decimate_hb cint32 int16 7 1 2 256 1 0 0 1576 162 MSa/s 1654 154 MSa/s 5 1 8114 2426
fir_decimate_hb cint32 cint32 11 1 2 256 1 0 0 453 565 MSa/s 531 482 MSa/s 5 1 8284 2114
fir_decimate_hb cint32 int16 99 1 2 256 1 0 0 8741 29 MSa/s 8844 28 MSa/s 5 1 9898 4024
fir_decimate_hb float float 99 1 2 256 1 0 0 5333 48 MSa/s 5419 47 MSa/s 5 1 6704 4978
fir_decimate_hb cint32 int32 15 1 2 256 1 0 0 518 494 MSa/s 598 428 MSa/s 5 1 8320 1808
fir_decimate_hb cint16 int16 99 1 2 128 3 0 0 256 222 152 500 576 842 MSa/s 298 295 297 429 433 430 MSa/s 9 3 11934 2358 1952 2106
fir_decimate_hb int32 int32 99 1 2 256 1 0 0 1162 220 MSa/s 1249 204 MSa/s 5 1 6448 2320
fir_decimate_hb int32 int32 7 1 2 256 1 0 0 118 2169 MSa/s 262 977 MSa/s 5 1 4992 2182
fir_decimate_hb int32 int32 27 1 2 256 1 0 0 485 527 MSa/s 562 455 MSa/s 5 1 5296 1816
fir_decimate_hb int32 int32 239 1 2 256 1 0 0 1430 179 MSa/s 1533 166 MSa/s 5 1 8480 3522
fir_decimate_hb int32 int32 23 1 2 256 1 0 0 389 658 MSa/s 465 550 MSa/s 5 1 5216 1800
fir_decimate_hb int32 int16 99 1 2 256 1 0 0 3764 68 MSa/s 3852 66 MSa/s 5 1 6058 3656
fir_decimate_hb int32 int16 7 1 2 256 1 0 0 86 2976 MSa/s 261 980 MSa/s 5 1 4978 2106
fir_decimate_hb cint32 int32 11 1 2 256 1 0 0 454 563 MSa/s 533 480 MSa/s 5 1 8208 1860
fir_decimate_hb int32 int16 239 1 2 256 1 0 0 8316 30 MSa/s 8419 30 MSa/s 5 1 7618 6370
fir_decimate_hb int32 int16 15 1 2 256 1 0 0 103 2485 MSa/s 261 980 MSa/s 5 1 5058 2090
fir_decimate_hb float float 7 1 2 256 1 0 0 295 867 MSa/s 370 691 MSa/s 5 1 5248 2348
fir_decimate_hb float float 239 1 2 256 1 0 0 12290 20 MSa/s 12393 20 MSa/s 5 1 8736 8680
fir_decimate_hb float float 19 1 2 256 1 0 0 1287 198 MSa/s 1364 187 MSa/s 5 1 5456 3096
fir_decimate_hb float float 15 1 2 256 1 0 0 787 325 MSa/s 863 296 MSa/s 5 1 5376 2454
fir_decimate_hb cint32 int32 99 1 2 256 1 0 0 2184 117 MSa/s 2282 112 MSa/s 5 1 10288 3314
fir_decimate_hb cint32 int32 7 1 2 256 1 0 0 167 1532 MSa/s 518 494 MSa/s 5 1 8128 2038
fir_decimate_hb cint32 int32 239 1 2 256 1 0 0 5047 50 MSa/s 5180 49 MSa/s 5 1 13472 4360
fir_decimate_hb int32 int16 19 1 2 256 1 0 0 982 260 MSa/s 1059 241 MSa/s 5 1 5130 2572
fir_decimate_hb cint16 int16 99 1 2 128 2 0 0 264 230 484 556 MSa/s 305 303 419 422 MSa/s 7 2 7988 2472 2242
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cfloat float 239 1 2 256 1 0 0 32114 7 MSa/s 32247 7 MSa/s 5 1 13728 10120
fir_decimate_hb cint16 int16 15 1 2 256 1 0 0 103 2485 MSa/s 261 980 MSa/s 5 1 5058 1956
fir_decimate_hb cint16 int16 15 1 2 256 1 0 0 103 2485 MSa/s 272 941 MSa/s 5 1 5058 1972
fir_decimate_hb cint16 int16 11 1 2 256 1 0 1 71 3605 MSa/s 262 977 MSa/s 8 1 5070 2358
fir_decimate_hb cint16 int16 11 1 2 256 1 0 0 71 3605 MSa/s 261 980 MSa/s 5 1 5050 1904
fir_decimate_hb cint16 cint16 99 1 2 256 1 0 0 1158 221 MSa/s 1245 205 MSa/s 5 1 6448 2336
fir_decimate_hb cint16 cint16 7 1 2 256 1 0 0 118 2169 MSa/s 261 980 MSa/s 5 1 4992 2264
fir_decimate_hb cint16 cint16 27 1 2 256 1 0 0 485 527 MSa/s 562 455 MSa/s 5 1 5296 1816
fir_decimate_hb cint16 cint16 239 1 2 256 1 0 0 1421 180 MSa/s 1524 167 MSa/s 5 1 8480 3478
fir_decimate_hb cint16 int16 15 1 2 256 1 0 1 103 2485 MSa/s 262 977 MSa/s 8 1 5086 2428
fir_decimate_hb cint16 cint16 23 1 2 256 1 0 0 389 658 MSa/s 465 550 MSa/s 5 1 5216 1800
fir_decimate_hb cfloat float 7 1 2 256 1 0 0 1703 150 MSa/s 1781 143 MSa/s 5 1 8384 2484
fir_decimate_hb cfloat float 15 1 2 256 1 0 0 2630 97 MSa/s 2710 94 MSa/s 5 1 8576 2996
fir_decimate_hb cfloat float 11 1 2 256 1 0 0 1577 162 MSa/s 1657 154 MSa/s 5 1 8464 2548
fir_decimate_hb cfloat cfloat 99 1 2 256 1 0 0 16527 15 MSa/s 16625 15 MSa/s 5 1 11228 6426
fir_decimate_hb cfloat cfloat 7 1 2 256 1 0 0 1579 162 MSa/s 1658 154 MSa/s 5 1 8444 2570
fir_decimate_hb cfloat cfloat 15 1 2 256 1 0 0 3298 77 MSa/s 3379 75 MSa/s 5 1 8732 2618
fir_decimate_hb cfloat cfloat 11 1 2 256 1 0 0 1961 130 MSa/s 2041 125 MSa/s 5 1 8540 2794
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cfloat float 99 1 2 256 1 0 0 13803 18 MSa/s 13901 18 MSa/s 5 1 10544 5698
fir_decimate_hb cint16 int16 19 1 2 256 1 0 0 104 2461 MSa/s 262 977 MSa/s 5 1 5130 1956
fir_decimate_hb cint16 int16 11 1 2 256 1 0 0 71 3605 MSa/s 267 958 MSa/s 5 1 5050 1904
fir_decimate_hb cint16 int16 23 1 2 256 1 0 0 104 2461 MSa/s 272 941 MSa/s 5 1 5138 1892
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 19 1 2 256 1 0 1 104 2461 MSa/s 262 977 MSa/s 8 1 5158 2492
fir_decimate_hb cint16 int16 7 1 2 64 1 0 0 51 1254 MSa/s 123 520 MSa/s 5 1 2674 1722
fir_decimate_hb cint16 int16 7 1 2 512 1 0 0 102 5019 MSa/s 517 990 MSa/s 5 1 8050 1862
fir_decimate_hb cint16 int16 7 1 2 256 1 0 1 70 3657 MSa/s 262 977 MSa/s 8 1 4998 2278
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 7 1 2 1024 1 0 0 166 6168 MSa/s 1030 994 MSa/s 5 1 14194 1862
fir_decimate_hb cint16 int16 7 1 2 256 1 0 0 70 3657 MSa/s 261 980 MSa/s 5 1 4978 1878
fir_decimate_hb cint16 int16 55 1 2 256 1 1 0 173 1479 MSa/s 320 800 MSa/s 7 1 7986 2414
fir_decimate_hb cint16 int16 27 1 2 256 1 0 1 235 1089 MSa/s 340 752 MSa/s 8 1 5246 2510
fir_decimate_hb cint16 int16 27 1 2 256 1 0 0 201 1273 MSa/s 279 917 MSa/s 5 1 5210 1914
fir_decimate_hb cint16 int16 27 1 2 256 1 1 0 139 1841 MSa/s 277 924 MSa/s 7 1 7514 2218
fir_decimate_hb cint16 int16 239 1 2 256 1 0 1 1278 200 MSa/s 1410 181 MSa/s 8 1 7742 3952
fir_decimate_hb cint16 int16 7 1 2 128 1 0 0 57 2245 MSa/s 133 962 MSa/s 5 1 3442 1816
fir_decimate_hb cint16 int16 239 1 2 256 1 0 0 1216 210 MSa/s 1319 194 MSa/s 5 1 7618 3638
fir_decimate_sym cint16 int16 99 1 3 384 5 0 0 327 317 317 318 319 1174 1211 1211 1207 1203 MSa/s 519 515 516 517 518 739 745 744 742 741 MSa/s 13 5 32656 2448 2248 2100 2106 2376
fir_decimate_sym cint16 int16 99 1 3 384 4 0 0 358 317 317 331 1072 1211 1211 1160 MSa/s 526 523 524 525 730 734 732 731 MSa/s 11 4 25056 2448 2248 2242 2392
fir_decimate_sym cint16 int16 99 1 3 384 3 0 0 391 349 361 982 1100 1063 MSa/s 558 556 557 688 690 689 MSa/s 9 3 18288 2494 2272 2538
fir_decimate_sym cint16 int16 99 1 3 384 2 0 0 453 421 847 912 MSa/s 564 564 680 680 MSa/s 7 2 12352 2722 2718
fir_decimate_sym cint16 int16 99 1 3 384 1 0 1 773 496 MSa/s 890 431 MSa/s 8 1 7324 3056
fir_decimate_sym cint16 int16 9 1 3 192 1 0 0 77 2493 MSa/s 196 979 MSa/s 5 1 3995 2160
fir_decimate_sym cint16 int16 9 1 3 384 1 0 1 117 3282 MSa/s 390 984 MSa/s 8 1 6071 2508
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2160
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 394 974 MSa/s 5 1 6043 2160
fir_decimate_sym cint32 cint16 24 1 2 384 1 0 0 900 426 MSa/s 987 389 MSa/s 5 1 11627 2306
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint16 int16 99 1 3 384 1 0 0 711 540 MSa/s 801 479 MSa/s 5 1 7248 3650
fir_decimate_sym cint32 cint16 30 1 2 384 1 0 0 1619 237 MSa/s 1708 224 MSa/s 5 1 11779 2426
fir_decimate_sym cint16 int16 28 1 2 256 1 0 0 608 421 MSa/s 686 373 MSa/s 5 1 5218 2244
fir_decimate_sym cint32 cint16 96 1 2 384 1 0 0 2635 145 MSa/s 2742 140 MSa/s 5 1 13451 3546
fir_decimate_sym cint32 cint32 21 1 3 384 1 0 0 1137 337 MSa/s 1219 315 MSa/s 5 1 10772 2694
fir_decimate_sym cint32 cint32 30 1 3 384 1 0 0 1247 307 MSa/s 1331 288 MSa/s 5 1 11068 2926
fir_decimate_sym cint32 cint32 9 1 3 384 1 0 0 496 774 MSa/s 777 494 MSa/s 5 1 10388 2108
fir_decimate_sym cint32 cint32 99 1 3 384 1 0 0 4350 88 MSa/s 4459 86 MSa/s 5 1 13572 4026
fir_decimate_sym cint32 int32 24 1 2 384 1 0 0 900 426 MSa/s 987 389 MSa/s 5 1 11627 2294
fir_decimate_sym cint32 int32 30 1 2 384 1 0 0 1619 237 MSa/s 1708 224 MSa/s 5 1 11779 2414
fir_decimate_sym cint32 int32 8 1 2 384 1 0 0 372 1032 MSa/s 776 494 MSa/s 5 1 11180 1814
fir_decimate_sym cint32 int32 96 1 2 384 1 0 0 2635 145 MSa/s 2742 140 MSa/s 5 1 13451 3546
fir_decimate_sym int32 int32 21 1 3 384 1 0 0 547 702 MSa/s 626 613 MSa/s 5 1 6304 2090
fir_decimate_sym int32 int32 30 1 3 384 1 0 0 739 519 MSa/s 819 468 MSa/s 5 1 6404 2264
fir_decimate_sym int32 int32 9 1 3 384 1 0 0 147 2612 MSa/s 389 987 MSa/s 5 1 6095 2404
fir_decimate_sym int32 int32 99 1 3 384 1 0 0 1135 338 MSa/s 1225 313 MSa/s 5 1 7736 3064
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint32 cint16 8 1 2 384 1 0 0 372 1032 MSa/s 776 494 MSa/s 5 1 11180 1814
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint16 int16 237 1 3 384 1 0 1 1467 261 MSa/s 1600 240 MSa/s 8 1 9224 3970
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint16 cint16 21 1 3 384 1 0 0 548 700 MSa/s 627 612 MSa/s 5 1 6302 2102
fir_decimate_sym cint16 cint16 30 1 3 384 1 0 0 739 519 MSa/s 819 468 MSa/s 5 1 6402 2276
fir_decimate_sym cint16 cint16 9 1 3 384 1 0 0 147 2612 MSa/s 389 987 MSa/s 5 1 6094 2408
fir_decimate_sym cint16 cint16 99 1 3 384 1 0 0 1135 338 MSa/s 1225 313 MSa/s 5 1 7734 3080
fir_decimate_sym cint16 int16 100 1 2 256 1 0 0 1383 185 MSa/s 1472 173 MSa/s 5 1 6226 2794
fir_decimate_sym cint16 int16 12 1 3 384 1 0 0 118 3254 MSa/s 389 987 MSa/s 5 1 6049 2248
fir_decimate_sym cint16 int16 12 1 2 256 1 0 0 102 2509 MSa/s 260 984 MSa/s 5 1 5025 2076
fir_decimate_sym cint16 int16 12 1 3 384 1 0 0 118 3254 MSa/s 389 987 MSa/s 5 1 6049 2248
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint16 int16 15 1 3 384 1 0 0 258 1488 MSa/s 390 984 MSa/s 5 1 6055 1894
fir_decimate_sym cint16 int16 15 1 3 384 1 0 1 284 1352 MSa/s 393 977 MSa/s 8 1 6091 2220
fir_decimate_sym cint16 int16 16 1 2 384 1 0 0 134 2865 MSa/s 396 969 MSa/s 5 1 6699 2086
fir_decimate_sym cint16 int16 18 1 3 384 1 0 0 259 1482 MSa/s 390 984 MSa/s 5 1 6158 1982
fir_decimate_sym cint16 int16 18 1 3 384 1 0 1 286 1342 MSa/s 393 977 MSa/s 8 1 6170 2324
fir_decimate_sym cint16 int16 12 1 3 384 1 0 1 118 3254 MSa/s 390 984 MSa/s 8 1 6077 2620
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2176
fir_decimate_sym cint16 int16 237 1 3 384 1 0 0 1323 290 MSa/s 1427 269 MSa/s 5 1 8996 3580
fir_decimate_sym cint16 int16 9 1 3 384 1 0 0 117 3282 MSa/s 389 987 MSa/s 5 1 6043 2160
fir_decimate_sym cint16 int16 8 1 2 256 1 0 0 69 3710 MSa/s 260 984 MSa/s 5 1 4952 1908
fir_decimate_sym cint16 int16 60 1 3 384 1 0 0 484 793 MSa/s 567 677 MSa/s 5 1 6658 2708
fir_decimate_sym cint16 int16 60 1 2 384 1 0 0 1377 278 MSa/s 1460 263 MSa/s 5 1 7170 2696
fir_decimate_sym cint16 int16 60 1 2 384 1 1 0 662 580 MSa/s 792 484 MSa/s 7 1 10754 3004
fir_decimate_sym cint16 int16 27 1 3 384 1 0 0 358 1072 MSa/s 438 876 MSa/s 5 1 6240 2182
fir_decimate_sym cint16 int16 26 1 2 256 1 0 0 356 719 MSa/s 434 589 MSa/s 5 1 5214 2158
fir_decimate_sym cint16 int16 24 1 2 256 1 0 0 259 988 MSa/s 336 761 MSa/s 5 1 5146 2004
fir_decimate_sym cint16 int16 240 1 2 256 1 0 0 1296 197 MSa/s 1401 182 MSa/s 5 1 7978 3314
fir_decimate_sym cint16 int16 237 1 3 768 1 0 0 2603 295 MSa/s 2707 283 MSa/s 5 1 13092 3580
fir_decimate_sym cint16 int16 27 1 3 384 1 0 1 394 974 MSa/s 500 768 MSa/s 8 1 6268 2476
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1962
fir_interpolate_asym cint32 cint16 8 2 1 256 1 0 0 554 462 MSa/s 1028 249 MSa/s 5 1 14300 2428
fir_interpolate_asym cint32 cint16 64 2 1 256 1 0 0 4570 56 MSa/s 4656 54 MSa/s 5 1 16092 4314
fir_interpolate_asym cint32 cint16 32 2 1 256 1 0 0 2362 108 MSa/s 2443 104 MSa/s 5 1 15068 2972
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1962
fir_interpolate_asym cint32 cint32 32 2 1 256 1 0 0 5041 50 MSa/s 5122 49 MSa/s 5 1 15836 3580
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1946
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1946
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1930
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1930
fir_interpolate_asym cint16 int16 8 2 1 64 1 0 0 71 901 MSa/s 144 444 MSa/s 5 1 3484 1930
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1962
fir_interpolate_asym cint32 cint32 64 2 1 256 1 0 0 9426 27 MSa/s 9510 26 MSa/s 5 1 17628 4104
fir_interpolate_asym cint16 int16 8 2 1 512 1 0 0 295 1735 MSa/s 1023 500 MSa/s 5 1 14236 1946
fir_interpolate_asym cint32 int16 32 2 1 256 1 0 0 1644 155 MSa/s 1726 148 MSa/s 5 1 14684 2178
fir_interpolate_asym cint32 int16 64 2 1 256 1 0 0 2681 95 MSa/s 2765 92 MSa/s 5 1 15324 3112
fir_interpolate_asym cint32 int16 8 2 1 256 1 0 0 296 864 MSa/s 1023 250 MSa/s 5 1 14204 1850
fir_interpolate_asym cint32 int32 32 2 1 256 1 0 0 2362 108 MSa/s 2443 104 MSa/s 5 1 15068 2972
fir_interpolate_asym cint32 int32 64 2 1 256 1 0 0 4570 56 MSa/s 4656 54 MSa/s 5 1 16092 4314
fir_interpolate_asym cint32 int32 8 2 1 256 1 0 0 554 462 MSa/s 1028 249 MSa/s 5 1 14300 2428
fir_interpolate_asym int32 int16 32 2 1 256 1 0 0 748 342 MSa/s 826 309 MSa/s 5 1 8540 2130
fir_interpolate_asym int32 int16 64 2 1 256 1 0 0 1209 211 MSa/s 1287 198 MSa/s 5 1 9180 3118
fir_interpolate_asym int32 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1930
fir_interpolate_asym int32 int32 32 2 1 256 1 0 0 1144 223 MSa/s 1221 209 MSa/s 5 1 9052 2884
fir_interpolate_asym int32 int32 64 2 1 256 1 0 0 2202 116 MSa/s 2280 112 MSa/s 5 1 10204 4210
fir_interpolate_asym int32 int32 8 2 1 256 1 0 0 297 861 MSa/s 519 493 MSa/s 5 1 8220 2536
fir_interpolate_asym cint32 cint32 8 2 1 256 1 0 0 1724 148 MSa/s 1801 142 MSa/s 5 1 14492 2844
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 1 167 1532 MSa/s 520 492 MSa/s 8 1 8128 2950
fir_interpolate_asym cint16 int16 64 2 1 256 1 0 0 1209 211 MSa/s 1287 198 MSa/s 5 1 9180 3118
fir_interpolate_asym cint16 int16 8 2 1 128 1 0 0 103 1242 MSa/s 263 486 MSa/s 5 1 5020 1930
fir_interpolate_asym cint16 cint16 32 2 1 256 1 0 0 1144 223 MSa/s 1221 209 MSa/s 5 1 9052 2884
fir_interpolate_asym cint16 cint16 8 2 1 256 1 0 0 297 861 MSa/s 519 493 MSa/s 5 1 8220 2536
fir_interpolate_asym cint16 int16 128 2 1 256 1 0 0 2331 109 MSa/s 2415 106 MSa/s 5 1 10460 4492
fir_interpolate_asym cint16 int16 128 2 1 256 1 0 1 2541 100 MSa/s 2651 96 MSa/s 8 1 10720 5330
fir_interpolate_asym cint16 int16 128 2 1 256 2 0 0 1298 1267 197 202 MSa/s 1386 1385 184 184 MSa/s 7 2 16680 3176 3114
fir_interpolate_asym cint16 int16 128 2 1 256 3 0 0 992 1008 1034 258 253 247 MSa/s 1174 1159 1161 218 220 220 MSa/s 9 3 24132 2858 3288 3398
fir_interpolate_asym cint16 int16 128 2 1 256 4 0 0 782 741 749 765 327 345 341 334 MSa/s 912 908 909 910 280 281 281 281 MSa/s 11 4 32528 2192 2036 2036 2122
fir_interpolate_asym cint16 int16 128 2 1 256 5 0 0 753 764 740 742 749 339 335 345 345 341 MSa/s 894 917 891 892 893 286 279 287 286 286 MSa/s 13 5 42476 3160 3008 3008 3060 2034
fir_interpolate_asym cint16 int16 16 2 1 256 1 0 0 296 864 MSa/s 519 493 MSa/s 5 1 8220 2412
fir_interpolate_asym cint16 int16 16 2 1 256 1 0 1 296 864 MSa/s 530 483 MSa/s 8 1 8256 4550
fir_interpolate_asym cint16 int16 240 2 1 256 1 0 0 4524 56 MSa/s 4619 55 MSa/s 5 1 12700 4750
fir_interpolate_asym cint16 int16 240 2 1 256 1 0 1 4925 51 MSa/s 5043 50 MSa/s 8 1 13184 5624
fir_interpolate_asym cint16 cint16 64 2 1 256 1 0 0 2202 116 MSa/s 2280 112 MSa/s 5 1 10204 4210
fir_interpolate_asym cint16 int16 24 3 1 256 1 0 0 592 432 MSa/s 774 330 MSa/s 5 1 10680 3546
fir_interpolate_asym cint16 int16 8 2 1 1024 1 0 0 551 1858 MSa/s 2037 502 MSa/s 5 1 26524 1946
fir_interpolate_asym cint16 int16 64 2 1 256 1 0 1 1311 195 MSa/s 1418 180 MSa/s 8 1 9312 3714
fir_interpolate_asym cint16 int16 8 2 1 256 1 0 0 167 1532 MSa/s 516 496 MSa/s 5 1 8092 1930
fir_interpolate_asym cint16 int16 24 2 1 256 1 0 0 620 412 MSa/s 697 367 MSa/s 5 1 8444 1988
fir_interpolate_asym cint16 int16 32 2 1 256 1 0 0 748 342 MSa/s 826 309 MSa/s 5 1 8540 2130
fir_interpolate_asym cint16 int16 32 2 1 256 1 0 1 806 317 MSa/s 910 281 MSa/s 8 1 8608 3000
fir_interpolate_asym cint16 int16 30 3 1 256 1 0 0 855 299 MSa/s 933 274 MSa/s 5 1 10876 3826
fir_interpolate_asym cint16 int16 30 2 1 256 1 0 0 812 315 MSa/s 888 288 MSa/s 5 1 8532 3244
fir_interpolate_asym cint16 int16 24 8 1 256 1 0 0 1408 181 MSa/s 2045 125 MSa/s 5 1 20900 3212
fir_interpolate_asym cint16 int16 24 6 1 256 1 0 0 1098 233 MSa/s 1537 166 MSa/s 5 1 16812 3898
fir_interpolate_asym cint16 int16 24 4 1 256 1 0 0 742 345 MSa/s 1030 248 MSa/s 5 1 12500 2010
fir_interpolate_asym cint16 int16 30 5 1 256 1 0 0 918 278 MSa/s 1282 199 MSa/s 5 1 15052 3826
fir_interpolate_fract_asym cint16 int16 60 5 2 256 1 0 0 640 400 MSa/s 722 354 MSa/s 5 1 10196 3674
fir_interpolate_fract_asym cint16 int16 60 5 2 240 1 0 0 603 398 MSa/s 685 350 MSa/s 5 1 9748 3674
fir_interpolate_fract_asym cint16 int16 48 4 3 144 1 0 0 266 541 MSa/s 351 410 MSa/s 5 1 5364 2016
fir_interpolate_fract_asym cint16 int16 41 5 4 256 1 0 0 339 755 MSa/s 422 606 MSa/s 5 1 7350 3842
fir_interpolate_fract_asym cint16 int16 36 9 5 360 1 0 0 441 816 MSa/s 642 560 MSa/s 5 1 11492 5170
fir_interpolate_fract_asym cint16 int16 40 10 3 120 1 0 0 264 454 MSa/s 403 297 MSa/s 5 1 7388 4624
fir_interpolate_fract_asym cint16 int16 36 9 8 576 1 0 0 479 1202 MSa/s 647 890 MSa/s 5 1 13220 4596
fir_interpolate_fract_asym cint16 int16 36 9 7 504 1 0 0 475 1061 MSa/s 647 778 MSa/s 5 1 12644 5216
fir_interpolate_fract_asym cint16 int16 60 5 3 240 1 0 0 480 500 MSa/s 562 427 MSa/s 5 1 8148 3754
fir_interpolate_fract_asym cint16 int16 40 10 7 280 1 0 0 294 952 MSa/s 407 687 MSa/s 5 1 8668 4702
fir_interpolate_fract_asym cint16 int16 60 5 4 480 1 0 0 704 681 MSa/s 786 610 MSa/s 5 1 11668 3720
fir_interpolate_fract_asym cint16 int16 84 7 6 336 1 0 0 482 697 MSa/s 565 594 MSa/s 5 1 9604 4266
fir_interpolate_fract_asym cint16 int16 75 5 4 160 3 0 1 501 478 429 319 334 372 MSa/s 538 546 530 297 293 301 MSa/s 12 3 15746 4280 3868 3768
fir_interpolate_fract_asym cint16 int16 84 7 2 336 1 0 0 1142 294 MSa/s 1225 274 MSa/s 5 1 15876 4262
fir_interpolate_fract_asym cint16 int16 84 7 3 168 1 0 0 452 371 MSa/s 534 314 MSa/s 5 1 8260 4138
fir_interpolate_fract_asym cint16 int16 84 7 4 672 1 0 0 1350 497 MSa/s 1432 469 MSa/s 5 1 18564 4194
fir_interpolate_fract_asym cint16 int16 84 7 5 840 1 0 0 1373 611 MSa/s 1456 576 MSa/s 5 1 19908 4234
fir_interpolate_fract_asym cint16 int16 95 8 5 320 1 0 0 576 555 MSa/s 658 486 MSa/s 5 1 10708 4010
fir_interpolate_fract_asym cint16 int16 96 8 3 192 1 0 0 487 394 MSa/s 569 337 MSa/s 5 1 9700 3374
fir_interpolate_fract_asym cint16 int16 96 8 5 480 1 0 0 840 571 MSa/s 922 520 MSa/s 5 1 14052 3470
fir_interpolate_fract_asym cint16 int16 96 8 7 672 1 0 0 906 741 MSa/s 987 680 MSa/s 5 1 15588 3416
fir_interpolate_fract_asym cint32 cint32 60 5 2 256 1 0 0 5361 47 MSa/s 5445 47 MSa/s 5 1 21660 4986
fir_interpolate_fract_asym cint16 int16 36 9 4 288 1 0 0 444 648 MSa/s 643 447 MSa/s 5 1 10916 4484
fir_interpolate_fract_asym cint16 int16 72 6 5 360 1 0 0 517 696 MSa/s 599 601 MSa/s 5 1 9644 3248
fir_interpolate_fract_asym cint16 int16 36 9 2 144 1 0 0 395 364 MSa/s 637 226 MSa/s 5 1 9764 4418
fir_interpolate_fract_asym cint16 int16 40 10 9 360 1 0 0 305 1180 MSa/s 408 882 MSa/s 5 1 9308 4822
fir_interpolate_fract_asym cint16 int16 32 8 7 224 1 0 0 195 1148 MSa/s 282 794 MSa/s 5 1 6436 1812
fir_interpolate_fract_asym cint16 int16 36 3 2 144 1 0 0 292 493 MSa/s 374 385 MSa/s 5 1 5412 4700
fir_interpolate_fract_asym cint16 int16 108 9 2 432 1 0 0 1775 243 MSa/s 1944 222 MSa/s 5 1 25204 5930
fir_interpolate_fract_asym cint16 int16 108 9 4 864 1 0 0 1961 440 MSa/s 2044 422 MSa/s 5 1 28660 6074
fir_interpolate_fract_asym cint16 int16 108 9 5 1080 1 0 0 2094 515 MSa/s 2177 496 MSa/s 5 1 30388 5874
fir_interpolate_fract_asym cint16 int16 108 9 8 1728 1 0 0 2256 765 MSa/s 2338 739 MSa/s 5 1 35572 6022
fir_interpolate_fract_asym cint16 int16 120 10 3 240 1 0 0 781 307 MSa/s 863 278 MSa/s 5 1 14172 4778
fir_interpolate_fract_asym cint16 int16 120 10 7 840 1 0 0 1449 579 MSa/s 1531 548 MSa/s 5 1 22172 4802
fir_interpolate_fract_asym cint16 int16 120 10 9 720 1 0 0 992 725 MSa/s 1075 669 MSa/s 5 1 18012 4754
fir_interpolate_fract_asym cint16 int16 12 3 2 48 1 0 0 80 600 MSa/s 158 303 MSa/s 5 1 3028 1768
fir_interpolate_fract_asym cint16 int16 16 4 3 48 1 0 0 75 640 MSa/s 153 313 MSa/s 5 1 2996 1724
fir_interpolate_fract_asym cint16 int16 20 5 2 80 1 0 0 144 555 MSa/s 224 357 MSa/s 5 1 4548 2022
fir_interpolate_fract_asym cint16 int16 20 5 3 120 1 0 0 161 745 MSa/s 243 493 MSa/s 5 1 4868 2082
fir_interpolate_fract_asym cint16 int16 108 9 7 1512 1 0 0 2311 654 MSa/s 2394 631 MSa/s 5 1 33844 6008
fir_interpolate_fract_asym cint16 int16 20 5 4 256 1 0 0 237 1080 MSa/s 341 750 MSa/s 5 1 6878 2046
fir_interpolate_fract_asym cint16 int16 20 5 4 160 1 0 0 162 987 MSa/s 247 647 MSa/s 5 1 5188 2046
fir_interpolate_fract_asym cint16 int16 32 8 3 96 1 0 0 171 561 MSa/s 266 360 MSa/s 5 1 5412 1762
fir_interpolate_fract_asym cint16 int16 28 7 5 280 1 0 0 287 975 MSa/s 400 700 MSa/s 5 1 7924 3548
fir_interpolate_fract_asym cint16 int16 28 7 4 224 1 0 0 279 802 MSa/s 399 561 MSa/s 5 1 7476 3490
fir_interpolate_fract_asym cint16 int16 28 7 6 336 1 0 0 287 1170 MSa/s 400 840 MSa/s 5 1 8372 3544
fir_interpolate_fract_asym cint16 int16 28 7 3 168 1 0 0 263 638 MSa/s 397 423 MSa/s 5 1 7028 3478
fir_interpolate_fract_asym cint16 int16 28 7 2 112 1 0 0 259 432 MSa/s 396 282 MSa/s 5 1 6580 3416
fir_interpolate_fract_asym cint16 int16 24 6 5 120 1 0 0 125 960 MSa/s 203 591 MSa/s 5 1 4460 2866
fir_interpolate_fract_asym cint16 int16 24 3 2 256 1 0 0 428 598 MSa/s 533 480 MSa/s 7 1 10460 2362
fir_interpolate_fract_asym cint16 int16 32 8 5 160 1 0 0 187 855 MSa/s 272 588 MSa/s 5 1 5924 1802
fir_interpolate_hb cint32 cint32 99 2 1 256 1 0 0 6315 40 MSa/s 6406 39 MSa/s 5 1 16381 3224
fir_interpolate_hb cint32 cint16 7 2 1 256 1 0 0 391 654 MSa/s 1025 249 MSa/s 5 1 14209 2060
fir_interpolate_hb cint32 cint32 7 2 1 256 1 0 0 423 605 MSa/s 1025 249 MSa/s 5 1 14301 2334
fir_interpolate_hb cint32 cint32 239 2 1 256 1 0 0 8252 31 MSa/s 8357 30 MSa/s 5 1 19453 4140
fir_interpolate_hb cint32 cint32 15 2 1 256 1 0 0 679 377 MSa/s 1030 248 MSa/s 5 1 14525 2814
fir_interpolate_hb cint32 cint32 11 2 1 256 1 0 0 552 463 MSa/s 1028 249 MSa/s 5 1 14397 2570
fir_interpolate_hb cint32 cint16 99 2 1 256 1 0 0 2025 126 MSa/s 2116 120 MSa/s 5 1 15665 3150
fir_interpolate_hb cint32 int16 11 2 1 256 1 0 0 1143 223 MSa/s 1224 209 MSa/s 5 1 14267 2630
fir_interpolate_hb cint16 int16 99 2 1 256 1 0 0 1508 169 MSa/s 1591 160 MSa/s 5 1 8747 2326
fir_interpolate_hb cint32 cint16 15 2 1 256 1 0 0 455 562 MSa/s 1027 249 MSa/s 5 1 14337 2296
fir_interpolate_hb cint32 cint16 11 2 1 256 1 0 0 392 653 MSa/s 1025 249 MSa/s 5 1 14289 2202
fir_interpolate_hb cint16 int16 99 2 1 256 1 0 1 1508 169 MSa/s 1654 154 MSa/s 8 1 8791 2590
fir_interpolate_hb cint16 int16 99 2 1 128 5 0 0 448 448 361 349 332 285 285 354 366 385 MSa/s 466 467 468 463 464 274 274 273 276 275 MSa/s 13 5 22359 1880 1842 1806 1806 1834
fir_interpolate_hb int32 int32 99 2 1 256 1 0 0 1522 168 MSa/s 1605 159 MSa/s 5 1 9137 2738
fir_interpolate_hb cint16 int16 99 2 1 128 3 0 0 459 405 419 278 316 305 MSa/s 543 540 541 235 237 236 MSa/s 9 3 12993 1880 1842 1904
fir_interpolate_hb cint32 int16 15 2 1 256 1 0 0 1127 227 MSa/s 1208 211 MSa/s 5 1 14275 2706
fir_interpolate_hb cint16 int16 99 2 1 128 2 0 0 515 481 248 266 MSa/s 583 582 219 219 MSa/s 7 2 9078 1948 1944
fir_interpolate_hb cint32 cint16 239 2 1 256 1 0 0 6711 38 MSa/s 6816 37 MSa/s 5 1 17697 4098
fir_interpolate_hb cint32 int16 239 2 1 256 1 0 0 2416 105 MSa/s 2521 101 MSa/s 5 1 16835 3324
fir_interpolate_hb int32 int16 19 2 1 256 1 0 0 232 1103 MSa/s 518 494 MSa/s 5 1 8139 2116
fir_interpolate_hb cint32 int16 99 2 1 256 1 0 0 2788 91 MSa/s 2880 88 MSa/s 5 1 15275 2578
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2104
fir_interpolate_hb int32 int32 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8065 2068
fir_interpolate_hb int32 int32 27 2 1 256 1 0 0 359 713 MSa/s 520 492 MSa/s 5 1 8241 2682
fir_interpolate_hb int32 int32 239 2 1 256 1 0 0 2416 105 MSa/s 2511 101 MSa/s 5 1 10593 3222
fir_interpolate_hb int32 int32 23 2 1 256 1 0 0 296 864 MSa/s 519 493 MSa/s 5 1 8225 2574
fir_interpolate_hb int32 int16 99 2 1 256 1 0 0 774 330 MSa/s 857 298 MSa/s 5 1 8747 2512
fir_interpolate_hb int32 int16 7 2 1 256 1 0 0 230 1113 MSa/s 518 494 MSa/s 5 1 8051 1882
fir_interpolate_hb int32 int16 239 2 1 256 1 0 0 1167 219 MSa/s 1262 202 MSa/s 5 1 9731 3218
fir_interpolate_hb cint32 int16 7 2 1 256 1 0 0 422 606 MSa/s 1026 249 MSa/s 5 1 14195 1914
fir_interpolate_hb int32 int16 15 2 1 256 1 0 0 231 1108 MSa/s 518 494 MSa/s 5 1 8067 2010
fir_interpolate_hb float float 7 2 1 256 1 0 0 859 298 MSa/s 935 273 MSa/s 5 1 8321 2258
fir_interpolate_hb float float 19 2 1 256 1 0 0 1675 152 MSa/s 1753 146 MSa/s 5 1 8465 2794
fir_interpolate_hb float float 15 2 1 256 1 0 0 1219 210 MSa/s 1295 197 MSa/s 5 1 8385 2550
fir_interpolate_hb cint32 int32 99 2 1 256 1 0 0 2025 126 MSa/s 2116 120 MSa/s 5 1 15665 3150
fir_interpolate_hb cint32 int32 7 2 1 256 1 0 0 391 654 MSa/s 1025 249 MSa/s 5 1 14209 1966
fir_interpolate_hb cint32 int32 239 2 1 256 1 0 0 6711 38 MSa/s 6816 37 MSa/s 5 1 17697 4098
fir_interpolate_hb cint32 int32 15 2 1 256 1 0 0 455 562 MSa/s 1027 249 MSa/s 5 1 14337 2296
fir_interpolate_hb cint32 int32 11 2 1 256 1 0 0 392 653 MSa/s 1025 249 MSa/s 5 1 14289 2076
fir_interpolate_hb float float 99 2 1 256 1 0 0 10400 24 MSa/s 10483 24 MSa/s 5 1 9393 3884
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2104
fir_interpolate_hb cint16 int16 99 2 1 128 4 0 0 367 349 336 349 348 366 380 366 MSa/s 467 463 464 466 274 276 275 274 MSa/s 11 4 17420 1880 1842 1842 1894
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2088
fir_interpolate_hb cint16 int16 15 2 1 256 1 0 1 176 1454 MSa/s 518 494 MSa/s 8 1 8095 2480
fir_interpolate_hb cint16 int16 15 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8067 2000
fir_interpolate_hb cint16 int16 15 2 1 256 1 0 0 176 1454 MSa/s 501 510 MSa/s 5 1 8067 2016
fir_interpolate_hb cint16 int16 11 2 1 256 1 0 1 176 1454 MSa/s 518 494 MSa/s 8 1 8079 2552
fir_interpolate_hb cint16 int16 11 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8059 2104
fir_interpolate_hb cint16 int16 11 2 1 256 1 0 0 176 1454 MSa/s 501 510 MSa/s 5 1 8059 2104
fir_interpolate_hb cint16 cint16 99 2 1 256 1 0 0 1516 168 MSa/s 1599 160 MSa/s 5 1 9137 2746
fir_interpolate_hb cint16 cint16 27 2 1 256 1 0 0 359 713 MSa/s 520 492 MSa/s 5 1 8241 2702
fir_interpolate_hb cint16 int16 19 2 1 256 1 0 1 231 1108 MSa/s 519 493 MSa/s 8 1 8167 2706
fir_interpolate_hb cint16 cint16 239 2 1 256 1 0 0 2413 106 MSa/s 2508 102 MSa/s 5 1 10593 3218
fir_interpolate_hb cfloat float 99 2 1 256 1 0 0 21536 11 MSa/s 21627 11 MSa/s 5 1 15921 4044
fir_interpolate_hb cfloat float 7 2 1 256 1 0 0 1691 151 MSa/s 1769 144 MSa/s 5 1 14465 2290
fir_interpolate_hb cfloat float 15 2 1 256 1 0 0 2460 104 MSa/s 2540 100 MSa/s 5 1 14593 2582
fir_interpolate_hb cfloat cfloat 7 2 1 256 1 0 0 2735 93 MSa/s 2813 91 MSa/s 5 1 14557 2832
fir_interpolate_hb cfloat cfloat 239 2 1 256 1 0 0 45363 5 MSa/s 45468 5 MSa/s 5 1 19709 8226
fir_interpolate_hb cfloat cfloat 15 2 1 256 1 0 0 3326 76 MSa/s 3406 75 MSa/s 5 1 14781 3092
fir_interpolate_hb cfloat cfloat 11 2 1 256 1 0 0 2767 92 MSa/s 2847 89 MSa/s 5 1 14653 2786
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2104
fir_interpolate_hb cint16 cint16 23 2 1 256 1 0 0 296 864 MSa/s 519 493 MSa/s 5 1 8225 2582
fir_interpolate_hb cint16 int16 23 2 1 256 1 0 0 231 1108 MSa/s 508 503 MSa/s 5 1 8147 2262
fir_interpolate_hb cint16 cint16 7 2 1 256 1 0 0 176 1454 MSa/s 516 496 MSa/s 5 1 8065 2182
fir_interpolate_hb cint16 int16 239 2 1 256 1 0 0 2853 89 MSa/s 2948 86 MSa/s 5 1 9731 2890
fir_interpolate_hb cint16 int16 23 2 1 256 1 0 0 167 1532 MSa/s 500 512 MSa/s 5 1 8151 1988
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2104
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2088
fir_interpolate_hb cint16 int16 7 2 1 64 1 0 0 74 864 MSa/s 148 432 MSa/s 5 1 3443 2088
fir_interpolate_hb cint16 int16 7 2 1 512 1 0 0 312 1641 MSa/s 1024 500 MSa/s 5 1 14195 2088
fir_interpolate_hb cint16 int16 7 2 1 32 1 0 0 93 344 MSa/s 168 190 MSa/s 5 1 2675 2230
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2088
fir_interpolate_hb cint16 int16 7 2 1 128 1 0 0 108 1185 MSa/s 263 486 MSa/s 5 1 4979 2088
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2088
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 1 176 1454 MSa/s 518 494 MSa/s 8 1 8071 2504
fir_interpolate_hb cint16 int16 7 2 1 256 1 0 0 176 1454 MSa/s 517 495 MSa/s 5 1 8051 2104
fir_interpolate_hb cint16 int16 47 2 1 256 1 1 0 232 1103 MSa/s 514 498 MSa/s 7 1 10538 2338
fir_interpolate_hb cint16 int16 27 2 1 256 1 0 0 231 1108 MSa/s 508 503 MSa/s 5 1 8155 2310
fir_interpolate_hb cint16 int16 27 2 1 256 1 0 0 231 1108 MSa/s 518 494 MSa/s 5 1 8155 2310
fir_interpolate_hb cint16 int16 27 2 1 256 1 0 1 231 1108 MSa/s 520 492 MSa/s 8 1 8191 2918
fir_interpolate_hb cint16 int16 239 2 1 256 1 0 1 2853 89 MSa/s 3031 84 MSa/s 8 1 9855 3090
fir_interpolate_hb cint16 int16 43 2 1 256 1 1 0 234 1094 MSa/s 514 498 MSa/s 7 1 10530 2334
fir_interpolate_hb cint16 int16 47 2 1 256 1 0 0 928 275 MSa/s 1021 250 MSa/s 5 1 8291 1872
fir_interpolate_hb cint16 int16 43 2 1 256 1 0 0 547 468 MSa/s 640 400 MSa/s 5 1 8283 1872
fir_sr_asym cint32 int16 16 1 1 256 1 0 0 548 467 MSa/s 630 406 MSa/s 5 1 10314 2180
fir_sr_asym cint16 int16 8 1 1 256 1 0 0 165 1551 MSa/s 265 966 MSa/s 5 1 5978 1894
fir_sr_asym cint16 int16 8 1 1 256 1 0 0 165 1551 MSa/s 265 966 MSa/s 5 1 5978 1894
fir_sr_asym cint32 cint16 16 1 1 256 1 0 0 1060 241 MSa/s 1143 223 MSa/s 5 1 10442 2738
fir_sr_asym cint32 cint16 32 1 1 256 1 0 0 2203 116 MSa/s 2287 111 MSa/s 5 1 10954 3464
fir_sr_asym cint32 cint32 16 1 1 256 1 0 0 2271 112 MSa/s 2354 108 MSa/s 5 1 10700 3758
fir_sr_asym cint32 cint32 32 1 1 256 1 0 0 4389 58 MSa/s 4473 57 MSa/s 5 1 11468 5494
fir_sr_asym cint32 int16 32 1 1 256 1 0 0 1060 241 MSa/s 1145 223 MSa/s 5 1 10698 3040
fir_sr_asym int32 int32 32 1 1 256 1 0 0 1181 216 MSa/s 1260 203 MSa/s 5 1 6602 3832
fir_sr_asym cint32 int32 32 1 1 256 1 0 0 2203 116 MSa/s 2287 111 MSa/s 5 1 10954 3464
fir_sr_asym float float 16 1 1 256 1 0 0 1612 158 MSa/s 1690 151 MSa/s 5 1 6473 3020
fir_sr_asym float float 32 1 1 256 1 0 0 2958 86 MSa/s 3037 84 MSa/s 5 1 6857 3896
fir_sr_asym int16 int16 16 1 1 256 1 0 0 163 1570 MSa/s 238 1075 MSa/s 5 1 3986 2376
fir_sr_asym int16 int16 32 1 1 256 1 0 0 294 870 MSa/s 370 691 MSa/s 5 1 4178 3292
fir_sr_asym int32 int16 16 1 1 256 1 0 0 292 876 MSa/s 370 691 MSa/s 5 1 6090 2232
fir_sr_asym int32 int16 32 1 1 256 1 0 0 548 467 MSa/s 626 408 MSa/s 5 1 6346 2990
fir_sr_asym int32 int32 16 1 1 256 1 0 0 550 465 MSa/s 628 407 MSa/s 5 1 6218 3096
fir_sr_asym cint16 int16 8 1 1 256 1 0 1 165 1551 MSa/s 285 898 MSa/s 8 1 6014 2514
fir_sr_asym cint32 int32 16 1 1 256 1 0 0 1060 241 MSa/s 1143 223 MSa/s 5 1 10442 2738
fir_sr_asym cint16 int16 8 1 1 256 1 0 0 165 1551 MSa/s 265 966 MSa/s 5 1 5978 1894
fir_sr_asym cint16 int16 8 1 1 64 1 0 0 69 927 MSa/s 143 447 MSa/s 5 1 2906 1894
fir_sr_asym cint16 int16 64 1 1 256 1 0 1 1129 226 MSa/s 1302 196 MSa/s 8 1 6990 3726
fir_sr_asym cint16 int16 8 1 1 128 1 0 0 101 1267 MSa/s 176 727 MSa/s 5 1 3930 1894
fir_sr_asym cfloat cfloat 16 1 1 256 1 0 0 4185 61 MSa/s 4267 59 MSa/s 5 1 10969 3808
fir_sr_asym cfloat cfloat 32 1 1 256 1 0 0 8326 30 MSa/s 8410 30 MSa/s 5 1 11737 5206
fir_sr_asym cfloat float 16 1 1 256 1 0 0 2337 109 MSa/s 2419 105 MSa/s 5 1 10713 2634
fir_sr_asym cfloat float 32 1 1 256 1 0 0 4492 56 MSa/s 4576 55 MSa/s 5 1 11225 3704
fir_sr_asym cint16 cint16 24 1 1 256 1 0 0 900 284 MSa/s 994 257 MSa/s 5 1 6410 2780
fir_sr_asym cint16 cint16 32 1 1 256 1 0 0 1181 216 MSa/s 1260 203 MSa/s 5 1 6602 3832
fir_sr_asym cint16 int16 128 1 1 256 1 0 0 2188 117 MSa/s 2284 112 MSa/s 5 1 7882 5144
fir_sr_asym cint16 int16 128 1 1 256 1 0 1 2317 110 MSa/s 2436 105 MSa/s 8 1 8142 5462
fir_sr_asym cint16 cint16 16 1 1 256 1 0 0 550 465 MSa/s 628 407 MSa/s 5 1 6218 3096
fir_sr_asym cint16 int16 128 1 1 256 3 0 0 831 868 895 308 294 286 MSa/s 1035 1036 1038 247 247 246 MSa/s 9 3 19838 3908 3626 3708
fir_sr_asym cint16 int16 128 1 1 256 4 0 0 747 658 683 712 342 389 374 359 MSa/s 890 887 888 889 287 288 288 287 MSa/s 11 4 27304 3144 2962 2962 3032
fir_sr_asym cint16 int16 128 1 1 256 5 0 0 597 620 643 672 578 428 412 398 380 442 MSa/s 813 814 815 817 812 314 314 314 313 315 MSa/s 13 5 36274 2952 2722 2722 2722 2828
fir_sr_asym cint16 int16 64 1 1 256 1 0 0 1129 226 MSa/s 1213 211 MSa/s 5 1 6858 3570
fir_sr_asym cint16 int16 16 1 1 256 1 0 0 292 876 MSa/s 370 691 MSa/s 5 1 6090 2232
fir_sr_asym cint16 int16 16 1 1 256 1 0 1 327 782 MSa/s 432 592 MSa/s 8 1 6126 2988
fir_sr_asym cint16 int16 32 1 1 256 1 0 0 548 467 MSa/s 626 408 MSa/s 5 1 6346 2990
fir_sr_asym cint16 int16 32 1 1 256 1 0 1 548 467 MSa/s 675 379 MSa/s 8 1 6414 3294
fir_sr_asym cint16 int16 128 1 1 256 2 0 0 1665 1539 153 166 MSa/s 1762 1760 145 145 MSa/s 7 2 13268 3674 3362
fir_sr_sym cint32 cint16 16 1 1 256 1 0 0 816 313 MSa/s 899 284 MSa/s 5 1 10342 1900
fir_sr_sym cint16 int16 89 1 1 256 1 0 0 1054 242 MSa/s 1150 222 MSa/s 5 1 7064 2546
fir_sr_sym cint16 int16 8 1 1 256 1 0 0 117 2188 MSa/s 264 969 MSa/s 5 1 5974 1872
fir_sr_sym cint16 int16 8 1 1 256 1 0 0 117 2188 MSa/s 264 969 MSa/s 5 1 5974 1872
fir_sr_sym cint16 int16 8 1 1 64 1 0 0 59 1084 MSa/s 133 481 MSa/s 5 1 2902 1776
fir_sr_sym cint16 int16 8 1 1 256 1 0 1 117 2188 MSa/s 266 962 MSa/s 8 1 5994 2332
fir_sr_sym cint32 cint16 32 1 1 256 1 0 0 1329 192 MSa/s 1417 180 MSa/s 5 1 10758 2088
fir_sr_sym cint16 int16 8 1 1 256 1 0 0 117 2188 MSa/s 264 969 MSa/s 5 1 5974 1872
fir_sr_sym cint32 cint32 16 1 1 256 1 0 0 1573 162 MSa/s 1654 154 MSa/s 5 1 10502 2052
fir_sr_sym int32 int16 16 1 1 256 1 0 0 166 1542 MSa/s 266 962 MSa/s 5 1 6054 2098
fir_sr_sym cint32 int16 16 1 1 256 1 0 0 687 372 MSa/s 769 332 MSa/s 5 1 10278 1756
fir_sr_sym cint32 int16 32 1 1 256 1 0 0 1071 239 MSa/s 1159 220 MSa/s 5 1 10598 1864
fir_sr_sym cint32 int32 16 1 1 256 1 0 0 816 313 MSa/s 899 284 MSa/s 5 1 10342 1900
fir_sr_sym cint32 int32 32 1 1 256 1 0 0 1329 192 MSa/s 1417 180 MSa/s 5 1 10758 2088
fir_sr_sym float float 16 1 1 256 1 0 0 1610 159 MSa/s 1688 151 MSa/s 5 1 6373 2582
fir_sr_sym int16 int16 16 1 1 256 1 0 0 101 2534 MSa/s 176 1454 MSa/s 5 1 3942 2090
fir_sr_sym int16 int16 32 1 1 256 1 0 0 164 1560 MSa/s 241 1062 MSa/s 5 1 4070 2532
fir_sr_sym int16 int16 96 1 1 512 1 0 0 942 543 MSa/s 1029 497 MSa/s 5 1 6694 2800
fir_sr_sym cint16 int16 8 1 1 128 1 0 0 77 1662 MSa/s 152 842 MSa/s 5 1 3926 1872
fir_sr_sym int32 int16 32 1 1 256 1 0 0 455 562 MSa/s 535 478 MSa/s 5 1 6246 1908
fir_sr_sym cint32 cint32 32 1 1 256 1 0 0 2599 98 MSa/s 2686 95 MSa/s 5 1 11078 2710
fir_sr_sym cint16 int16 64 1 1 256 1 0 1 921 277 MSa/s 1054 242 MSa/s 8 1 6730 2348
fir_sr_sym cint16 int16 16 1 1 256 1 0 0 164 1560 MSa/s 276 927 MSa/s 5 1 6054 1950
fir_sr_sym cint16 int16 63 1 1 256 1 0 0 858 298 MSa/s 947 270 MSa/s 5 1 6660 2068
fir_sr_sym int32 int32 16 1 1 256 1 0 0 292 876 MSa/s 370 691 MSa/s 5 1 6118 2476
fir_sr_sym cint16 cint16 16 1 1 256 1 0 0 292 876 MSa/s 370 691 MSa/s 5 1 6118 2476
fir_sr_sym cint16 cint16 24 1 1 256 1 0 0 421 608 MSa/s 515 497 MSa/s 5 1 6310 2964
fir_sr_sym cint16 cint16 24 1 1 256 2 0 0 288 295 888 867 MSa/s 424 416 603 615 MSa/s 7 2 9900 2562 2296
fir_sr_sym cint16 cint16 30 1 1 512 3 0 0 851 868 891 601 589 574 MSa/s 1103 1110 1118 464 461 457 MSa/s 9 3 20282 1838 1698 1772
fir_sr_sym cint16 cint16 32 1 1 256 1 0 0 653 392 MSa/s 733 349 MSa/s 5 1 6406 2072
fir_sr_sym cint16 int16 128 1 1 256 1 0 0 1436 178 MSa/s 1532 167 MSa/s 5 1 7494 2656
fir_sr_sym cint16 int16 128 1 1 256 1 0 1 1436 178 MSa/s 1611 158 MSa/s 8 1 7626 2886
fir_sr_sym cint16 int16 128 1 1 256 2 0 0 928 917 275 279 MSa/s 1046 1045 244 244 MSa/s 7 2 12236 2416 2238
fir_sr_sym cint16 int16 64 1 1 256 1 0 0 921 277 MSa/s 1005 254 MSa/s 5 1 6662 2092
fir_sr_sym cint16 int16 128 1 1 256 3 0 0 788 787 797 324 325 321 MSa/s 964 965 966 265 265 265 MSa/s 9 3 17810 1970 1758 1844
fir_sr_sym cint16 int16 128 1 1 256 5 0 0 669 659 656 658 660 382 388 390 389 387 MSa/s 840 835 836 837 838 304 306 306 305 305 MSa/s 13 5 31454 1938 1648 1648 1648 1750
fir_sr_sym cint16 int16 129 1 1 256 1 0 0 1442 177 MSa/s 1544 165 MSa/s 5 1 7656 2774
fir_sr_sym cint16 int16 16 1 1 256 1 0 0 166 1542 MSa/s 266 962 MSa/s 5 1 6054 1950
fir_sr_sym cint16 int16 16 1 1 256 1 0 1 166 1542 MSa/s 295 867 MSa/s 8 1 6090 2586
fir_sr_sym cint16 int16 199 1 1 256 1 0 0 2018 126 MSa/s 2132 120 MSa/s 5 1 8500 3138
fir_sr_sym cint16 int16 240 1 1 256 1 0 0 2337 109 MSa/s 2442 104 MSa/s 5 1 8998 3120
fir_sr_sym cint16 int16 240 1 1 256 1 0 1 2469 103 MSa/s 2602 98 MSa/s 8 1 9226 3456
fir_sr_sym cint16 int16 32 1 1 256 1 0 0 663 386 MSa/s 742 345 MSa/s 5 1 6246 1804
fir_sr_sym cint16 int16 32 1 1 256 1 0 1 699 366 MSa/s 804 318 MSa/s 8 1 6282 2536
fir_sr_sym cint16 int16 128 1 1 256 4 0 0 682 659 658 661 375 388 389 387 MSa/s 838 835 836 837 305 306 306 305 MSa/s 11 4 24216 1938 1702 1702 1788
fir_sr_sym int32 int32 32 1 1 256 1 0 0 653 392 MSa/s 733 349 MSa/s 5 1 6406 2072
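
Rows with CASC_LEN greater than 1 carry one value per kernel in some columns, so they contain more space-delimited fields than the header has names. The sketch below shows one way to split such a row back into named columns. It is a minimal illustration, not part of the library: the helper name `parse_fir_row` is hypothetical, and the grouping rule (the four timing columns and PROGRAM_MEMORY each repeat CASC_LEN times, with the `MSa/s` unit appearing once per group) is an assumption inferred from the rows above.

```python
# Configuration column names, copied from the FIR benchmark header.
CONFIG_COLS = [
    "Library Element", "DATA_TYPE", "COEFF_TYPE", "FIR_LEN",
    "INTERPOLATE_FACTOR", "DECIMATE_FACTOR", "INPUT_WINDOW_VSIZE",
    "CASC_LEN", "DUAL_IP", "USE_COEFF_RELOAD",
]


def parse_fir_row(line):
    """Split one space-delimited FIR benchmark row into a dict.

    Assumption (inferred from the table, not documented): cycleCountAvg,
    throughputAvg, initiationInterval, throughpuInitIntAvg and
    PROGRAM_MEMORY hold CASC_LEN values each; NUM_BANKS, NUM_AIE and
    DATA_MEMORY hold a single value for the whole design.
    """
    # Drop the unit tokens so that only names and numbers remain.
    tokens = [t for t in line.split() if t != "MSa/s"]
    row = dict(zip(CONFIG_COLS, tokens))  # configuration values stay strings
    n = int(row["CASC_LEN"])
    pos = len(CONFIG_COLS)

    def take(count):
        """Consume the next `count` tokens as integers."""
        nonlocal pos
        values = [int(t) for t in tokens[pos:pos + count]]
        pos += count
        return values

    row["cycleCountAvg"] = take(n)
    row["throughputAvg"] = take(n)        # MSa/s
    row["initiationInterval"] = take(n)
    row["throughpuInitIntAvg"] = take(n)  # MSa/s, spelled as in the header
    row["NUM_BANKS"] = take(1)[0]
    row["NUM_AIE"] = take(1)[0]
    row["DATA_MEMORY"] = take(1)[0]
    row["PROGRAM_MEMORY"] = take(n)
    if pos != len(tokens):
        raise ValueError("row does not match the assumed layout")
    return row


# Two-kernel fir_interpolate_hb row taken from the table above.
example = ("fir_interpolate_hb cint16 int16 99 2 1 128 2 0 0 "
           "515 481 248 266 MSa/s 583 582 219 219 MSa/s 7 2 9078 1948 1944")
parsed = parse_fir_row(example)
print(parsed["cycleCountAvg"])   # [515, 481]
print(parsed["PROGRAM_MEMORY"])  # [1948, 1944]
```

The length check at the end rejects any row that does not fit the assumed layout, which is safer than silently misassigning fields.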

FFT

The following table gives results for the FFT/IFFT function across a wide variety of supported parameters, which are defined in L2 FFT configuration parameters.

fft_benchmark.csv

FFT benchmark
Library Element DATA_TYPE TWIDDLE_TYPE POINT_SIZE FFT_NIFFT CASC_LEN DYN_PT_SIZE WINDOW_VSIZE cycleCountAvg throughputAvg initiationInterval throughpuInitIntAvg NUM_BANKS NUM_AIE DATA_MEMORY PROGRAM_MEMORY
fft_ifft_dit_1ch cfloat cfloat 1024 1 1 1 1024 2578 397 MSa/s 3083 332 MSa/s 13 1 55864 5116
fft_ifft_dit_1ch cint16 cint16 64 1 1 1 64 520 123 MSa/s 582 109 MSa/s 8 1 11832 6568
fft_ifft_dit_1ch cint32 cint16 1024 1 1 0 1024 1515 675 MSa/s 2067 495 MSa/s 9 1 51640 5638
fft_ifft_dit_1ch cint32 cint16 1024 1 1 1 1024 1019 1004 MSa/s 2068 495 MSa/s 13 1 51768 6620
fft_ifft_dit_1ch cint32 cint16 1024 1 2 1 1024 508 812 2015 1261 MSa/s 2071 2073 494 493 MSa/s 22 2 76848 2 0 2 0 4926 3 0 3 0 6508
fft_ifft_dit_1ch cint32 cint16 1024 1 3 1 1024 426 812 368 2403 1261 2782 MSa/s 2076 2077 2075 493 493 493 MSa/s 31 3 101928 1 0 1 0 4942 2 0 2 0 4942 3 0 3 0 6508
fft_ifft_dit_1ch cint32 cint16 1024 1 4 1 1024 494 426 368 637 2072 2403 2782 1607 MSa/s 2080 2081 2083 2079 492 492 491 492 MSa/s 40 4 127008 1 0 1 0 6540 1 1 1 1 4910 2 1 2 1 4942 3 0 3 0 4942
fft_ifft_dit_1ch cint32 cint16 1024 1 5 1 1024 426 408 494 482 368 2403 2509 2072 2124 2782 MSa/s 2084 2086 2085 2088 2083 491 490 491 490 491 MSa/s 47 5 152088 1 0 1 0 4942 1 1 1 1 4942 2 2 2 2 4910 3 0 3 0 3902 3 2 3 2 4732
fft_ifft_dit_1ch cint32 cint16 128 1 1 0 1024 2327 55 MSa/s 2408 53 MSa/s 7 1 42424 4536
fft_ifft_dit_1ch cint32 cint16 128 1 2 0 1024 492 1838 260 69 MSa/s 2075 2077 61 61 MSa/s 12 2 67440 0 0 0 0 2296 1 0 1 0 3788
fft_ifft_dit_1ch cint32 cint16 128 1 1 0 128 295 433 MSa/s 350 365 MSa/s 7 1 13752 4270
fft_ifft_dit_1ch cint32 cint16 128 1 1 1 128 524 244 MSa/s 587 218 MSa/s 7 1 13880 6520
fft_ifft_dit_1ch cint32 cint16 16 1 1 0 1024 3929 4 MSa/s 4002 3 MSa/s 7 1 42424 2376
fft_ifft_dit_1ch cint32 cint16 16 1 1 0 16 80 200 MSa/s 134 119 MSa/s 7 1 10168 1900
fft_ifft_dit_1ch cint32 cint16 256 1 1 0 1024 1782 143 MSa/s 2071 123 MSa/s 7 1 43448 5016
fft_ifft_dit_1ch cint32 cint16 256 1 2 0 1024 852 938 300 272 MSa/s 2070 2071 123 123 MSa/s 12 2 68464 0 0 0 0 3274 1 0 1 0 3388
fft_ifft_dit_1ch cint32 cint16 256 1 1 0 256 436 587 MSa/s 524 488 MSa/s 7 1 18872 4742
fft_ifft_dit_1ch cint32 cint16 256 1 1 1 256 580 441 MSa/s 646 396 MSa/s 7 1 19000 6476
fft_ifft_dit_1ch cint32 cint16 32 1 1 0 1024 4516 7 MSa/s 4589 6 MSa/s 7 1 42424 2756
fft_ifft_dit_1ch cint32 cint16 32 1 2 0 1024 1116 3390 28 9 MSa/s 3474 3476 9 9 MSa/s 12 2 67440 0 0 0 0 1920 1 0 1 0 2398
fft_ifft_dit_1ch cint32 cint16 32 1 1 0 32 147 217 MSa/s 201 159 MSa/s 7 1 10680 2586
fft_ifft_dit_1ch cint32 cint16 32 1 1 1 32 423 75 MSa/s 485 65 MSa/s 7 1 10808 6504
fft_ifft_dit_1ch cint32 cint16 512 1 1 0 512 861 594 MSa/s 1039 492 MSa/s 8 1 29624 5118
fft_ifft_dit_1ch cint32 cint16 512 1 1 1 512 736 695 MSa/s 1059 483 MSa/s 8 1 29752 6580
fft_ifft_dit_1ch cint32 cint16 512 1 1 0 1024 1739 294 MSa/s 2070 247 MSa/s 8 1 46008 5428
fft_ifft_dit_1ch cint32 cint16 512 1 2 0 1024 1064 704 481 727 MSa/s 2070 2071 247 247 MSa/s 14 2 71024 0 0 0 0 3824 1 0 1 0 3298
fft_ifft_dit_1ch cint32 cint16 64 1 1 0 1024 3145 20 MSa/s 3218 19 MSa/s 7 1 42424 3434
fft_ifft_dit_1ch cint32 cint16 64 1 2 0 1024 892 2239 71 28 MSa/s 2329 2330 27 27 MSa/s 12 2 67440 0 0 0 0 2120 1 0 1 0 2854
fft_ifft_dit_1ch cint16 cint16 64 1 1 0 64 209 306 MSa/s 271 236 MSa/s 8 1 11704 3308
fft_ifft_dit_1ch cint32 cint16 64 1 1 0 64 207 309 MSa/s 261 245 MSa/s 7 1 11704 3220
fft_ifft_dit_1ch cint16 cint16 512 1 1 0 512 852 600 MSa/s 910 562 MSa/s 9 1 25528 5174
fft_ifft_dit_1ch cint16 cint16 32 0 1 1 32 390 82 MSa/s 452 70 MSa/s 8 1 11320 6524
fft_ifft_dit_1ch cfloat cfloat 128 1 1 1 128 885 144 MSa/s 948 135 MSa/s 7 1 13880 5076
fft_ifft_dit_1ch cfloat cfloat 16 1 1 0 16 142 112 MSa/s 197 81 MSa/s 7 1 10168 2574
fft_ifft_dit_1ch cfloat cfloat 256 1 1 0 1024 5391 47 MSa/s 5472 46 MSa/s 7 1 43448 5218
fft_ifft_dit_1ch cfloat cfloat 256 1 1 0 256 1341 190 MSa/s 1399 182 MSa/s 7 1 18872 4980
fft_ifft_dit_1ch cfloat cfloat 256 1 1 1 256 1147 223 MSa/s 1213 211 MSa/s 7 1 19000 5084
fft_ifft_dit_1ch cfloat cfloat 256 1 2 0 1024 2961 2488 86 102 MSa/s 3068 3069 83 83 MSa/s 12 2 68464 0 0 0 0 3452 1 0 1 0 3502
fft_ifft_dit_1ch cfloat cfloat 32 1 1 0 1024 6740 4 MSa/s 6821 4 MSa/s 7 1 42424 3738
fft_ifft_dit_1ch cfloat cfloat 32 1 1 0 32 227 140 MSa/s 282 113 MSa/s 7 1 10680 3500
fft_ifft_dit_1ch cfloat cfloat 32 1 1 1 32 682 46 MSa/s 744 43 MSa/s 7 1 10808 5076
fft_ifft_dit_1ch cfloat cfloat 32 1 2 0 1024 2566 4343 12 7 MSa/s 4450 4452 7 7 MSa/s 12 2 67440 0 0 0 0 2380 1 0 1 0 2864
fft_ifft_dit_1ch cfloat cfloat 512 1 1 0 1024 5661 90 MSa/s 5742 89 MSa/s 8 1 47544 5876
fft_ifft_dit_1ch cfloat cfloat 512 1 1 0 512 2809 182 MSa/s 2873 178 MSa/s 8 1 31160 5500
fft_ifft_dit_1ch cfloat cfloat 512 1 2 0 1024 2792 2845 183 179 MSa/s 2954 2955 173 173 MSa/s 14 2 72560 2 0 2 0 3476 3 0 3 0 3944
fft_ifft_dit_1ch cfloat cfloat 64 1 1 0 1024 5739 11 MSa/s 5820 10 MSa/s 7 1 42424 4322
fft_ifft_dit_1ch cfloat cfloat 64 1 1 0 64 369 173 MSa/s 424 150 MSa/s 7 1 11704 4052
fft_ifft_dit_1ch cfloat cfloat 64 1 1 1 64 731 87 MSa/s 793 80 MSa/s 7 1 11832 5068
fft_ifft_dit_1ch cfloat cfloat 64 1 2 0 1024 3056 2781 20 23 MSa/s 3165 3166 20 20 MSa/s 12 2 67440 0 0 0 0 2908 1 0 1 0 3008
fft_ifft_dit_1ch cint16 cint16 1024 0 1 0 1024 1647 621 MSa/s 1718 596 MSa/s 10 1 43448 5742
fft_ifft_dit_1ch cint16 cint16 1024 1 1 0 1024 1647 621 MSa/s 1718 596 MSa/s 10 1 43448 5742
fft_ifft_dit_1ch cint16 cint16 128 0 1 0 128 293 436 MSa/s 347 368 MSa/s 8 1 12728 4326
fft_ifft_dit_1ch cint16 cint16 128 1 1 0 128 293 436 MSa/s 347 368 MSa/s 8 1 12728 4326
fft_ifft_dit_1ch cint16 cint16 16 1 1 0 16 78 205 MSa/s 132 121 MSa/s 8 1 10936 1944
fft_ifft_dit_1ch cint16 cint16 2048 0 1 0 2048 3344 612 MSa/s 3425 597 MSa/s 13 1 79288 6054
fft_ifft_dit_1ch cint16 cint16 2048 1 1 0 2048 3344 612 MSa/s 3425 597 MSa/s 13 1 79288 6054
fft_ifft_dit_1ch cint16 cint16 256 0 1 0 256 454 563 MSa/s 511 500 MSa/s 8 1 16824 4750
fft_ifft_dit_1ch cint16 cint16 256 1 1 0 256 454 563 MSa/s 511 500 MSa/s 8 1 16824 4750
fft_ifft_dit_1ch cint16 cint16 32 0 1 0 32 136 235 MSa/s 198 161 MSa/s 8 1 11192 2562
fft_ifft_dit_1ch cint16 cint16 512 0 1 0 512 852 600 MSa/s 910 562 MSa/s 9 1 25528 5174
fft_ifft_dit_1ch cint32 cint16 64 1 1 1 64 519 123 MSa/s 582 109 MSa/s 7 1 11832 6512
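
The throughput columns can be cross-checked against the cycle counts. The sketch below assumes a 1 GHz clock and truncation to whole MSa/s; both are inferred from the figures in the table rather than stated in the text, and the helper name `throughput_msa` is hypothetical. In rows where WINDOW_VSIZE is larger than POINT_SIZE, the listed figures appear to follow POINT_SIZE, so the sample count is passed in explicitly.

```python
def throughput_msa(samples, cycles, clock_mhz=1000):
    """Return throughput in whole MSa/s for `samples` processed in `cycles`.

    clock_mhz=1000 (1 GHz) and the truncating division are assumptions
    inferred from the benchmark table, not documented values.
    """
    return samples * clock_mhz // cycles


# cfloat row with POINT_SIZE = WINDOW_VSIZE = 1024:
# cycleCountAvg = 2578, initiationInterval = 3083.
print(throughput_msa(1024, 2578))  # 397, the listed throughputAvg
print(throughput_msa(1024, 3083))  # 332, the listed throughpuInitIntAvg

# cint32/cint16 row with POINT_SIZE = 16 and WINDOW_VSIZE = 1024:
# the listed figures match POINT_SIZE, not WINDOW_VSIZE.
print(throughput_msa(16, 3929))    # 4
print(throughput_msa(16, 4002))    # 3 (15.998 / 4.002 truncated, not rounded)
```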

Matrix Multiply

The following table gives results for the matrix multiply function across a wide variety of supported parameters, which are defined in L2 Matrix Multiply Configuration Parameters.

Note

cycleCountAvg covers only the matrix multiply kernels and excludes the additional shuffling/tiling widget kernels, whereas initiationInterval and PROGRAM_MEMORY also list values for the shuffling/tiling widget kernels.

mat_mult_benchmark.csv

Matrix Multiply benchmark
Library Element T_DATA_A T_DATA_B P_DIM_A P_DIM_AB P_DIM_B P_ADD_TILING_A P_ADD_TILING_B P_ADD_DETILING_OUT P_INPUT_WINDOW_VSIZE_A P_INPUT_WINDOW_VSIZE_B P_CASC_LEN NITER cycleCountAvg initiationInterval NUM_BANKS NUM_AIE DATA_MEMORY PROGRAM_MEMORY
matrix_mult cfloat cfloat 8 64 4 1 1 1 512 256 4 16 643 680 718 738 844 859 875 851 890 867 883 898 906 29 9 49080 1830 3246 1830 3492 1830 3492 1358 1830 3748
matrix_mult cint16 cint16 8 8 8 1 1 1 64 64 1 100 105 204 205 206 9 3 9809 1894 2460 1390
matrix_mult cint16 cint16 8 8 8 1 1 1 64 64 1 100 105 204 205 206 9 3 9809 1894 2460 1390
matrix_mult cint16 int32 8 64 4 1 1 1 512 256 4 16 248 263 275 272 363 379 394 371 409 386 402 417 425 33 9 33988 1830 2814 1830 2910 1830 2910 1830 1358 3172
matrix_mult cint32 cint16 8 64 4 1 1 1 512 256 4 16 193 212 215 203 305 320 335 312 351 328 343 358 366 24 9 42180 1918 1756 1918 1798 1918 1798 1934 1246 2032
matrix_mult cint32 cint32 8 64 4 1 1 1 512 256 4 16 363 373 382 373 477 492 508 485 523 500 516 531 538 22 9 46776 1894 1964 1894 2136 1894 2136 1894 1246 2300
matrix_mult cint32 int32 8 64 4 1 1 1 512 256 4 16 193 212 215 203 305 320 335 312 351 328 343 358 366 24 9 42180 1918 1756 1918 1798 1918 1798 1934 1246 2032
matrix_mult float cfloat 8 64 4 1 1 1 512 256 4 16 383 395 405 401 498 513 505 544 521 529 551 536 559 32 9 40384 2726 2406 2726 2532 2572 2742 1246 2710 2768
matrix_mult int16 cint16 16 16 16 1 1 1 256 256 1 16 336 439 446 454 11 3 16976 2246 2472 1678
matrix_mult int16 cint32 8 64 4 1 1 1 512 256 4 16 167 172 174 162 254 269 285 262 300 277 292 308 316 25 9 34244 2318 1846 2302 1854 2318 1854 2302 1246 2072
matrix_mult int16 int16 16 16 16 1 1 1 256 256 1 16 242 348 356 363 11 3 12878 1894 2076 1406
matrix_mult int32 cint16 8 64 4 1 1 1 512 256 4 16 248 263 275 272 363 379 394 371 409 386 402 417 425 33 9 33988 1830 2814 1830 2910 1830 2910 1830 1358 3172
matrix_mult cint16 cint16 8 64 4 1 1 1 512 256 4 16 113 131 132 126 215 230 246 223 261 238 254 269 277 30 9 33604 1830 1912 1830 2060 1830 2060 1830 1246 2202
matrix_mult cint16 cint16 8 64 4 1 1 1 512 256 1 16 287 561 569 576 8 3 19537 1830 2106 1246
matrix_mult cint16 cint16 8 4 64 1 1 1 32 256 1 16 289 526 534 542 10 3 19281 1294 1922 1478
matrix_mult cint16 cint16 8 4 512 1 1 1 32 2048 1 16 2079 4025 4033 4040 15 3 105297 1310 1922 1494
matrix_mult cfloat float 8 64 4 1 1 1 512 256 4 16 346 354 364 356 457 472 488 465 503 480 496 511 519 19 9 44480 1990 2384 1990 2456 1990 2456 1990 1246 2688
matrix_mult cint16 cint16 1024 4 4 1 1 1 4096 16 1 100 2462 4159 4160 4161 17 3 105297 1102 1944 1286
matrix_mult cint16 cint16 1024 4 4 1 1 1 4096 16 1 16 2462 4468 4476 4484 17 3 105297 1102 1944 1286
matrix_mult cint16 cint16 16 16 16 0 0 0 256 256 1 16 593 678 7 1 8329 2766
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 615 691 692 693 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 16 614 723 731 739 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 612 688 689 690 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 612 688 689 690 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 615 691 692 693 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 615 691 693 694 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 615 691 692 693 8 3 19025 2326 2750 1678
matrix_mult cint16 cint16 24 4 4 1 1 1 96 16 1 16 87 169 177 184 11 3 9297 1968 1286 1102
matrix_mult cint16 cint16 32 32 32 0 0 0 1024 1024 1 16 4227 4360 7 1 26761 3878
matrix_mult cint16 cint16 32 32 32 1 1 1 1024 1024 1 100 4486 4576 4577 4579 10 3 55889 2326 3878 1678
matrix_mult cint16 cint16 32 32 64 0 0 0 1024 2048 1 16 8357 8554 7 1 43145 3878
matrix_mult cint16 cint16 32 64 32 0 0 0 2048 2048 1 16 8325 8522 7 1 43145 3878
matrix_mult cint16 cint16 64 64 64 0 0 0 4096 4096 1 16 33069 33394 13 1 100489 3862
matrix_mult cint16 cint16 8 4 4 1 1 1 32 16 1 16 46 124 132 139 11 3 7761 1694 1230 1102
matrix_mult cint16 cint16 8 4 512 1 1 1 32 2048 1 100 2079 4091 4092 4093 15 3 105297 1310 1922 1494
matrix_mult cint16 cint16 16 16 16 1 1 1 256 256 1 100 615 691 693 694 8 3 19025 2326 2750 1678
matrix_mult int32 int32 8 64 4 1 1 1 512 256 4 16 113 131 132 126 215 230 246 223 261 238 254 269 277 30 9 33604 1830 1912 1830 2060 1830 2060 1830 1246 2202
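
Because of the note above, a matrix multiply row can hold fewer cycleCountAvg values than initiationInterval values. The sketch below recovers the column boundaries under an assumed layout: cycleCountAvg has P_CASC_LEN values, while initiationInterval and PROGRAM_MEMORY have one value per tile (NUM_AIE). This layout is inferred from the rows above rather than documented, and `parse_matmult_row` is a hypothetical helper.

```python
# Configuration column names, copied from the Matrix Multiply header.
MATMULT_CONFIG_COLS = [
    "Library Element", "T_DATA_A", "T_DATA_B", "P_DIM_A", "P_DIM_AB",
    "P_DIM_B", "P_ADD_TILING_A", "P_ADD_TILING_B", "P_ADD_DETILING_OUT",
    "P_INPUT_WINDOW_VSIZE_A", "P_INPUT_WINDOW_VSIZE_B", "P_CASC_LEN", "NITER",
]


def parse_matmult_row(line):
    """Split one matrix multiply benchmark row into a dict.

    Assumed layout of the result fields (inferred, not documented):
    P_CASC_LEN cycle counts, then k initiation intervals, NUM_BANKS,
    NUM_AIE, DATA_MEMORY, then k program memory sizes, with k == NUM_AIE.
    """
    tokens = line.split()
    row = dict(zip(MATMULT_CONFIG_COLS, tokens))
    results = [int(t) for t in tokens[len(MATMULT_CONFIG_COLS):]]
    casc = int(row["P_CASC_LEN"])

    # len(results) = casc + k + 3 + k, so solve for k.
    k, remainder = divmod(len(results) - casc - 3, 2)
    if remainder or k < 1:
        raise ValueError("row does not match the assumed layout")

    row["cycleCountAvg"] = results[:casc]
    row["initiationInterval"] = results[casc:casc + k]
    banks, num_aie, data_mem = results[casc + k:casc + k + 3]
    if num_aie != k:
        raise ValueError("NUM_AIE disagrees with the inferred kernel count")
    row["NUM_BANKS"], row["NUM_AIE"], row["DATA_MEMORY"] = banks, num_aie, data_mem
    row["PROGRAM_MEMORY"] = results[casc + k + 3:]
    return row


# Row with tiling enabled: one cycle count, three initiation intervals.
tiled = parse_matmult_row(
    "matrix_mult cint16 cint16 8 8 8 1 1 1 64 64 1 100 "
    "105 204 205 206 9 3 9809 1894 2460 1390")
print(tiled["cycleCountAvg"])       # [105]
print(tiled["initiationInterval"])  # [204, 205, 206]
```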

Widgets

The following table gives results for the widgets across a wide variety of supported parameters, which are defined in L2 Widgets Configuration Parameters.

widget_benchmark.csv

Widgets benchmark
Library Element DATA_TYPE IN_API OUT_API NUM_INPUTS WINDOW_VSIZE NUM_OUTPUT_CLONES cycleCountAvg throughputAvg initiationInterval throughpuInitIntAvg NUM_BANKS NUM_AIE DATA_MEMORY PROGRAM_MEMORY
widget_api_cast cfloat 0 0 1 256 1 88 2909 MSa/s 517 495 MSa/s 5 1 8980 1388
widget_api_cast cfloat 0 1 1 256 2 531 482 MSa/s 574 445 MSa/s 3 1 4884 1224
widget_api_cast cfloat 1 0 2 256 3 531 482 MSa/s 600 426 MSa/s 7 1 13076 1698
widget_api_cast cint16 0 0 1 256 1 56 4571 MSa/s 263 973 MSa/s 5 1 4884 1388
widget_api_cast cint16 0 0 1 256 2 90 2844 MSa/s 264 969 MSa/s 7 1 6932 1660
widget_api_cast cint16 0 0 1 256 3 124 2064 MSa/s 265 966 MSa/s 9 1 8980 1904
widget_api_cast cint16 0 1 1 256 1 274 934 MSa/s 314 815 MSa/s 3 1 2836 1208
widget_api_cast cint16 1 0 1 256 1 272 941 MSa/s 310 825 MSa/s 3 1 2836 1202
widget_api_cast cint16 1 0 1 256 2 275 930 MSa/s 329 778 MSa/s 5 1 4884 1406
widget_api_cast cint16 1 0 1 256 3 280 914 MSa/s 349 733 MSa/s 7 1 6932 1658
widget_api_cast cint16 1 0 2 256 1 273 937 MSa/s 311 823 MSa/s 3 1 2836 1218
widget_api_cast cint32 0 1 1 256 2 531 482 MSa/s 574 445 MSa/s 3 1 4884 1224
widget_api_cast cint32 1 0 1 256 4 535 478 MSa/s 619 413 MSa/s 9 1 17172 1848
widget_api_cast cint32 1 0 2 256 4 535 478 MSa/s 622 411 MSa/s 9 1 17172 1920
widget_api_cast int32 0 1 1 256 2 275 930 MSa/s 315 812 MSa/s 3 1 2836 1224
widget_api_cast int32 1 0 2 256 2 274 934 MSa/s 329 778 MSa/s 5 1 4884 1390
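
Since cycleCountAvg excludes kernel/window buffer overheads while initiationInterval includes them, the difference between the two gives a rough per-iteration overhead estimate. The sketch below computes it for two rows of the widgets table. It is an illustration only: the helper name is hypothetical, and treating the whole difference as overhead is a simplifying assumption.

```python
def overhead_cycles(cycle_count_avg, initiation_interval):
    """Cycles per iteration not spent inside the kernel function.

    Simplifying assumption: everything between two consecutive kernel
    starts that is not kernel execution is counted as overhead.
    """
    return initiation_interval - cycle_count_avg


# (cycleCountAvg, initiationInterval) pairs taken from the table above.
rows = {
    "cfloat IN_API=0 OUT_API=0": (88, 517),
    "cint16 IN_API=0 OUT_API=1": (274, 314),
}
for label, (cycles, interval) in rows.items():
    print(label, overhead_cycles(cycles, interval))
# cfloat IN_API=0 OUT_API=0 429
# cint16 IN_API=0 OUT_API=1 40
```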