Benchmark/QoR
This section provides L2 performance benchmarks and QoR (Quality of Results) figures for AIE DSP library elements in various configurations. The results are extracted from hardware-emulation-based simulations run with the Makefile flow described in: Compiling and Simulating Using the Makefile.
The QoR is reported using the following metrics:
- cycleCountAvg - average number of cycles taken to execute the kernel function (not including kernel/window buffer overheads).
- throughputAvg - input throughput calculated from cycleCountAvg and the input window size.
- initiationInterval - time that must elapse between the starts of two consecutive iterations of a given function, including overheads, i.e., the time between a function start and its previous start.
- throughpuInitIntAvg - input throughput calculated from initiationInterval and the input window size.
- NUM_BANKS - number of memory banks used by the design.
- NUM_AIE - number of AIE tiles used by the design.
- DATA_MEMORY - total data memory, in bytes, used by the design.
- PROGRAM_MEMORY - total program memory, in bytes, used by the design.
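The relationship between the cycle-based metrics and the two throughput metrics above can be sketched as follows. This is a minimal illustration, not the library's own reporting script: the helper name `throughput_msa_per_s` is hypothetical, and the 1 GHz clock and the truncation to whole MSa/s are assumptions inferred from the table figures rather than stated in the text.

```python
# Assumption: the AIE clock runs at 1 GHz (1000 cycles per microsecond) and
# reported figures are truncated to whole MSa/s. Neither is stated in the
# text; both are inferred from the figures in the table.
AIE_CLOCK_MHZ = 1000


def throughput_msa_per_s(window_vsize: int, cycles: int,
                         clock_mhz: int = AIE_CLOCK_MHZ) -> int:
    """Input throughput in MSa/s for one input window processed in `cycles`."""
    return window_vsize * clock_mhz // cycles


# fir_decimate_asym, cint32 data, cint16 coefficients, FIR_LEN=9,
# INPUT_WINDOW_VSIZE=384
window_vsize = 384
cycle_count_avg = 451
initiation_interval = 777

throughput_avg = throughput_msa_per_s(window_vsize, cycle_count_avg)
throughput_init_int_avg = throughput_msa_per_s(window_vsize, initiation_interval)

# Cycles per iteration not spent inside the kernel function itself
overhead_cycles = initiation_interval - cycle_count_avg

print(throughput_avg, throughput_init_int_avg, overhead_cycles)  # 851 494 326
```

The gap between the two throughput figures reflects the overheads that initiationInterval includes and cycleCountAvg excludes, which is why short filters show a large difference while long filters show almost none.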
In addition, QoR metrics are harvested for each kernel in the design. For example, a FIR configured to be implemented on two tiles (CASC_LEN=2) will have two sets of figures in the table below (space-delimited).
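When post-processing the table, the per-kernel figures described above can be separated with a small helper. This is a sketch only; `split_per_kernel` is a hypothetical name, and the cell strings are copied from the fir_decimate_asym row with CASC_LEN=2.

```python
from typing import List


def split_per_kernel(cell: str) -> List[int]:
    """Split a table cell into one integer per kernel.

    Tokens are separated by whitespace; a trailing unit such as "MSa/s"
    is dropped because it is not a plain number.
    """
    return [int(token) for token in cell.split() if token.isdigit()]


# Cells from the fir_decimate_asym row with CASC_LEN=2
cycle_counts = split_per_kernel("557 544")
throughputs = split_per_kernel("689 705 MSa/s")
program_memory = split_per_kernel("2526 2308")

for kernel, (cycles, msa) in enumerate(zip(cycle_counts, throughputs)):
    print(f"kernel {kernel}: {cycles} cycles, {msa} MSa/s")
```

Columns such as NUM_BANKS, NUM_AIE and DATA_MEMORY hold a single design-wide value even for cascaded designs, so the helper returns a one-element list for them.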
Filters
The following table gives results for FIR filters across a wide variety of supported parameters, which are defined in: L2 FIR configuration parameters
Library Element | DATA_TYPE | COEFF_TYPE | FIR_LEN | INTERPOLATE_FACTOR | DECIMATE_FACTOR | INPUT_WINDOW_VSIZE | CASC_LEN | DUAL_IP | USE_COEFF_RELOAD | cycleCountAvg | throughputAvg | initiationInterval | throughpuInitIntAvg | NUM_BANKS | NUM_AIE | DATA_MEMORY | PROGRAM_MEMORY |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
fir_decimate_asym | cfloat | cfloat | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 9358 | 41 MSa/s | 9440 | 40 MSa/s | 5 | 1 | 11361 | 4956 |
fir_decimate_asym | cint32 | cint32 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 2167 | 177 MSa/s | 2251 | 170 MSa/s | 5 | 1 | 11484 | 3014 |
fir_decimate_asym | cint32 | cint16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 3706 | 103 MSa/s | 3815 | 100 MSa/s | 5 | 1 | 13206 | 4040 |
fir_decimate_asym | cint32 | cint16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 451 | 851 MSa/s | 777 | 494 MSa/s | 5 | 1 | 10382 | 1748 |
fir_decimate_asym | cint32 | cint16 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 1125 | 341 MSa/s | 1208 | 317 MSa/s | 5 | 1 | 10978 | 2902 |
fir_decimate_asym | cint32 | cint16 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 839 | 457 MSa/s | 927 | 414 MSa/s | 5 | 1 | 10718 | 2184 |
fir_decimate_asym | cint32 | cint32 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 741 | 518 MSa/s | 817 | 470 MSa/s | 5 | 1 | 10516 | 2336 |
fir_decimate_asym | cint16 | int16 | 99 | 1 | 3 | 384 | 2 | 0 | 0 | 557 544 | 689 705 MSa/s | 686 683 | 559 562 MSa/s | 7 | 2 | 13184 | 2526 2308 |
fir_decimate_asym | cint16 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 935 | 410 MSa/s | 1025 | 374 MSa/s | 5 | 1 | 7568 | 2840 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 1 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 8 | 1 | 6120 | 2878 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2210 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 192 | 1 | 0 | 0 | 94 | 2042 MSa/s | 197 | 974 MSa/s | 5 | 1 | 4060 | 2210 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 402 | 955 MSa/s | 5 | 1 | 6108 | 2210 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2226 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2226 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2226 |
fir_decimate_asym | cint16 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 1 | 1066 | 360 MSa/s | 1183 | 324 MSa/s | 8 | 1 | 7748 | 3198 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2210 |
fir_decimate_asym | cint32 | cint32 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 7271 | 52 MSa/s | 7380 | 52 MSa/s | 5 | 1 | 14756 | 4286 |
fir_decimate_asym | cint32 | int16 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 674 | 569 MSa/s | 777 | 494 MSa/s | 5 | 1 | 10726 | 2256 |
fir_decimate_asym | int32 | int32 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 1770 | 216 MSa/s | 1860 | 206 MSa/s | 5 | 1 | 8342 | 3798 |
fir_decimate_asym | int32 | int32 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 214 | 1794 MSa/s | 391 | 982 MSa/s | 5 | 1 | 6222 | 2698 |
fir_decimate_asym | int32 | int32 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 642 | 598 MSa/s | 719 | 534 MSa/s | 5 | 1 | 6626 | 2406 |
fir_decimate_asym | int32 | int32 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 485 | 791 MSa/s | 565 | 679 MSa/s | 5 | 1 | 6430 | 1850 |
fir_decimate_asym | int32 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 3086 | 124 MSa/s | 3177 | 120 MSa/s | 5 | 1 | 7568 | 4294 |
fir_decimate_asym | int32 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 238 | 1613 MSa/s | 391 | 982 MSa/s | 5 | 1 | 6108 | 2126 |
fir_decimate_asym | int32 | int16 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 1010 | 380 MSa/s | 1089 | 352 MSa/s | 5 | 1 | 6374 | 2996 |
fir_decimate_asym | int32 | int16 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 803 | 478 MSa/s | 884 | 434 MSa/s | 5 | 1 | 6292 | 2612 |
fir_decimate_asym | cint32 | int16 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 545 | 704 MSa/s | 778 | 493 MSa/s | 5 | 1 | 10580 | 1916 |
fir_decimate_asym | float | float | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 10829 | 35 MSa/s | 10920 | 35 MSa/s | 5 | 1 | 8599 | 7274 |
fir_decimate_asym | float | float | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 3715 | 103 MSa/s | 3793 | 101 MSa/s | 5 | 1 | 6883 | 3712 |
fir_decimate_asym | float | float | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 2625 | 146 MSa/s | 2701 | 142 MSa/s | 5 | 1 | 6687 | 3474 |
fir_decimate_asym | cint32 | int32 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 3706 | 103 MSa/s | 3815 | 100 MSa/s | 5 | 1 | 13206 | 4040 |
fir_decimate_asym | cint32 | int32 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 451 | 851 MSa/s | 777 | 494 MSa/s | 5 | 1 | 10382 | 1748 |
fir_decimate_asym | cint32 | int32 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 1125 | 341 MSa/s | 1208 | 317 MSa/s | 5 | 1 | 10978 | 2902 |
fir_decimate_asym | cint32 | int32 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 839 | 457 MSa/s | 927 | 414 MSa/s | 5 | 1 | 10718 | 2184 |
fir_decimate_asym | cint32 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 1806 | 212 MSa/s | 1914 | 200 MSa/s | 5 | 1 | 12432 | 3856 |
fir_decimate_asym | cint32 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 354 | 1084 MSa/s | 776 | 494 MSa/s | 5 | 1 | 10268 | 1676 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2210 |
fir_decimate_asym | cint32 | cint32 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 1510 | 254 MSa/s | 1592 | 241 MSa/s | 5 | 1 | 11092 | 3174 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2210 |
fir_decimate_asym | cint16 | int16 | 144 | 1 | 2 | 432 | 1 | 0 | 0 | 2154 | 200 MSa/s | 2251 | 191 MSa/s | 5 | 1 | 9258 | 3256 |
fir_decimate_asym | cint16 | int16 | 140 | 1 | 7 | 420 | 1 | 0 | 0 | 1678 | 250 MSa/s | 1776 | 236 MSa/s | 5 | 1 | 7906 | 3226 |
fir_decimate_asym | cint16 | int16 | 140 | 1 | 5 | 420 | 1 | 0 | 0 | 888 | 472 MSa/s | 985 | 426 MSa/s | 5 | 1 | 8098 | 3232 |
fir_decimate_asym | cint16 | int16 | 12 | 1 | 4 | 384 | 1 | 0 | 0 | 135 | 2844 MSa/s | 390 | 984 MSa/s | 5 | 1 | 5858 | 1872 |
fir_decimate_asym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 1 | 192 | 2000 MSa/s | 391 | 982 MSa/s | 8 | 1 | 6134 | 2914 |
fir_decimate_asym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 0 | 151 | 2543 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6114 | 2230 |
fir_decimate_asym | cint16 | int16 | 12 | 1 | 2 | 384 | 1 | 0 | 0 | 183 | 2098 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6626 | 1878 |
fir_decimate_asym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 0 | 151 | 2543 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6114 | 2230 |
fir_decimate_asym | cint16 | cint16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 1691 | 227 MSa/s | 1782 | 215 MSa/s | 5 | 1 | 8342 | 3810 |
fir_decimate_asym | cint16 | cint16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 214 | 1794 MSa/s | 391 | 982 MSa/s | 5 | 1 | 6222 | 2698 |
fir_decimate_asym | cint16 | cint16 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 642 | 598 MSa/s | 719 | 534 MSa/s | 5 | 1 | 6626 | 2390 |
fir_decimate_asym | cfloat | float | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 38309 | 10 MSa/s | 38418 | 9 MSa/s | 5 | 1 | 13477 | 11006 |
fir_decimate_asym | cfloat | float | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 3786 | 101 MSa/s | 3869 | 99 MSa/s | 5 | 1 | 10653 | 2738 |
fir_decimate_asym | cfloat | float | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 12139 | 31 MSa/s | 12222 | 31 MSa/s | 5 | 1 | 11249 | 4784 |
fir_decimate_asym | cfloat | float | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 8714 | 44 MSa/s | 8795 | 43 MSa/s | 5 | 1 | 10989 | 4162 |
fir_decimate_asym | cfloat | cfloat | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 44340 | 8 MSa/s | 44449 | 8 MSa/s | 5 | 1 | 15025 | 12474 |
fir_decimate_asym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 150 | 2560 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6108 | 2226 |
fir_decimate_asym | cfloat | cfloat | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 12341 | 31 MSa/s | 12426 | 30 MSa/s | 5 | 1 | 11753 | 5274 |
fir_decimate_asym | cint16 | int16 | 144 | 1 | 4 | 432 | 1 | 0 | 0 | 1128 | 382 MSa/s | 1225 | 352 MSa/s | 5 | 1 | 8394 | 3240 |
fir_decimate_asym | cint16 | int16 | 144 | 1 | 6 | 432 | 1 | 0 | 0 | 1970 | 219 MSa/s | 2068 | 208 MSa/s | 5 | 1 | 8106 | 3114 |
fir_decimate_asym | cfloat | cfloat | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 4589 | 83 MSa/s | 4666 | 82 MSa/s | 5 | 1 | 10785 | 3350 |
fir_decimate_asym | cint16 | int16 | 15 | 1 | 3 | 384 | 1 | 0 | 1 | 224 | 1714 MSa/s | 392 | 979 MSa/s | 8 | 1 | 6156 | 3142 |
fir_decimate_asym | cint16 | int16 | 15 | 1 | 3 | 384 | 1 | 0 | 0 | 183 | 2098 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6120 | 2394 |
fir_decimate_asym | cint16 | int16 | 63 | 1 | 3 | 384 | 1 | 0 | 1 | 742 | 517 MSa/s | 853 | 450 MSa/s | 8 | 1 | 7020 | 2704 |
fir_decimate_asym | cint16 | int16 | 60 | 1 | 6 | 384 | 1 | 0 | 0 | 523 | 734 MSa/s | 605 | 634 MSa/s | 5 | 1 | 6370 | 2748 |
fir_decimate_asym | cint16 | int16 | 60 | 1 | 5 | 320 | 1 | 0 | 0 | 354 | 903 MSa/s | 437 | 732 MSa/s | 5 | 1 | 5858 | 2440 |
fir_decimate_asym | cint16 | int16 | 60 | 1 | 4 | 512 | 1 | 0 | 0 | 674 | 759 MSa/s | 757 | 676 MSa/s | 5 | 1 | 7906 | 2440 |
fir_decimate_asym | cint16 | int16 | 60 | 1 | 3 | 384 | 1 | 0 | 0 | 611 | 628 MSa/s | 694 | 553 MSa/s | 5 | 1 | 6882 | 2432 |
fir_decimate_asym | cint16 | int16 | 33 | 1 | 3 | 384 | 1 | 0 | 0 | 421 | 912 MSa/s | 500 | 768 MSa/s | 5 | 1 | 6540 | 2216 |
fir_decimate_asym | cint16 | int16 | 28 | 1 | 7 | 448 | 1 | 0 | 0 | 310 | 1445 MSa/s | 453 | 988 MSa/s | 5 | 1 | 6370 | 2168 |
fir_decimate_asym | cint16 | int16 | 63 | 1 | 7 | 448 | 1 | 0 | 0 | 570 | 785 MSa/s | 654 | 685 MSa/s | 5 | 1 | 6888 | 2832 |
fir_decimate_asym | cint16 | int16 | 24 | 1 | 6 | 384 | 1 | 0 | 0 | 262 | 1465 MSa/s | 391 | 982 MSa/s | 5 | 1 | 5786 | 1806 |
fir_decimate_asym | cint16 | int16 | 237 | 1 | 3 | 768 | 1 | 0 | 0 | 4169 | 184 MSa/s | 4274 | 179 MSa/s | 5 | 1 | 13796 | 4128 |
fir_decimate_asym | cint16 | int16 | 237 | 1 | 3 | 384 | 1 | 0 | 1 | 2396 | 160 MSa/s | 2529 | 151 MSa/s | 8 | 1 | 10176 | 4602 |
fir_decimate_asym | cint16 | int16 | 237 | 1 | 3 | 384 | 1 | 0 | 0 | 2121 | 181 MSa/s | 2226 | 172 MSa/s | 5 | 1 | 9700 | 4128 |
fir_decimate_asym | cint16 | int16 | 21 | 1 | 3 | 384 | 1 | 0 | 1 | 387 | 992 MSa/s | 494 | 777 MSa/s | 8 | 1 | 6320 | 2648 |
fir_decimate_asym | cint16 | int16 | 18 | 1 | 3 | 384 | 1 | 0 | 1 | 323 | 1188 MSa/s | 428 | 897 MSa/s | 8 | 1 | 6298 | 2608 |
fir_decimate_hb | cint32 | cint32 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 6423 | 39 MSa/s | 6556 | 39 MSa/s | 5 | 1 | 15196 | 4352 |
fir_decimate_hb | cint32 | cint32 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 261 | 980 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8188 | 2426 |
fir_decimate_hb | cint32 | int16 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 21162 | 12 MSa/s | 21295 | 12 MSa/s | 5 | 1 | 12610 | 7252 |
fir_decimate_hb | cint32 | int16 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 1576 | 162 MSa/s | 1655 | 154 MSa/s | 5 | 1 | 8186 | 2446 |
fir_decimate_hb | cint32 | int16 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 2343 | 109 MSa/s | 2423 | 105 MSa/s | 5 | 1 | 8258 | 2740 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 128 | 4 | 0 | 0 | 230 196 120 124 | 556 653 1066 1032 MSa/s | 273 270 271 272 | 468 474 472 470 MSa/s | 11 | 4 | 16328 | 2174 1768 1768 1998 |
fir_decimate_hb | cint32 | cint32 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 2616 | 97 MSa/s | 2714 | 94 MSa/s | 5 | 1 | 10972 | 4170 |
fir_decimate_hb | cint32 | cint32 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 549 | 466 MSa/s | 628 | 407 MSa/s | 5 | 1 | 8476 | 2130 |
fir_decimate_hb | cint32 | cint16 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 518 | 494 MSa/s | 598 | 428 MSa/s | 5 | 1 | 8320 | 1808 |
fir_decimate_hb | cint32 | cint16 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 2184 | 117 MSa/s | 2282 | 112 MSa/s | 5 | 1 | 10288 | 3314 |
fir_decimate_hb | cint32 | cint16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8128 | 2110 |
fir_decimate_hb | cint32 | cint16 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 5047 | 50 MSa/s | 5180 | 49 MSa/s | 5 | 1 | 13472 | 4360 |
fir_decimate_hb | cint32 | cint16 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 454 | 563 MSa/s | 533 | 480 MSa/s | 5 | 1 | 8208 | 1948 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 256 | 1 | 0 | 1 | 552 | 463 MSa/s | 713 | 359 MSa/s | 8 | 1 | 6102 | 4230 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 552 | 463 MSa/s | 639 | 400 MSa/s | 5 | 1 | 6058 | 2374 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 128 | 5 | 0 | 0 | 123 127 230 196 120 | 1040 1007 556 653 1066 MSa/s | 271 273 274 269 271 | 472 468 467 475 472 MSa/s | 13 | 5 | 21170 | 2174 1768 1842 1842 2030 |
fir_decimate_hb | cint32 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 1576 | 162 MSa/s | 1654 | 154 MSa/s | 5 | 1 | 8114 | 2426 |
fir_decimate_hb | cint32 | cint32 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 453 | 565 MSa/s | 531 | 482 MSa/s | 5 | 1 | 8284 | 2114 |
fir_decimate_hb | cint32 | int16 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 8741 | 29 MSa/s | 8844 | 28 MSa/s | 5 | 1 | 9898 | 4024 |
fir_decimate_hb | float | float | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 5333 | 48 MSa/s | 5419 | 47 MSa/s | 5 | 1 | 6704 | 4978 |
fir_decimate_hb | cint32 | int32 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 518 | 494 MSa/s | 598 | 428 MSa/s | 5 | 1 | 8320 | 1808 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 128 | 3 | 0 | 0 | 256 222 152 | 500 576 842 MSa/s | 298 295 297 | 429 433 430 MSa/s | 9 | 3 | 11934 | 2358 1952 2106 |
fir_decimate_hb | int32 | int32 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 1162 | 220 MSa/s | 1249 | 204 MSa/s | 5 | 1 | 6448 | 2320 |
fir_decimate_hb | int32 | int32 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 118 | 2169 MSa/s | 262 | 977 MSa/s | 5 | 1 | 4992 | 2182 |
fir_decimate_hb | int32 | int32 | 27 | 1 | 2 | 256 | 1 | 0 | 0 | 485 | 527 MSa/s | 562 | 455 MSa/s | 5 | 1 | 5296 | 1816 |
fir_decimate_hb | int32 | int32 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 1430 | 179 MSa/s | 1533 | 166 MSa/s | 5 | 1 | 8480 | 3522 |
fir_decimate_hb | int32 | int32 | 23 | 1 | 2 | 256 | 1 | 0 | 0 | 389 | 658 MSa/s | 465 | 550 MSa/s | 5 | 1 | 5216 | 1800 |
fir_decimate_hb | int32 | int16 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 3764 | 68 MSa/s | 3852 | 66 MSa/s | 5 | 1 | 6058 | 3656 |
fir_decimate_hb | int32 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 86 | 2976 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 2106 |
fir_decimate_hb | cint32 | int32 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 454 | 563 MSa/s | 533 | 480 MSa/s | 5 | 1 | 8208 | 1860 |
fir_decimate_hb | int32 | int16 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 8316 | 30 MSa/s | 8419 | 30 MSa/s | 5 | 1 | 7618 | 6370 |
fir_decimate_hb | int32 | int16 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 103 | 2485 MSa/s | 261 | 980 MSa/s | 5 | 1 | 5058 | 2090 |
fir_decimate_hb | float | float | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 295 | 867 MSa/s | 370 | 691 MSa/s | 5 | 1 | 5248 | 2348 |
fir_decimate_hb | float | float | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 12290 | 20 MSa/s | 12393 | 20 MSa/s | 5 | 1 | 8736 | 8680 |
fir_decimate_hb | float | float | 19 | 1 | 2 | 256 | 1 | 0 | 0 | 1287 | 198 MSa/s | 1364 | 187 MSa/s | 5 | 1 | 5456 | 3096 |
fir_decimate_hb | float | float | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 787 | 325 MSa/s | 863 | 296 MSa/s | 5 | 1 | 5376 | 2454 |
fir_decimate_hb | cint32 | int32 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 2184 | 117 MSa/s | 2282 | 112 MSa/s | 5 | 1 | 10288 | 3314 |
fir_decimate_hb | cint32 | int32 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8128 | 2038 |
fir_decimate_hb | cint32 | int32 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 5047 | 50 MSa/s | 5180 | 49 MSa/s | 5 | 1 | 13472 | 4360 |
fir_decimate_hb | int32 | int16 | 19 | 1 | 2 | 256 | 1 | 0 | 0 | 982 | 260 MSa/s | 1059 | 241 MSa/s | 5 | 1 | 5130 | 2572 |
fir_decimate_hb | cint16 | int16 | 99 | 1 | 2 | 128 | 2 | 0 | 0 | 264 230 | 484 556 MSa/s | 305 303 | 419 422 MSa/s | 7 | 2 | 7988 | 2472 2242 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cfloat | float | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 32114 | 7 MSa/s | 32247 | 7 MSa/s | 5 | 1 | 13728 | 10120 |
fir_decimate_hb | cint16 | int16 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 103 | 2485 MSa/s | 261 | 980 MSa/s | 5 | 1 | 5058 | 1956 |
fir_decimate_hb | cint16 | int16 | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 103 | 2485 MSa/s | 272 | 941 MSa/s | 5 | 1 | 5058 | 1972 |
fir_decimate_hb | cint16 | int16 | 11 | 1 | 2 | 256 | 1 | 0 | 1 | 71 | 3605 MSa/s | 262 | 977 MSa/s | 8 | 1 | 5070 | 2358 |
fir_decimate_hb | cint16 | int16 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 71 | 3605 MSa/s | 261 | 980 MSa/s | 5 | 1 | 5050 | 1904 |
fir_decimate_hb | cint16 | cint16 | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 1158 | 221 MSa/s | 1245 | 205 MSa/s | 5 | 1 | 6448 | 2336 |
fir_decimate_hb | cint16 | cint16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 118 | 2169 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4992 | 2264 |
fir_decimate_hb | cint16 | cint16 | 27 | 1 | 2 | 256 | 1 | 0 | 0 | 485 | 527 MSa/s | 562 | 455 MSa/s | 5 | 1 | 5296 | 1816 |
fir_decimate_hb | cint16 | cint16 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 1421 | 180 MSa/s | 1524 | 167 MSa/s | 5 | 1 | 8480 | 3478 |
fir_decimate_hb | cint16 | int16 | 15 | 1 | 2 | 256 | 1 | 0 | 1 | 103 | 2485 MSa/s | 262 | 977 MSa/s | 8 | 1 | 5086 | 2428 |
fir_decimate_hb | cint16 | cint16 | 23 | 1 | 2 | 256 | 1 | 0 | 0 | 389 | 658 MSa/s | 465 | 550 MSa/s | 5 | 1 | 5216 | 1800 |
fir_decimate_hb | cfloat | float | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 1703 | 150 MSa/s | 1781 | 143 MSa/s | 5 | 1 | 8384 | 2484 |
fir_decimate_hb | cfloat | float | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 2630 | 97 MSa/s | 2710 | 94 MSa/s | 5 | 1 | 8576 | 2996 |
fir_decimate_hb | cfloat | float | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 1577 | 162 MSa/s | 1657 | 154 MSa/s | 5 | 1 | 8464 | 2548 |
fir_decimate_hb | cfloat | cfloat | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 16527 | 15 MSa/s | 16625 | 15 MSa/s | 5 | 1 | 11228 | 6426 |
fir_decimate_hb | cfloat | cfloat | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 1579 | 162 MSa/s | 1658 | 154 MSa/s | 5 | 1 | 8444 | 2570 |
fir_decimate_hb | cfloat | cfloat | 15 | 1 | 2 | 256 | 1 | 0 | 0 | 3298 | 77 MSa/s | 3379 | 75 MSa/s | 5 | 1 | 8732 | 2618 |
fir_decimate_hb | cfloat | cfloat | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 1961 | 130 MSa/s | 2041 | 125 MSa/s | 5 | 1 | 8540 | 2794 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cfloat | float | 99 | 1 | 2 | 256 | 1 | 0 | 0 | 13803 | 18 MSa/s | 13901 | 18 MSa/s | 5 | 1 | 10544 | 5698 |
fir_decimate_hb | cint16 | int16 | 19 | 1 | 2 | 256 | 1 | 0 | 0 | 104 | 2461 MSa/s | 262 | 977 MSa/s | 5 | 1 | 5130 | 1956 |
fir_decimate_hb | cint16 | int16 | 11 | 1 | 2 | 256 | 1 | 0 | 0 | 71 | 3605 MSa/s | 267 | 958 MSa/s | 5 | 1 | 5050 | 1904 |
fir_decimate_hb | cint16 | int16 | 23 | 1 | 2 | 256 | 1 | 0 | 0 | 104 | 2461 MSa/s | 272 | 941 MSa/s | 5 | 1 | 5138 | 1892 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 19 | 1 | 2 | 256 | 1 | 0 | 1 | 104 | 2461 MSa/s | 262 | 977 MSa/s | 8 | 1 | 5158 | 2492 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 64 | 1 | 0 | 0 | 51 | 1254 MSa/s | 123 | 520 MSa/s | 5 | 1 | 2674 | 1722 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 512 | 1 | 0 | 0 | 102 | 5019 MSa/s | 517 | 990 MSa/s | 5 | 1 | 8050 | 1862 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 1 | 70 | 3657 MSa/s | 262 | 977 MSa/s | 8 | 1 | 4998 | 2278 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 1024 | 1 | 0 | 0 | 166 | 6168 MSa/s | 1030 | 994 MSa/s | 5 | 1 | 14194 | 1862 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 256 | 1 | 0 | 0 | 70 | 3657 MSa/s | 261 | 980 MSa/s | 5 | 1 | 4978 | 1878 |
fir_decimate_hb | cint16 | int16 | 55 | 1 | 2 | 256 | 1 | 1 | 0 | 173 | 1479 MSa/s | 320 | 800 MSa/s | 7 | 1 | 7986 | 2414 |
fir_decimate_hb | cint16 | int16 | 27 | 1 | 2 | 256 | 1 | 0 | 1 | 235 | 1089 MSa/s | 340 | 752 MSa/s | 8 | 1 | 5246 | 2510 |
fir_decimate_hb | cint16 | int16 | 27 | 1 | 2 | 256 | 1 | 0 | 0 | 201 | 1273 MSa/s | 279 | 917 MSa/s | 5 | 1 | 5210 | 1914 |
fir_decimate_hb | cint16 | int16 | 27 | 1 | 2 | 256 | 1 | 1 | 0 | 139 | 1841 MSa/s | 277 | 924 MSa/s | 7 | 1 | 7514 | 2218 |
fir_decimate_hb | cint16 | int16 | 239 | 1 | 2 | 256 | 1 | 0 | 1 | 1278 | 200 MSa/s | 1410 | 181 MSa/s | 8 | 1 | 7742 | 3952 |
fir_decimate_hb | cint16 | int16 | 7 | 1 | 2 | 128 | 1 | 0 | 0 | 57 | 2245 MSa/s | 133 | 962 MSa/s | 5 | 1 | 3442 | 1816 |
fir_decimate_hb | cint16 | int16 | 239 | 1 | 2 | 256 | 1 | 0 | 0 | 1216 | 210 MSa/s | 1319 | 194 MSa/s | 5 | 1 | 7618 | 3638 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 5 | 0 | 0 | 327 317 317 318 319 | 1174 1211 1211 1207 1203 MSa/s | 519 515 516 517 518 | 739 745 744 742 741 MSa/s | 13 | 5 | 32656 | 2448 2248 2100 2106 2376 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 4 | 0 | 0 | 358 317 317 331 | 1072 1211 1211 1160 MSa/s | 526 523 524 525 | 730 734 732 731 MSa/s | 11 | 4 | 25056 | 2448 2248 2242 2392 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 3 | 0 | 0 | 391 349 361 | 982 1100 1063 MSa/s | 558 556 557 | 688 690 689 MSa/s | 9 | 3 | 18288 | 2494 2272 2538 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 2 | 0 | 0 | 453 421 | 847 912 MSa/s | 564 564 | 680 680 MSa/s | 7 | 2 | 12352 | 2722 2718 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 1 | 773 | 496 MSa/s | 890 | 431 MSa/s | 8 | 1 | 7324 | 3056 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 192 | 1 | 0 | 0 | 77 | 2493 MSa/s | 196 | 979 MSa/s | 5 | 1 | 3995 | 2160 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 1 | 117 | 3282 MSa/s | 390 | 984 MSa/s | 8 | 1 | 6071 | 2508 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2160 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 394 | 974 MSa/s | 5 | 1 | 6043 | 2160 |
fir_decimate_sym | cint32 | cint16 | 24 | 1 | 2 | 384 | 1 | 0 | 0 | 900 | 426 MSa/s | 987 | 389 MSa/s | 5 | 1 | 11627 | 2306 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint16 | int16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 711 | 540 MSa/s | 801 | 479 MSa/s | 5 | 1 | 7248 | 3650 |
fir_decimate_sym | cint32 | cint16 | 30 | 1 | 2 | 384 | 1 | 0 | 0 | 1619 | 237 MSa/s | 1708 | 224 MSa/s | 5 | 1 | 11779 | 2426 |
fir_decimate_sym | cint16 | int16 | 28 | 1 | 2 | 256 | 1 | 0 | 0 | 608 | 421 MSa/s | 686 | 373 MSa/s | 5 | 1 | 5218 | 2244 |
fir_decimate_sym | cint32 | cint16 | 96 | 1 | 2 | 384 | 1 | 0 | 0 | 2635 | 145 MSa/s | 2742 | 140 MSa/s | 5 | 1 | 13451 | 3546 |
fir_decimate_sym | cint32 | cint32 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 1137 | 337 MSa/s | 1219 | 315 MSa/s | 5 | 1 | 10772 | 2694 |
fir_decimate_sym | cint32 | cint32 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 1247 | 307 MSa/s | 1331 | 288 MSa/s | 5 | 1 | 11068 | 2926 |
fir_decimate_sym | cint32 | cint32 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 496 | 774 MSa/s | 777 | 494 MSa/s | 5 | 1 | 10388 | 2108 |
fir_decimate_sym | cint32 | cint32 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 4350 | 88 MSa/s | 4459 | 86 MSa/s | 5 | 1 | 13572 | 4026 |
fir_decimate_sym | cint32 | int32 | 24 | 1 | 2 | 384 | 1 | 0 | 0 | 900 | 426 MSa/s | 987 | 389 MSa/s | 5 | 1 | 11627 | 2294 |
fir_decimate_sym | cint32 | int32 | 30 | 1 | 2 | 384 | 1 | 0 | 0 | 1619 | 237 MSa/s | 1708 | 224 MSa/s | 5 | 1 | 11779 | 2414 |
fir_decimate_sym | cint32 | int32 | 8 | 1 | 2 | 384 | 1 | 0 | 0 | 372 | 1032 MSa/s | 776 | 494 MSa/s | 5 | 1 | 11180 | 1814 |
fir_decimate_sym | cint32 | int32 | 96 | 1 | 2 | 384 | 1 | 0 | 0 | 2635 | 145 MSa/s | 2742 | 140 MSa/s | 5 | 1 | 13451 | 3546 |
fir_decimate_sym | int32 | int32 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 547 | 702 MSa/s | 626 | 613 MSa/s | 5 | 1 | 6304 | 2090 |
fir_decimate_sym | int32 | int32 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 739 | 519 MSa/s | 819 | 468 MSa/s | 5 | 1 | 6404 | 2264 |
fir_decimate_sym | int32 | int32 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 147 | 2612 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6095 | 2404 |
fir_decimate_sym | int32 | int32 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 1135 | 338 MSa/s | 1225 | 313 MSa/s | 5 | 1 | 7736 | 3064 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint32 | cint16 | 8 | 1 | 2 | 384 | 1 | 0 | 0 | 372 | 1032 MSa/s | 776 | 494 MSa/s | 5 | 1 | 11180 | 1814 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint16 | int16 | 237 | 1 | 3 | 384 | 1 | 0 | 1 | 1467 | 261 MSa/s | 1600 | 240 MSa/s | 8 | 1 | 9224 | 3970 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint16 | cint16 | 21 | 1 | 3 | 384 | 1 | 0 | 0 | 548 | 700 MSa/s | 627 | 612 MSa/s | 5 | 1 | 6302 | 2102 |
fir_decimate_sym | cint16 | cint16 | 30 | 1 | 3 | 384 | 1 | 0 | 0 | 739 | 519 MSa/s | 819 | 468 MSa/s | 5 | 1 | 6402 | 2276 |
fir_decimate_sym | cint16 | cint16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 147 | 2612 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6094 | 2408 |
fir_decimate_sym | cint16 | cint16 | 99 | 1 | 3 | 384 | 1 | 0 | 0 | 1135 | 338 MSa/s | 1225 | 313 MSa/s | 5 | 1 | 7734 | 3080 |
fir_decimate_sym | cint16 | int16 | 100 | 1 | 2 | 256 | 1 | 0 | 0 | 1383 | 185 MSa/s | 1472 | 173 MSa/s | 5 | 1 | 6226 | 2794 |
fir_decimate_sym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 0 | 118 | 3254 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6049 | 2248 |
fir_decimate_sym | cint16 | int16 | 12 | 1 | 2 | 256 | 1 | 0 | 0 | 102 | 2509 MSa/s | 260 | 984 MSa/s | 5 | 1 | 5025 | 2076 |
fir_decimate_sym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 0 | 118 | 3254 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6049 | 2248 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint16 | int16 | 15 | 1 | 3 | 384 | 1 | 0 | 0 | 258 | 1488 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6055 | 1894 |
fir_decimate_sym | cint16 | int16 | 15 | 1 | 3 | 384 | 1 | 0 | 1 | 284 | 1352 MSa/s | 393 | 977 MSa/s | 8 | 1 | 6091 | 2220 |
fir_decimate_sym | cint16 | int16 | 16 | 1 | 2 | 384 | 1 | 0 | 0 | 134 | 2865 MSa/s | 396 | 969 MSa/s | 5 | 1 | 6699 | 2086 |
fir_decimate_sym | cint16 | int16 | 18 | 1 | 3 | 384 | 1 | 0 | 0 | 259 | 1482 MSa/s | 390 | 984 MSa/s | 5 | 1 | 6158 | 1982 |
fir_decimate_sym | cint16 | int16 | 18 | 1 | 3 | 384 | 1 | 0 | 1 | 286 | 1342 MSa/s | 393 | 977 MSa/s | 8 | 1 | 6170 | 2324 |
fir_decimate_sym | cint16 | int16 | 12 | 1 | 3 | 384 | 1 | 0 | 1 | 118 | 3254 MSa/s | 390 | 984 MSa/s | 8 | 1 | 6077 | 2620 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2176 |
fir_decimate_sym | cint16 | int16 | 237 | 1 | 3 | 384 | 1 | 0 | 0 | 1323 | 290 MSa/s | 1427 | 269 MSa/s | 5 | 1 | 8996 | 3580 |
fir_decimate_sym | cint16 | int16 | 9 | 1 | 3 | 384 | 1 | 0 | 0 | 117 | 3282 MSa/s | 389 | 987 MSa/s | 5 | 1 | 6043 | 2160 |
fir_decimate_sym | cint16 | int16 | 8 | 1 | 2 | 256 | 1 | 0 | 0 | 69 | 3710 MSa/s | 260 | 984 MSa/s | 5 | 1 | 4952 | 1908 |
fir_decimate_sym | cint16 | int16 | 60 | 1 | 3 | 384 | 1 | 0 | 0 | 484 | 793 MSa/s | 567 | 677 MSa/s | 5 | 1 | 6658 | 2708 |
fir_decimate_sym | cint16 | int16 | 60 | 1 | 2 | 384 | 1 | 0 | 0 | 1377 | 278 MSa/s | 1460 | 263 MSa/s | 5 | 1 | 7170 | 2696 |
fir_decimate_sym | cint16 | int16 | 60 | 1 | 2 | 384 | 1 | 1 | 0 | 662 | 580 MSa/s | 792 | 484 MSa/s | 7 | 1 | 10754 | 3004 |
fir_decimate_sym | cint16 | int16 | 27 | 1 | 3 | 384 | 1 | 0 | 0 | 358 | 1072 MSa/s | 438 | 876 MSa/s | 5 | 1 | 6240 | 2182 |
fir_decimate_sym | cint16 | int16 | 26 | 1 | 2 | 256 | 1 | 0 | 0 | 356 | 719 MSa/s | 434 | 589 MSa/s | 5 | 1 | 5214 | 2158 |
fir_decimate_sym | cint16 | int16 | 24 | 1 | 2 | 256 | 1 | 0 | 0 | 259 | 988 MSa/s | 336 | 761 MSa/s | 5 | 1 | 5146 | 2004 |
fir_decimate_sym | cint16 | int16 | 240 | 1 | 2 | 256 | 1 | 0 | 0 | 1296 | 197 MSa/s | 1401 | 182 MSa/s | 5 | 1 | 7978 | 3314 |
fir_decimate_sym | cint16 | int16 | 237 | 1 | 3 | 768 | 1 | 0 | 0 | 2603 | 295 MSa/s | 2707 | 283 MSa/s | 5 | 1 | 13092 | 3580 |
fir_decimate_sym | cint16 | int16 | 27 | 1 | 3 | 384 | 1 | 0 | 1 | 394 | 974 MSa/s | 500 | 768 MSa/s | 8 | 1 | 6268 | 2476 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1962 |
fir_interpolate_asym | cint32 | cint16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 554 | 462 MSa/s | 1028 | 249 MSa/s | 5 | 1 | 14300 | 2428 |
fir_interpolate_asym | cint32 | cint16 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 4570 | 56 MSa/s | 4656 | 54 MSa/s | 5 | 1 | 16092 | 4314 |
fir_interpolate_asym | cint32 | cint16 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 2362 | 108 MSa/s | 2443 | 104 MSa/s | 5 | 1 | 15068 | 2972 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1962 |
fir_interpolate_asym | cint32 | cint32 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 5041 | 50 MSa/s | 5122 | 49 MSa/s | 5 | 1 | 15836 | 3580 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1946 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1946 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1930 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1930 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 64 | 1 | 0 | 0 | 71 | 901 MSa/s | 144 | 444 MSa/s | 5 | 1 | 3484 | 1930 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1962 |
fir_interpolate_asym | cint32 | cint32 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 9426 | 27 MSa/s | 9510 | 26 MSa/s | 5 | 1 | 17628 | 4104 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 512 | 1 | 0 | 0 | 295 | 1735 MSa/s | 1023 | 500 MSa/s | 5 | 1 | 14236 | 1946 |
fir_interpolate_asym | cint32 | int16 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 1644 | 155 MSa/s | 1726 | 148 MSa/s | 5 | 1 | 14684 | 2178 |
fir_interpolate_asym | cint32 | int16 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 2681 | 95 MSa/s | 2765 | 92 MSa/s | 5 | 1 | 15324 | 3112 |
fir_interpolate_asym | cint32 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 296 | 864 MSa/s | 1023 | 250 MSa/s | 5 | 1 | 14204 | 1850 |
fir_interpolate_asym | cint32 | int32 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 2362 | 108 MSa/s | 2443 | 104 MSa/s | 5 | 1 | 15068 | 2972 |
fir_interpolate_asym | cint32 | int32 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 4570 | 56 MSa/s | 4656 | 54 MSa/s | 5 | 1 | 16092 | 4314 |
fir_interpolate_asym | cint32 | int32 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 554 | 462 MSa/s | 1028 | 249 MSa/s | 5 | 1 | 14300 | 2428 |
fir_interpolate_asym | int32 | int16 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 748 | 342 MSa/s | 826 | 309 MSa/s | 5 | 1 | 8540 | 2130 |
fir_interpolate_asym | int32 | int16 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 1209 | 211 MSa/s | 1287 | 198 MSa/s | 5 | 1 | 9180 | 3118 |
fir_interpolate_asym | int32 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1930 |
fir_interpolate_asym | int32 | int32 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 1144 | 223 MSa/s | 1221 | 209 MSa/s | 5 | 1 | 9052 | 2884 |
fir_interpolate_asym | int32 | int32 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 2202 | 116 MSa/s | 2280 | 112 MSa/s | 5 | 1 | 10204 | 4210 |
fir_interpolate_asym | int32 | int32 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 297 | 861 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8220 | 2536 |
fir_interpolate_asym | cint32 | cint32 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 1724 | 148 MSa/s | 1801 | 142 MSa/s | 5 | 1 | 14492 | 2844 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 1 | 167 | 1532 MSa/s | 520 | 492 MSa/s | 8 | 1 | 8128 | 2950 |
fir_interpolate_asym | cint16 | int16 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 1209 | 211 MSa/s | 1287 | 198 MSa/s | 5 | 1 | 9180 | 3118 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 128 | 1 | 0 | 0 | 103 | 1242 MSa/s | 263 | 486 MSa/s | 5 | 1 | 5020 | 1930 |
fir_interpolate_asym | cint16 | cint16 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 1144 | 223 MSa/s | 1221 | 209 MSa/s | 5 | 1 | 9052 | 2884 |
fir_interpolate_asym | cint16 | cint16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 297 | 861 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8220 | 2536 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 1 | 0 | 0 | 2331 | 109 MSa/s | 2415 | 106 MSa/s | 5 | 1 | 10460 | 4492 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 1 | 0 | 1 | 2541 | 100 MSa/s | 2651 | 96 MSa/s | 8 | 1 | 10720 | 5330 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 2 | 0 | 0 | 1298 1267 | 197 202 MSa/s | 1386 1385 | 184 184 MSa/s | 7 | 2 | 16680 | 3176 3114 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 3 | 0 | 0 | 992 1008 1034 | 258 253 247 MSa/s | 1174 1159 1161 | 218 220 220 MSa/s | 9 | 3 | 24132 | 2858 3288 3398 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 4 | 0 | 0 | 782 741 749 765 | 327 345 341 334 MSa/s | 912 908 909 910 | 280 281 281 281 MSa/s | 11 | 4 | 32528 | 2192 2036 2036 2122 |
fir_interpolate_asym | cint16 | int16 | 128 | 2 | 1 | 256 | 5 | 0 | 0 | 753 764 740 742 749 | 339 335 345 345 341 MSa/s | 894 917 891 892 893 | 286 279 287 286 286 MSa/s | 13 | 5 | 42476 | 3160 3008 3008 3060 2034 |
fir_interpolate_asym | cint16 | int16 | 16 | 2 | 1 | 256 | 1 | 0 | 0 | 296 | 864 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8220 | 2412 |
fir_interpolate_asym | cint16 | int16 | 16 | 2 | 1 | 256 | 1 | 0 | 1 | 296 | 864 MSa/s | 530 | 483 MSa/s | 8 | 1 | 8256 | 4550 |
fir_interpolate_asym | cint16 | int16 | 240 | 2 | 1 | 256 | 1 | 0 | 0 | 4524 | 56 MSa/s | 4619 | 55 MSa/s | 5 | 1 | 12700 | 4750 |
fir_interpolate_asym | cint16 | int16 | 240 | 2 | 1 | 256 | 1 | 0 | 1 | 4925 | 51 MSa/s | 5043 | 50 MSa/s | 8 | 1 | 13184 | 5624 |
fir_interpolate_asym | cint16 | cint16 | 64 | 2 | 1 | 256 | 1 | 0 | 0 | 2202 | 116 MSa/s | 2280 | 112 MSa/s | 5 | 1 | 10204 | 4210 |
fir_interpolate_asym | cint16 | int16 | 24 | 3 | 1 | 256 | 1 | 0 | 0 | 592 | 432 MSa/s | 774 | 330 MSa/s | 5 | 1 | 10680 | 3546 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 1024 | 1 | 0 | 0 | 551 | 1858 MSa/s | 2037 | 502 MSa/s | 5 | 1 | 26524 | 1946 |
fir_interpolate_asym | cint16 | int16 | 64 | 2 | 1 | 256 | 1 | 0 | 1 | 1311 | 195 MSa/s | 1418 | 180 MSa/s | 8 | 1 | 9312 | 3714 |
fir_interpolate_asym | cint16 | int16 | 8 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8092 | 1930 |
fir_interpolate_asym | cint16 | int16 | 24 | 2 | 1 | 256 | 1 | 0 | 0 | 620 | 412 MSa/s | 697 | 367 MSa/s | 5 | 1 | 8444 | 1988 |
fir_interpolate_asym | cint16 | int16 | 32 | 2 | 1 | 256 | 1 | 0 | 0 | 748 | 342 MSa/s | 826 | 309 MSa/s | 5 | 1 | 8540 | 2130 |
fir_interpolate_asym | cint16 | int16 | 32 | 2 | 1 | 256 | 1 | 0 | 1 | 806 | 317 MSa/s | 910 | 281 MSa/s | 8 | 1 | 8608 | 3000 |
fir_interpolate_asym | cint16 | int16 | 30 | 3 | 1 | 256 | 1 | 0 | 0 | 855 | 299 MSa/s | 933 | 274 MSa/s | 5 | 1 | 10876 | 3826 |
fir_interpolate_asym | cint16 | int16 | 30 | 2 | 1 | 256 | 1 | 0 | 0 | 812 | 315 MSa/s | 888 | 288 MSa/s | 5 | 1 | 8532 | 3244 |
fir_interpolate_asym | cint16 | int16 | 24 | 8 | 1 | 256 | 1 | 0 | 0 | 1408 | 181 MSa/s | 2045 | 125 MSa/s | 5 | 1 | 20900 | 3212 |
fir_interpolate_asym | cint16 | int16 | 24 | 6 | 1 | 256 | 1 | 0 | 0 | 1098 | 233 MSa/s | 1537 | 166 MSa/s | 5 | 1 | 16812 | 3898 |
fir_interpolate_asym | cint16 | int16 | 24 | 4 | 1 | 256 | 1 | 0 | 0 | 742 | 345 MSa/s | 1030 | 248 MSa/s | 5 | 1 | 12500 | 2010 |
fir_interpolate_asym | cint16 | int16 | 30 | 5 | 1 | 256 | 1 | 0 | 0 | 918 | 278 MSa/s | 1282 | 199 MSa/s | 5 | 1 | 15052 | 3826 |
fir_interpolate_fract_asym | cint16 | int16 | 60 | 5 | 2 | 256 | 1 | 0 | 0 | 640 | 400 MSa/s | 722 | 354 MSa/s | 5 | 1 | 10196 | 3674 |
fir_interpolate_fract_asym | cint16 | int16 | 60 | 5 | 2 | 240 | 1 | 0 | 0 | 603 | 398 MSa/s | 685 | 350 MSa/s | 5 | 1 | 9748 | 3674 |
fir_interpolate_fract_asym | cint16 | int16 | 48 | 4 | 3 | 144 | 1 | 0 | 0 | 266 | 541 MSa/s | 351 | 410 MSa/s | 5 | 1 | 5364 | 2016 |
fir_interpolate_fract_asym | cint16 | int16 | 41 | 5 | 4 | 256 | 1 | 0 | 0 | 339 | 755 MSa/s | 422 | 606 MSa/s | 5 | 1 | 7350 | 3842 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 9 | 5 | 360 | 1 | 0 | 0 | 441 | 816 MSa/s | 642 | 560 MSa/s | 5 | 1 | 11492 | 5170 |
fir_interpolate_fract_asym | cint16 | int16 | 40 | 10 | 3 | 120 | 1 | 0 | 0 | 264 | 454 MSa/s | 403 | 297 MSa/s | 5 | 1 | 7388 | 4624 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 9 | 8 | 576 | 1 | 0 | 0 | 479 | 1202 MSa/s | 647 | 890 MSa/s | 5 | 1 | 13220 | 4596 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 9 | 7 | 504 | 1 | 0 | 0 | 475 | 1061 MSa/s | 647 | 778 MSa/s | 5 | 1 | 12644 | 5216 |
fir_interpolate_fract_asym | cint16 | int16 | 60 | 5 | 3 | 240 | 1 | 0 | 0 | 480 | 500 MSa/s | 562 | 427 MSa/s | 5 | 1 | 8148 | 3754 |
fir_interpolate_fract_asym | cint16 | int16 | 40 | 10 | 7 | 280 | 1 | 0 | 0 | 294 | 952 MSa/s | 407 | 687 MSa/s | 5 | 1 | 8668 | 4702 |
fir_interpolate_fract_asym | cint16 | int16 | 60 | 5 | 4 | 480 | 1 | 0 | 0 | 704 | 681 MSa/s | 786 | 610 MSa/s | 5 | 1 | 11668 | 3720 |
fir_interpolate_fract_asym | cint16 | int16 | 84 | 7 | 6 | 336 | 1 | 0 | 0 | 482 | 697 MSa/s | 565 | 594 MSa/s | 5 | 1 | 9604 | 4266 |
fir_interpolate_fract_asym | cint16 | int16 | 75 | 5 | 4 | 160 | 3 | 0 | 1 | 501 478 429 | 319 334 372 MSa/s | 538 546 530 | 297 293 301 MSa/s | 12 | 3 | 15746 | 4280 3868 3768 |
fir_interpolate_fract_asym | cint16 | int16 | 84 | 7 | 2 | 336 | 1 | 0 | 0 | 1142 | 294 MSa/s | 1225 | 274 MSa/s | 5 | 1 | 15876 | 4262 |
fir_interpolate_fract_asym | cint16 | int16 | 84 | 7 | 3 | 168 | 1 | 0 | 0 | 452 | 371 MSa/s | 534 | 314 MSa/s | 5 | 1 | 8260 | 4138 |
fir_interpolate_fract_asym | cint16 | int16 | 84 | 7 | 4 | 672 | 1 | 0 | 0 | 1350 | 497 MSa/s | 1432 | 469 MSa/s | 5 | 1 | 18564 | 4194 |
fir_interpolate_fract_asym | cint16 | int16 | 84 | 7 | 5 | 840 | 1 | 0 | 0 | 1373 | 611 MSa/s | 1456 | 576 MSa/s | 5 | 1 | 19908 | 4234 |
fir_interpolate_fract_asym | cint16 | int16 | 95 | 8 | 5 | 320 | 1 | 0 | 0 | 576 | 555 MSa/s | 658 | 486 MSa/s | 5 | 1 | 10708 | 4010 |
fir_interpolate_fract_asym | cint16 | int16 | 96 | 8 | 3 | 192 | 1 | 0 | 0 | 487 | 394 MSa/s | 569 | 337 MSa/s | 5 | 1 | 9700 | 3374 |
fir_interpolate_fract_asym | cint16 | int16 | 96 | 8 | 5 | 480 | 1 | 0 | 0 | 840 | 571 MSa/s | 922 | 520 MSa/s | 5 | 1 | 14052 | 3470 |
fir_interpolate_fract_asym | cint16 | int16 | 96 | 8 | 7 | 672 | 1 | 0 | 0 | 906 | 741 MSa/s | 987 | 680 MSa/s | 5 | 1 | 15588 | 3416 |
fir_interpolate_fract_asym | cint32 | cint32 | 60 | 5 | 2 | 256 | 1 | 0 | 0 | 5361 | 47 MSa/s | 5445 | 47 MSa/s | 5 | 1 | 21660 | 4986 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 9 | 4 | 288 | 1 | 0 | 0 | 444 | 648 MSa/s | 643 | 447 MSa/s | 5 | 1 | 10916 | 4484 |
fir_interpolate_fract_asym | cint16 | int16 | 72 | 6 | 5 | 360 | 1 | 0 | 0 | 517 | 696 MSa/s | 599 | 601 MSa/s | 5 | 1 | 9644 | 3248 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 9 | 2 | 144 | 1 | 0 | 0 | 395 | 364 MSa/s | 637 | 226 MSa/s | 5 | 1 | 9764 | 4418 |
fir_interpolate_fract_asym | cint16 | int16 | 40 | 10 | 9 | 360 | 1 | 0 | 0 | 305 | 1180 MSa/s | 408 | 882 MSa/s | 5 | 1 | 9308 | 4822 |
fir_interpolate_fract_asym | cint16 | int16 | 32 | 8 | 7 | 224 | 1 | 0 | 0 | 195 | 1148 MSa/s | 282 | 794 MSa/s | 5 | 1 | 6436 | 1812 |
fir_interpolate_fract_asym | cint16 | int16 | 36 | 3 | 2 | 144 | 1 | 0 | 0 | 292 | 493 MSa/s | 374 | 385 MSa/s | 5 | 1 | 5412 | 4700 |
fir_interpolate_fract_asym | cint16 | int16 | 108 | 9 | 2 | 432 | 1 | 0 | 0 | 1775 | 243 MSa/s | 1944 | 222 MSa/s | 5 | 1 | 25204 | 5930 |
fir_interpolate_fract_asym | cint16 | int16 | 108 | 9 | 4 | 864 | 1 | 0 | 0 | 1961 | 440 MSa/s | 2044 | 422 MSa/s | 5 | 1 | 28660 | 6074 |
fir_interpolate_fract_asym | cint16 | int16 | 108 | 9 | 5 | 1080 | 1 | 0 | 0 | 2094 | 515 MSa/s | 2177 | 496 MSa/s | 5 | 1 | 30388 | 5874 |
fir_interpolate_fract_asym | cint16 | int16 | 108 | 9 | 8 | 1728 | 1 | 0 | 0 | 2256 | 765 MSa/s | 2338 | 739 MSa/s | 5 | 1 | 35572 | 6022 |
fir_interpolate_fract_asym | cint16 | int16 | 120 | 10 | 3 | 240 | 1 | 0 | 0 | 781 | 307 MSa/s | 863 | 278 MSa/s | 5 | 1 | 14172 | 4778 |
fir_interpolate_fract_asym | cint16 | int16 | 120 | 10 | 7 | 840 | 1 | 0 | 0 | 1449 | 579 MSa/s | 1531 | 548 MSa/s | 5 | 1 | 22172 | 4802 |
fir_interpolate_fract_asym | cint16 | int16 | 120 | 10 | 9 | 720 | 1 | 0 | 0 | 992 | 725 MSa/s | 1075 | 669 MSa/s | 5 | 1 | 18012 | 4754 |
fir_interpolate_fract_asym | cint16 | int16 | 12 | 3 | 2 | 48 | 1 | 0 | 0 | 80 | 600 MSa/s | 158 | 303 MSa/s | 5 | 1 | 3028 | 1768 |
fir_interpolate_fract_asym | cint16 | int16 | 16 | 4 | 3 | 48 | 1 | 0 | 0 | 75 | 640 MSa/s | 153 | 313 MSa/s | 5 | 1 | 2996 | 1724 |
fir_interpolate_fract_asym | cint16 | int16 | 20 | 5 | 2 | 80 | 1 | 0 | 0 | 144 | 555 MSa/s | 224 | 357 MSa/s | 5 | 1 | 4548 | 2022 |
fir_interpolate_fract_asym | cint16 | int16 | 20 | 5 | 3 | 120 | 1 | 0 | 0 | 161 | 745 MSa/s | 243 | 493 MSa/s | 5 | 1 | 4868 | 2082 |
fir_interpolate_fract_asym | cint16 | int16 | 108 | 9 | 7 | 1512 | 1 | 0 | 0 | 2311 | 654 MSa/s | 2394 | 631 MSa/s | 5 | 1 | 33844 | 6008 |
fir_interpolate_fract_asym | cint16 | int16 | 20 | 5 | 4 | 256 | 1 | 0 | 0 | 237 | 1080 MSa/s | 341 | 750 MSa/s | 5 | 1 | 6878 | 2046 |
fir_interpolate_fract_asym | cint16 | int16 | 20 | 5 | 4 | 160 | 1 | 0 | 0 | 162 | 987 MSa/s | 247 | 647 MSa/s | 5 | 1 | 5188 | 2046 |
fir_interpolate_fract_asym | cint16 | int16 | 32 | 8 | 3 | 96 | 1 | 0 | 0 | 171 | 561 MSa/s | 266 | 360 MSa/s | 5 | 1 | 5412 | 1762 |
fir_interpolate_fract_asym | cint16 | int16 | 28 | 7 | 5 | 280 | 1 | 0 | 0 | 287 | 975 MSa/s | 400 | 700 MSa/s | 5 | 1 | 7924 | 3548 |
fir_interpolate_fract_asym | cint16 | int16 | 28 | 7 | 4 | 224 | 1 | 0 | 0 | 279 | 802 MSa/s | 399 | 561 MSa/s | 5 | 1 | 7476 | 3490 |
fir_interpolate_fract_asym | cint16 | int16 | 28 | 7 | 6 | 336 | 1 | 0 | 0 | 287 | 1170 MSa/s | 400 | 840 MSa/s | 5 | 1 | 8372 | 3544 |
fir_interpolate_fract_asym | cint16 | int16 | 28 | 7 | 3 | 168 | 1 | 0 | 0 | 263 | 638 MSa/s | 397 | 423 MSa/s | 5 | 1 | 7028 | 3478 |
fir_interpolate_fract_asym | cint16 | int16 | 28 | 7 | 2 | 112 | 1 | 0 | 0 | 259 | 432 MSa/s | 396 | 282 MSa/s | 5 | 1 | 6580 | 3416 |
fir_interpolate_fract_asym | cint16 | int16 | 24 | 6 | 5 | 120 | 1 | 0 | 0 | 125 | 960 MSa/s | 203 | 591 MSa/s | 5 | 1 | 4460 | 2866 |
fir_interpolate_fract_asym | cint16 | int16 | 24 | 3 | 2 | 256 | 1 | 0 | 0 | 428 | 598 MSa/s | 533 | 480 MSa/s | 7 | 1 | 10460 | 2362 |
fir_interpolate_fract_asym | cint16 | int16 | 32 | 8 | 5 | 160 | 1 | 0 | 0 | 187 | 855 MSa/s | 272 | 588 MSa/s | 5 | 1 | 5924 | 1802 |
fir_interpolate_hb | cint32 | cint32 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 6315 | 40 MSa/s | 6406 | 39 MSa/s | 5 | 1 | 16381 | 3224 |
fir_interpolate_hb | cint32 | cint16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 391 | 654 MSa/s | 1025 | 249 MSa/s | 5 | 1 | 14209 | 2060 |
fir_interpolate_hb | cint32 | cint32 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 423 | 605 MSa/s | 1025 | 249 MSa/s | 5 | 1 | 14301 | 2334 |
fir_interpolate_hb | cint32 | cint32 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 8252 | 31 MSa/s | 8357 | 30 MSa/s | 5 | 1 | 19453 | 4140 |
fir_interpolate_hb | cint32 | cint32 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 679 | 377 MSa/s | 1030 | 248 MSa/s | 5 | 1 | 14525 | 2814 |
fir_interpolate_hb | cint32 | cint32 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 552 | 463 MSa/s | 1028 | 249 MSa/s | 5 | 1 | 14397 | 2570 |
fir_interpolate_hb | cint32 | cint16 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 2025 | 126 MSa/s | 2116 | 120 MSa/s | 5 | 1 | 15665 | 3150 |
fir_interpolate_hb | cint32 | int16 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 1143 | 223 MSa/s | 1224 | 209 MSa/s | 5 | 1 | 14267 | 2630 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 1508 | 169 MSa/s | 1591 | 160 MSa/s | 5 | 1 | 8747 | 2326 |
fir_interpolate_hb | cint32 | cint16 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 455 | 562 MSa/s | 1027 | 249 MSa/s | 5 | 1 | 14337 | 2296 |
fir_interpolate_hb | cint32 | cint16 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 392 | 653 MSa/s | 1025 | 249 MSa/s | 5 | 1 | 14289 | 2202 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 256 | 1 | 0 | 1 | 1508 | 169 MSa/s | 1654 | 154 MSa/s | 8 | 1 | 8791 | 2590 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 128 | 5 | 0 | 0 | 448 448 361 349 332 | 285 285 354 366 385 MSa/s | 466 467 468 463 464 | 274 274 273 276 275 MSa/s | 13 | 5 | 22359 | 1880 1842 1806 1806 1834 |
fir_interpolate_hb | int32 | int32 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 1522 | 168 MSa/s | 1605 | 159 MSa/s | 5 | 1 | 9137 | 2738 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 128 | 3 | 0 | 0 | 459 405 419 | 278 316 305 MSa/s | 543 540 541 | 235 237 236 MSa/s | 9 | 3 | 12993 | 1880 1842 1904 |
fir_interpolate_hb | cint32 | int16 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 1127 | 227 MSa/s | 1208 | 211 MSa/s | 5 | 1 | 14275 | 2706 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 128 | 2 | 0 | 0 | 515 481 | 248 266 MSa/s | 583 582 | 219 219 MSa/s | 7 | 2 | 9078 | 1948 1944 |
fir_interpolate_hb | cint32 | cint16 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 6711 | 38 MSa/s | 6816 | 37 MSa/s | 5 | 1 | 17697 | 4098 |
fir_interpolate_hb | cint32 | int16 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 2416 | 105 MSa/s | 2521 | 101 MSa/s | 5 | 1 | 16835 | 3324 |
fir_interpolate_hb | int32 | int16 | 19 | 2 | 1 | 256 | 1 | 0 | 0 | 232 | 1103 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8139 | 2116 |
fir_interpolate_hb | cint32 | int16 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 2788 | 91 MSa/s | 2880 | 88 MSa/s | 5 | 1 | 15275 | 2578 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2104 |
fir_interpolate_hb | int32 | int32 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8065 | 2068 |
fir_interpolate_hb | int32 | int32 | 27 | 2 | 1 | 256 | 1 | 0 | 0 | 359 | 713 MSa/s | 520 | 492 MSa/s | 5 | 1 | 8241 | 2682 |
fir_interpolate_hb | int32 | int32 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 2416 | 105 MSa/s | 2511 | 101 MSa/s | 5 | 1 | 10593 | 3222 |
fir_interpolate_hb | int32 | int32 | 23 | 2 | 1 | 256 | 1 | 0 | 0 | 296 | 864 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8225 | 2574 |
fir_interpolate_hb | int32 | int16 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 774 | 330 MSa/s | 857 | 298 MSa/s | 5 | 1 | 8747 | 2512 |
fir_interpolate_hb | int32 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 230 | 1113 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8051 | 1882 |
fir_interpolate_hb | int32 | int16 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 1167 | 219 MSa/s | 1262 | 202 MSa/s | 5 | 1 | 9731 | 3218 |
fir_interpolate_hb | cint32 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 422 | 606 MSa/s | 1026 | 249 MSa/s | 5 | 1 | 14195 | 1914 |
fir_interpolate_hb | int32 | int16 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 231 | 1108 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8067 | 2010 |
fir_interpolate_hb | float | float | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 859 | 298 MSa/s | 935 | 273 MSa/s | 5 | 1 | 8321 | 2258 |
fir_interpolate_hb | float | float | 19 | 2 | 1 | 256 | 1 | 0 | 0 | 1675 | 152 MSa/s | 1753 | 146 MSa/s | 5 | 1 | 8465 | 2794 |
fir_interpolate_hb | float | float | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 1219 | 210 MSa/s | 1295 | 197 MSa/s | 5 | 1 | 8385 | 2550 |
fir_interpolate_hb | cint32 | int32 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 2025 | 126 MSa/s | 2116 | 120 MSa/s | 5 | 1 | 15665 | 3150 |
fir_interpolate_hb | cint32 | int32 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 391 | 654 MSa/s | 1025 | 249 MSa/s | 5 | 1 | 14209 | 1966 |
fir_interpolate_hb | cint32 | int32 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 6711 | 38 MSa/s | 6816 | 37 MSa/s | 5 | 1 | 17697 | 4098 |
fir_interpolate_hb | cint32 | int32 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 455 | 562 MSa/s | 1027 | 249 MSa/s | 5 | 1 | 14337 | 2296 |
fir_interpolate_hb | cint32 | int32 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 392 | 653 MSa/s | 1025 | 249 MSa/s | 5 | 1 | 14289 | 2076 |
fir_interpolate_hb | float | float | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 10400 | 24 MSa/s | 10483 | 24 MSa/s | 5 | 1 | 9393 | 3884 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2104 |
fir_interpolate_hb | cint16 | int16 | 99 | 2 | 1 | 128 | 4 | 0 | 0 | 367 349 336 349 | 348 366 380 366 MSa/s | 467 463 464 466 | 274 276 275 274 MSa/s | 11 | 4 | 17420 | 1880 1842 1842 1894 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2088 |
fir_interpolate_hb | cint16 | int16 | 15 | 2 | 1 | 256 | 1 | 0 | 1 | 176 | 1454 MSa/s | 518 | 494 MSa/s | 8 | 1 | 8095 | 2480 |
fir_interpolate_hb | cint16 | int16 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8067 | 2000 |
fir_interpolate_hb | cint16 | int16 | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 501 | 510 MSa/s | 5 | 1 | 8067 | 2016 |
fir_interpolate_hb | cint16 | int16 | 11 | 2 | 1 | 256 | 1 | 0 | 1 | 176 | 1454 MSa/s | 518 | 494 MSa/s | 8 | 1 | 8079 | 2552 |
fir_interpolate_hb | cint16 | int16 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8059 | 2104 |
fir_interpolate_hb | cint16 | int16 | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 501 | 510 MSa/s | 5 | 1 | 8059 | 2104 |
fir_interpolate_hb | cint16 | cint16 | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 1516 | 168 MSa/s | 1599 | 160 MSa/s | 5 | 1 | 9137 | 2746 |
fir_interpolate_hb | cint16 | cint16 | 27 | 2 | 1 | 256 | 1 | 0 | 0 | 359 | 713 MSa/s | 520 | 492 MSa/s | 5 | 1 | 8241 | 2702 |
fir_interpolate_hb | cint16 | int16 | 19 | 2 | 1 | 256 | 1 | 0 | 1 | 231 | 1108 MSa/s | 519 | 493 MSa/s | 8 | 1 | 8167 | 2706 |
fir_interpolate_hb | cint16 | cint16 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 2413 | 106 MSa/s | 2508 | 102 MSa/s | 5 | 1 | 10593 | 3218 |
fir_interpolate_hb | cfloat | float | 99 | 2 | 1 | 256 | 1 | 0 | 0 | 21536 | 11 MSa/s | 21627 | 11 MSa/s | 5 | 1 | 15921 | 4044 |
fir_interpolate_hb | cfloat | float | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 1691 | 151 MSa/s | 1769 | 144 MSa/s | 5 | 1 | 14465 | 2290 |
fir_interpolate_hb | cfloat | float | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 2460 | 104 MSa/s | 2540 | 100 MSa/s | 5 | 1 | 14593 | 2582 |
fir_interpolate_hb | cfloat | cfloat | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 2735 | 93 MSa/s | 2813 | 91 MSa/s | 5 | 1 | 14557 | 2832 |
fir_interpolate_hb | cfloat | cfloat | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 45363 | 5 MSa/s | 45468 | 5 MSa/s | 5 | 1 | 19709 | 8226 |
fir_interpolate_hb | cfloat | cfloat | 15 | 2 | 1 | 256 | 1 | 0 | 0 | 3326 | 76 MSa/s | 3406 | 75 MSa/s | 5 | 1 | 14781 | 3092 |
fir_interpolate_hb | cfloat | cfloat | 11 | 2 | 1 | 256 | 1 | 0 | 0 | 2767 | 92 MSa/s | 2847 | 89 MSa/s | 5 | 1 | 14653 | 2786 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2104 |
fir_interpolate_hb | cint16 | cint16 | 23 | 2 | 1 | 256 | 1 | 0 | 0 | 296 | 864 MSa/s | 519 | 493 MSa/s | 5 | 1 | 8225 | 2582 |
fir_interpolate_hb | cint16 | int16 | 23 | 2 | 1 | 256 | 1 | 0 | 0 | 231 | 1108 MSa/s | 508 | 503 MSa/s | 5 | 1 | 8147 | 2262 |
fir_interpolate_hb | cint16 | cint16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 516 | 496 MSa/s | 5 | 1 | 8065 | 2182 |
fir_interpolate_hb | cint16 | int16 | 239 | 2 | 1 | 256 | 1 | 0 | 0 | 2853 | 89 MSa/s | 2948 | 86 MSa/s | 5 | 1 | 9731 | 2890 |
fir_interpolate_hb | cint16 | int16 | 23 | 2 | 1 | 256 | 1 | 0 | 0 | 167 | 1532 MSa/s | 500 | 512 MSa/s | 5 | 1 | 8151 | 1988 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2104 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 64 | 1 | 0 | 0 | 74 | 864 MSa/s | 148 | 432 MSa/s | 5 | 1 | 3443 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 512 | 1 | 0 | 0 | 312 | 1641 MSa/s | 1024 | 500 MSa/s | 5 | 1 | 14195 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 32 | 1 | 0 | 0 | 93 | 344 MSa/s | 168 | 190 MSa/s | 5 | 1 | 2675 | 2230 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 128 | 1 | 0 | 0 | 108 | 1185 MSa/s | 263 | 486 MSa/s | 5 | 1 | 4979 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2088 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 1 | 176 | 1454 MSa/s | 518 | 494 MSa/s | 8 | 1 | 8071 | 2504 |
fir_interpolate_hb | cint16 | int16 | 7 | 2 | 1 | 256 | 1 | 0 | 0 | 176 | 1454 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8051 | 2104 |
fir_interpolate_hb | cint16 | int16 | 47 | 2 | 1 | 256 | 1 | 1 | 0 | 232 | 1103 MSa/s | 514 | 498 MSa/s | 7 | 1 | 10538 | 2338 |
fir_interpolate_hb | cint16 | int16 | 27 | 2 | 1 | 256 | 1 | 0 | 0 | 231 | 1108 MSa/s | 508 | 503 MSa/s | 5 | 1 | 8155 | 2310 |
fir_interpolate_hb | cint16 | int16 | 27 | 2 | 1 | 256 | 1 | 0 | 0 | 231 | 1108 MSa/s | 518 | 494 MSa/s | 5 | 1 | 8155 | 2310 |
fir_interpolate_hb | cint16 | int16 | 27 | 2 | 1 | 256 | 1 | 0 | 1 | 231 | 1108 MSa/s | 520 | 492 MSa/s | 8 | 1 | 8191 | 2918 |
fir_interpolate_hb | cint16 | int16 | 239 | 2 | 1 | 256 | 1 | 0 | 1 | 2853 | 89 MSa/s | 3031 | 84 MSa/s | 8 | 1 | 9855 | 3090 |
fir_interpolate_hb | cint16 | int16 | 43 | 2 | 1 | 256 | 1 | 1 | 0 | 234 | 1094 MSa/s | 514 | 498 MSa/s | 7 | 1 | 10530 | 2334 |
fir_interpolate_hb | cint16 | int16 | 47 | 2 | 1 | 256 | 1 | 0 | 0 | 928 | 275 MSa/s | 1021 | 250 MSa/s | 5 | 1 | 8291 | 1872 |
fir_interpolate_hb | cint16 | int16 | 43 | 2 | 1 | 256 | 1 | 0 | 0 | 547 | 468 MSa/s | 640 | 400 MSa/s | 5 | 1 | 8283 | 1872 |
fir_sr_asym | cint32 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 548 | 467 MSa/s | 630 | 406 MSa/s | 5 | 1 | 10314 | 2180 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 165 | 1551 MSa/s | 265 | 966 MSa/s | 5 | 1 | 5978 | 1894 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 165 | 1551 MSa/s | 265 | 966 MSa/s | 5 | 1 | 5978 | 1894 |
fir_sr_asym | cint32 | cint16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 1060 | 241 MSa/s | 1143 | 223 MSa/s | 5 | 1 | 10442 | 2738 |
fir_sr_asym | cint32 | cint16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 2203 | 116 MSa/s | 2287 | 111 MSa/s | 5 | 1 | 10954 | 3464 |
fir_sr_asym | cint32 | cint32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 2271 | 112 MSa/s | 2354 | 108 MSa/s | 5 | 1 | 10700 | 3758 |
fir_sr_asym | cint32 | cint32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 4389 | 58 MSa/s | 4473 | 57 MSa/s | 5 | 1 | 11468 | 5494 |
fir_sr_asym | cint32 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1060 | 241 MSa/s | 1145 | 223 MSa/s | 5 | 1 | 10698 | 3040 |
fir_sr_asym | int32 | int32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1181 | 216 MSa/s | 1260 | 203 MSa/s | 5 | 1 | 6602 | 3832 |
fir_sr_asym | cint32 | int32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 2203 | 116 MSa/s | 2287 | 111 MSa/s | 5 | 1 | 10954 | 3464 |
fir_sr_asym | float | float | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 1612 | 158 MSa/s | 1690 | 151 MSa/s | 5 | 1 | 6473 | 3020 |
fir_sr_asym | float | float | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 2958 | 86 MSa/s | 3037 | 84 MSa/s | 5 | 1 | 6857 | 3896 |
fir_sr_asym | int16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 163 | 1570 MSa/s | 238 | 1075 MSa/s | 5 | 1 | 3986 | 2376 |
fir_sr_asym | int16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 294 | 870 MSa/s | 370 | 691 MSa/s | 5 | 1 | 4178 | 3292 |
fir_sr_asym | int32 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 292 | 876 MSa/s | 370 | 691 MSa/s | 5 | 1 | 6090 | 2232 |
fir_sr_asym | int32 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 548 | 467 MSa/s | 626 | 408 MSa/s | 5 | 1 | 6346 | 2990 |
fir_sr_asym | int32 | int32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 550 | 465 MSa/s | 628 | 407 MSa/s | 5 | 1 | 6218 | 3096 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 1 | 165 | 1551 MSa/s | 285 | 898 MSa/s | 8 | 1 | 6014 | 2514 |
fir_sr_asym | cint32 | int32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 1060 | 241 MSa/s | 1143 | 223 MSa/s | 5 | 1 | 10442 | 2738 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 165 | 1551 MSa/s | 265 | 966 MSa/s | 5 | 1 | 5978 | 1894 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 64 | 1 | 0 | 0 | 69 | 927 MSa/s | 143 | 447 MSa/s | 5 | 1 | 2906 | 1894 |
fir_sr_asym | cint16 | int16 | 64 | 1 | 1 | 256 | 1 | 0 | 1 | 1129 | 226 MSa/s | 1302 | 196 MSa/s | 8 | 1 | 6990 | 3726 |
fir_sr_asym | cint16 | int16 | 8 | 1 | 1 | 128 | 1 | 0 | 0 | 101 | 1267 MSa/s | 176 | 727 MSa/s | 5 | 1 | 3930 | 1894 |
fir_sr_asym | cfloat | cfloat | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 4185 | 61 MSa/s | 4267 | 59 MSa/s | 5 | 1 | 10969 | 3808 |
fir_sr_asym | cfloat | cfloat | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 8326 | 30 MSa/s | 8410 | 30 MSa/s | 5 | 1 | 11737 | 5206 |
fir_sr_asym | cfloat | float | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 2337 | 109 MSa/s | 2419 | 105 MSa/s | 5 | 1 | 10713 | 2634 |
fir_sr_asym | cfloat | float | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 4492 | 56 MSa/s | 4576 | 55 MSa/s | 5 | 1 | 11225 | 3704 |
fir_sr_asym | cint16 | cint16 | 24 | 1 | 1 | 256 | 1 | 0 | 0 | 900 | 284 MSa/s | 994 | 257 MSa/s | 5 | 1 | 6410 | 2780 |
fir_sr_asym | cint16 | cint16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1181 | 216 MSa/s | 1260 | 203 MSa/s | 5 | 1 | 6602 | 3832 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 1 | 0 | 0 | 2188 | 117 MSa/s | 2284 | 112 MSa/s | 5 | 1 | 7882 | 5144 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 1 | 0 | 1 | 2317 | 110 MSa/s | 2436 | 105 MSa/s | 8 | 1 | 8142 | 5462 |
fir_sr_asym | cint16 | cint16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 550 | 465 MSa/s | 628 | 407 MSa/s | 5 | 1 | 6218 | 3096 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 3 | 0 | 0 | 831 868 895 | 308 294 286 MSa/s | 1035 1036 1038 | 247 247 246 MSa/s | 9 | 3 | 19838 | 3908 3626 3708 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 4 | 0 | 0 | 747 658 683 712 | 342 389 374 359 MSa/s | 890 887 888 889 | 287 288 288 287 MSa/s | 11 | 4 | 27304 | 3144 2962 2962 3032 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 5 | 0 | 0 | 597 620 643 672 578 | 428 412 398 380 442 MSa/s | 813 814 815 817 812 | 314 314 314 313 315 MSa/s | 13 | 5 | 36274 | 2952 2722 2722 2722 2828 |
fir_sr_asym | cint16 | int16 | 64 | 1 | 1 | 256 | 1 | 0 | 0 | 1129 | 226 MSa/s | 1213 | 211 MSa/s | 5 | 1 | 6858 | 3570 |
fir_sr_asym | cint16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 292 | 876 MSa/s | 370 | 691 MSa/s | 5 | 1 | 6090 | 2232 |
fir_sr_asym | cint16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 1 | 327 | 782 MSa/s | 432 | 592 MSa/s | 8 | 1 | 6126 | 2988 |
fir_sr_asym | cint16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 548 | 467 MSa/s | 626 | 408 MSa/s | 5 | 1 | 6346 | 2990 |
fir_sr_asym | cint16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 1 | 548 | 467 MSa/s | 675 | 379 MSa/s | 8 | 1 | 6414 | 3294 |
fir_sr_asym | cint16 | int16 | 128 | 1 | 1 | 256 | 2 | 0 | 0 | 1665 1539 | 153 166 MSa/s | 1762 1760 | 145 145 MSa/s | 7 | 2 | 13268 | 3674 3362 |
fir_sr_sym | cint32 | cint16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 816 | 313 MSa/s | 899 | 284 MSa/s | 5 | 1 | 10342 | 1900 |
fir_sr_sym | cint16 | int16 | 89 | 1 | 1 | 256 | 1 | 0 | 0 | 1054 | 242 MSa/s | 1150 | 222 MSa/s | 5 | 1 | 7064 | 2546 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 117 | 2188 MSa/s | 264 | 969 MSa/s | 5 | 1 | 5974 | 1872 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 117 | 2188 MSa/s | 264 | 969 MSa/s | 5 | 1 | 5974 | 1872 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 64 | 1 | 0 | 0 | 59 | 1084 MSa/s | 133 | 481 MSa/s | 5 | 1 | 2902 | 1776 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 1 | 117 | 2188 MSa/s | 266 | 962 MSa/s | 8 | 1 | 5994 | 2332 |
fir_sr_sym | cint32 | cint16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1329 | 192 MSa/s | 1417 | 180 MSa/s | 5 | 1 | 10758 | 2088 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 256 | 1 | 0 | 0 | 117 | 2188 MSa/s | 264 | 969 MSa/s | 5 | 1 | 5974 | 1872 |
fir_sr_sym | cint32 | cint32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 1573 | 162 MSa/s | 1654 | 154 MSa/s | 5 | 1 | 10502 | 2052 |
fir_sr_sym | int32 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 166 | 1542 MSa/s | 266 | 962 MSa/s | 5 | 1 | 6054 | 2098 |
fir_sr_sym | cint32 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 687 | 372 MSa/s | 769 | 332 MSa/s | 5 | 1 | 10278 | 1756 |
fir_sr_sym | cint32 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1071 | 239 MSa/s | 1159 | 220 MSa/s | 5 | 1 | 10598 | 1864 |
fir_sr_sym | cint32 | int32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 816 | 313 MSa/s | 899 | 284 MSa/s | 5 | 1 | 10342 | 1900 |
fir_sr_sym | cint32 | int32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 1329 | 192 MSa/s | 1417 | 180 MSa/s | 5 | 1 | 10758 | 2088 |
fir_sr_sym | float | float | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 1610 | 159 MSa/s | 1688 | 151 MSa/s | 5 | 1 | 6373 | 2582 |
fir_sr_sym | int16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 101 | 2534 MSa/s | 176 | 1454 MSa/s | 5 | 1 | 3942 | 2090 |
fir_sr_sym | int16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 164 | 1560 MSa/s | 241 | 1062 MSa/s | 5 | 1 | 4070 | 2532 |
fir_sr_sym | int16 | int16 | 96 | 1 | 1 | 512 | 1 | 0 | 0 | 942 | 543 MSa/s | 1029 | 497 MSa/s | 5 | 1 | 6694 | 2800 |
fir_sr_sym | cint16 | int16 | 8 | 1 | 1 | 128 | 1 | 0 | 0 | 77 | 1662 MSa/s | 152 | 842 MSa/s | 5 | 1 | 3926 | 1872 |
fir_sr_sym | int32 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 455 | 562 MSa/s | 535 | 478 MSa/s | 5 | 1 | 6246 | 1908 |
fir_sr_sym | cint32 | cint32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 2599 | 98 MSa/s | 2686 | 95 MSa/s | 5 | 1 | 11078 | 2710 |
fir_sr_sym | cint16 | int16 | 64 | 1 | 1 | 256 | 1 | 0 | 1 | 921 | 277 MSa/s | 1054 | 242 MSa/s | 8 | 1 | 6730 | 2348 |
fir_sr_sym | cint16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 164 | 1560 MSa/s | 276 | 927 MSa/s | 5 | 1 | 6054 | 1950 |
fir_sr_sym | cint16 | int16 | 63 | 1 | 1 | 256 | 1 | 0 | 0 | 858 | 298 MSa/s | 947 | 270 MSa/s | 5 | 1 | 6660 | 2068 |
fir_sr_sym | int32 | int32 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 292 | 876 MSa/s | 370 | 691 MSa/s | 5 | 1 | 6118 | 2476 |
fir_sr_sym | cint16 | cint16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 292 | 876 MSa/s | 370 | 691 MSa/s | 5 | 1 | 6118 | 2476 |
fir_sr_sym | cint16 | cint16 | 24 | 1 | 1 | 256 | 1 | 0 | 0 | 421 | 608 MSa/s | 515 | 497 MSa/s | 5 | 1 | 6310 | 2964 |
fir_sr_sym | cint16 | cint16 | 24 | 1 | 1 | 256 | 2 | 0 | 0 | 288 295 | 888 867 MSa/s | 424 416 | 603 615 MSa/s | 7 | 2 | 9900 | 2562 2296 |
fir_sr_sym | cint16 | cint16 | 30 | 1 | 1 | 512 | 3 | 0 | 0 | 851 868 891 | 601 589 574 MSa/s | 1103 1110 1118 | 464 461 457 MSa/s | 9 | 3 | 20282 | 1838 1698 1772 |
fir_sr_sym | cint16 | cint16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 653 | 392 MSa/s | 733 | 349 MSa/s | 5 | 1 | 6406 | 2072 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 1 | 0 | 0 | 1436 | 178 MSa/s | 1532 | 167 MSa/s | 5 | 1 | 7494 | 2656 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 1 | 0 | 1 | 1436 | 178 MSa/s | 1611 | 158 MSa/s | 8 | 1 | 7626 | 2886 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 2 | 0 | 0 | 928 917 | 275 279 MSa/s | 1046 1045 | 244 244 MSa/s | 7 | 2 | 12236 | 2416 2238 |
fir_sr_sym | cint16 | int16 | 64 | 1 | 1 | 256 | 1 | 0 | 0 | 921 | 277 MSa/s | 1005 | 254 MSa/s | 5 | 1 | 6662 | 2092 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 3 | 0 | 0 | 788 787 797 | 324 325 321 MSa/s | 964 965 966 | 265 265 265 MSa/s | 9 | 3 | 17810 | 1970 1758 1844 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 5 | 0 | 0 | 669 659 656 658 660 | 382 388 390 389 387 MSa/s | 840 835 836 837 838 | 304 306 306 305 305 MSa/s | 13 | 5 | 31454 | 1938 1648 1648 1648 1750 |
fir_sr_sym | cint16 | int16 | 129 | 1 | 1 | 256 | 1 | 0 | 0 | 1442 | 177 MSa/s | 1544 | 165 MSa/s | 5 | 1 | 7656 | 2774 |
fir_sr_sym | cint16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 0 | 166 | 1542 MSa/s | 266 | 962 MSa/s | 5 | 1 | 6054 | 1950 |
fir_sr_sym | cint16 | int16 | 16 | 1 | 1 | 256 | 1 | 0 | 1 | 166 | 1542 MSa/s | 295 | 867 MSa/s | 8 | 1 | 6090 | 2586 |
fir_sr_sym | cint16 | int16 | 199 | 1 | 1 | 256 | 1 | 0 | 0 | 2018 | 126 MSa/s | 2132 | 120 MSa/s | 5 | 1 | 8500 | 3138 |
fir_sr_sym | cint16 | int16 | 240 | 1 | 1 | 256 | 1 | 0 | 0 | 2337 | 109 MSa/s | 2442 | 104 MSa/s | 5 | 1 | 8998 | 3120 |
fir_sr_sym | cint16 | int16 | 240 | 1 | 1 | 256 | 1 | 0 | 1 | 2469 | 103 MSa/s | 2602 | 98 MSa/s | 8 | 1 | 9226 | 3456 |
fir_sr_sym | cint16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 663 | 386 MSa/s | 742 | 345 MSa/s | 5 | 1 | 6246 | 1804 |
fir_sr_sym | cint16 | int16 | 32 | 1 | 1 | 256 | 1 | 0 | 1 | 699 | 366 MSa/s | 804 | 318 MSa/s | 8 | 1 | 6282 | 2536 |
fir_sr_sym | cint16 | int16 | 128 | 1 | 1 | 256 | 4 | 0 | 0 | 682 659 658 661 | 375 388 389 387 MSa/s | 838 835 836 837 | 305 306 306 305 MSa/s | 11 | 4 | 24216 | 1938 1702 1702 1788 |
fir_sr_sym | int32 | int32 | 32 | 1 | 1 | 256 | 1 | 0 | 0 | 653 | 392 MSa/s | 733 | 349 MSa/s | 5 | 1 | 6406 | 2072 |
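
The throughput columns in the table above appear to follow directly from the cycle counts: dividing INPUT_WINDOW_VSIZE by the cycle count and scaling by the AIE clock reproduces the published figures when a 1 GHz clock and truncation to whole MSa/s are assumed. Both the clock frequency and the truncation are inferred from the numbers rather than stated in this section, and the helper name `throughput_msa` is hypothetical. A minimal sketch:

```python
def throughput_msa(window_vsize: int, cycles: int, clock_mhz: int = 1000) -> int:
    """Input throughput in MSa/s, truncated to a whole number.

    clock_mhz=1000 is an assumption inferred from the table values,
    not a figure stated in the documentation.
    """
    if cycles <= 0:
        raise ValueError("cycles must be positive")
    # samples per cycle * cycles per microsecond = samples per microsecond (MSa/s)
    return (window_vsize * clock_mhz) // cycles


# fir_sr_sym, int16 data and coefficients, FIR_LEN=16, INPUT_WINDOW_VSIZE=256
print(throughput_msa(256, 101))  # from cycleCountAvg: 2534
print(throughput_msa(256, 176))  # from initiationInterval: 1454
```

The same arithmetic can be used to estimate throughput for a different window size once a cycle count has been measured for it.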
FFT
The following table gives results for the FFT/IFFT function across a range of supported parameter configurations. The parameters are defined in L2 FFT configuration parameters.
Library Element | DATA_TYPE | TWIDDLE_TYPE | POINT_SIZE | FFT_NIFFT | CASC_LEN | DYN_PT_SIZE | WINDOW_VSIZE | cycleCountAvg | throughputAvg | initiationInterval | throughpuInitIntAvg | NUM_BANKS | NUM_AIE | DATA_MEMORY | PROGRAM_MEMORY |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
fft_ifft_dit_1ch | cfloat | cfloat | 1024 | 1 | 1 | 1 | 1024 | 2578 | 397 MSa/s | 3083 | 332 MSa/s | 13 | 1 | 55864 | 5116 |
fft_ifft_dit_1ch | cint16 | cint16 | 64 | 1 | 1 | 1 | 64 | 520 | 123 MSa/s | 582 | 109 MSa/s | 8 | 1 | 11832 | 6568 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 1 | 0 | 1024 | 1515 | 675 MSa/s | 2067 | 495 MSa/s | 9 | 1 | 51640 | 5638 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 1 | 1 | 1024 | 1019 | 1004 MSa/s | 2068 | 495 MSa/s | 13 | 1 | 51768 | 6620 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 2 | 1 | 1024 | 508 812 | 2015 1261 MSa/s | 2071 2073 | 494 493 MSa/s | 22 | 2 | 76848 | 2 0 2 0 4926 3 0 3 0 6508 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 3 | 1 | 1024 | 426 812 368 | 2403 1261 2782 MSa/s | 2076 2077 2075 | 493 493 493 MSa/s | 31 | 3 | 101928 | 1 0 1 0 4942 2 0 2 0 4942 3 0 3 0 6508 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 4 | 1 | 1024 | 494 426 368 637 | 2072 2403 2782 1607 MSa/s | 2080 2081 2083 2079 | 492 492 491 492 MSa/s | 40 | 4 | 127008 | 1 0 1 0 6540 1 1 1 1 4910 2 1 2 1 4942 3 0 3 0 4942 |
fft_ifft_dit_1ch | cint32 | cint16 | 1024 | 1 | 5 | 1 | 1024 | 426 408 494 482 368 | 2403 2509 2072 2124 2782 MSa/s | 2084 2086 2085 2088 2083 | 491 490 491 490 491 MSa/s | 47 | 5 | 152088 | 1 0 1 0 4942 1 1 1 1 4942 2 2 2 2 4910 3 0 3 0 3902 3 2 3 2 4732 |
fft_ifft_dit_1ch | cint32 | cint16 | 128 | 1 | 1 | 0 | 1024 | 2327 | 55 MSa/s | 2408 | 53 MSa/s | 7 | 1 | 42424 | 4536 |
fft_ifft_dit_1ch | cint32 | cint16 | 128 | 1 | 2 | 0 | 1024 | 492 1838 | 260 69 MSa/s | 2075 2077 | 61 61 MSa/s | 12 | 2 | 67440 | 0 0 0 0 2296 1 0 1 0 3788 |
fft_ifft_dit_1ch | cint32 | cint16 | 128 | 1 | 1 | 0 | 128 | 295 | 433 MSa/s | 350 | 365 MSa/s | 7 | 1 | 13752 | 4270 |
fft_ifft_dit_1ch | cint32 | cint16 | 128 | 1 | 1 | 1 | 128 | 524 | 244 MSa/s | 587 | 218 MSa/s | 7 | 1 | 13880 | 6520 |
fft_ifft_dit_1ch | cint32 | cint16 | 16 | 1 | 1 | 0 | 1024 | 3929 | 4 MSa/s | 4002 | 3 MSa/s | 7 | 1 | 42424 | 2376 |
fft_ifft_dit_1ch | cint32 | cint16 | 16 | 1 | 1 | 0 | 16 | 80 | 200 MSa/s | 134 | 119 MSa/s | 7 | 1 | 10168 | 1900 |
fft_ifft_dit_1ch | cint32 | cint16 | 256 | 1 | 1 | 0 | 1024 | 1782 | 143 MSa/s | 2071 | 123 MSa/s | 7 | 1 | 43448 | 5016 |
fft_ifft_dit_1ch | cint32 | cint16 | 256 | 1 | 2 | 0 | 1024 | 852 938 | 300 272 MSa/s | 2070 2071 | 123 123 MSa/s | 12 | 2 | 68464 | 0 0 0 0 3274 1 0 1 0 3388 |
fft_ifft_dit_1ch | cint32 | cint16 | 256 | 1 | 1 | 0 | 256 | 436 | 587 MSa/s | 524 | 488 MSa/s | 7 | 1 | 18872 | 4742 |
fft_ifft_dit_1ch | cint32 | cint16 | 256 | 1 | 1 | 1 | 256 | 580 | 441 MSa/s | 646 | 396 MSa/s | 7 | 1 | 19000 | 6476 |
fft_ifft_dit_1ch | cint32 | cint16 | 32 | 1 | 1 | 0 | 1024 | 4516 | 7 MSa/s | 4589 | 6 MSa/s | 7 | 1 | 42424 | 2756 |
fft_ifft_dit_1ch | cint32 | cint16 | 32 | 1 | 2 | 0 | 1024 | 1116 3390 | 28 9 MSa/s | 3474 3476 | 9 9 MSa/s | 12 | 2 | 67440 | 0 0 0 0 1920 1 0 1 0 2398 |
fft_ifft_dit_1ch | cint32 | cint16 | 32 | 1 | 1 | 0 | 32 | 147 | 217 MSa/s | 201 | 159 MSa/s | 7 | 1 | 10680 | 2586 |
fft_ifft_dit_1ch | cint32 | cint16 | 32 | 1 | 1 | 1 | 32 | 423 | 75 MSa/s | 485 | 65 MSa/s | 7 | 1 | 10808 | 6504 |
fft_ifft_dit_1ch | cint32 | cint16 | 512 | 1 | 1 | 0 | 512 | 861 | 594 MSa/s | 1039 | 492 MSa/s | 8 | 1 | 29624 | 5118 |
fft_ifft_dit_1ch | cint32 | cint16 | 512 | 1 | 1 | 1 | 512 | 736 | 695 MSa/s | 1059 | 483 MSa/s | 8 | 1 | 29752 | 6580 |
fft_ifft_dit_1ch | cint32 | cint16 | 512 | 1 | 1 | 0 | 1024 | 1739 | 294 MSa/s | 2070 | 247 MSa/s | 8 | 1 | 46008 | 5428 |
fft_ifft_dit_1ch | cint32 | cint16 | 512 | 1 | 2 | 0 | 1024 | 1064 704 | 481 727 MSa/s | 2070 2071 | 247 247 MSa/s | 14 | 2 | 71024 | 0 0 0 0 3824 1 0 1 0 3298 |
fft_ifft_dit_1ch | cint32 | cint16 | 64 | 1 | 1 | 0 | 1024 | 3145 | 20 MSa/s | 3218 | 19 MSa/s | 7 | 1 | 42424 | 3434 |
fft_ifft_dit_1ch | cint32 | cint16 | 64 | 1 | 2 | 0 | 1024 | 892 2239 | 71 28 MSa/s | 2329 2330 | 27 27 MSa/s | 12 | 2 | 67440 | 0 0 0 0 2120 1 0 1 0 2854 |
fft_ifft_dit_1ch | cint16 | cint16 | 64 | 1 | 1 | 0 | 64 | 209 | 306 MSa/s | 271 | 236 MSa/s | 8 | 1 | 11704 | 3308 |
fft_ifft_dit_1ch | cint32 | cint16 | 64 | 1 | 1 | 0 | 64 | 207 | 309 MSa/s | 261 | 245 MSa/s | 7 | 1 | 11704 | 3220 |
fft_ifft_dit_1ch | cint16 | cint16 | 512 | 1 | 1 | 0 | 512 | 852 | 600 MSa/s | 910 | 562 MSa/s | 9 | 1 | 25528 | 5174 |
fft_ifft_dit_1ch | cint16 | cint16 | 32 | 0 | 1 | 1 | 32 | 390 | 82 MSa/s | 452 | 70 MSa/s | 8 | 1 | 11320 | 6524 |
fft_ifft_dit_1ch | cfloat | cfloat | 128 | 1 | 1 | 1 | 128 | 885 | 144 MSa/s | 948 | 135 MSa/s | 7 | 1 | 13880 | 5076 |
fft_ifft_dit_1ch | cfloat | cfloat | 16 | 1 | 1 | 0 | 16 | 142 | 112 MSa/s | 197 | 81 MSa/s | 7 | 1 | 10168 | 2574 |
fft_ifft_dit_1ch | cfloat | cfloat | 256 | 1 | 1 | 0 | 1024 | 5391 | 47 MSa/s | 5472 | 46 MSa/s | 7 | 1 | 43448 | 5218 |
fft_ifft_dit_1ch | cfloat | cfloat | 256 | 1 | 1 | 0 | 256 | 1341 | 190 MSa/s | 1399 | 182 MSa/s | 7 | 1 | 18872 | 4980 |
fft_ifft_dit_1ch | cfloat | cfloat | 256 | 1 | 1 | 1 | 256 | 1147 | 223 MSa/s | 1213 | 211 MSa/s | 7 | 1 | 19000 | 5084 |
fft_ifft_dit_1ch | cfloat | cfloat | 256 | 1 | 2 | 0 | 1024 | 2961 2488 | 86 102 MSa/s | 3068 3069 | 83 83 MSa/s | 12 | 2 | 68464 | 0 0 0 0 3452 1 0 1 0 3502 |
fft_ifft_dit_1ch | cfloat | cfloat | 32 | 1 | 1 | 0 | 1024 | 6740 | 4 MSa/s | 6821 | 4 MSa/s | 7 | 1 | 42424 | 3738 |
fft_ifft_dit_1ch | cfloat | cfloat | 32 | 1 | 1 | 0 | 32 | 227 | 140 MSa/s | 282 | 113 MSa/s | 7 | 1 | 10680 | 3500 |
fft_ifft_dit_1ch | cfloat | cfloat | 32 | 1 | 1 | 1 | 32 | 682 | 46 MSa/s | 744 | 43 MSa/s | 7 | 1 | 10808 | 5076 |
fft_ifft_dit_1ch | cfloat | cfloat | 32 | 1 | 2 | 0 | 1024 | 2566 4343 | 12 7 MSa/s | 4450 4452 | 7 7 MSa/s | 12 | 2 | 67440 | 0 0 0 0 2380 1 0 1 0 2864 |
fft_ifft_dit_1ch | cfloat | cfloat | 512 | 1 | 1 | 0 | 1024 | 5661 | 90 MSa/s | 5742 | 89 MSa/s | 8 | 1 | 47544 | 5876 |
fft_ifft_dit_1ch | cfloat | cfloat | 512 | 1 | 1 | 0 | 512 | 2809 | 182 MSa/s | 2873 | 178 MSa/s | 8 | 1 | 31160 | 5500 |
fft_ifft_dit_1ch | cfloat | cfloat | 512 | 1 | 2 | 0 | 1024 | 2792 2845 | 183 179 MSa/s | 2954 2955 | 173 173 MSa/s | 14 | 2 | 72560 | 2 0 2 0 3476 3 0 3 0 3944 |
fft_ifft_dit_1ch | cfloat | cfloat | 64 | 1 | 1 | 0 | 1024 | 5739 | 11 MSa/s | 5820 | 10 MSa/s | 7 | 1 | 42424 | 4322 |
fft_ifft_dit_1ch | cfloat | cfloat | 64 | 1 | 1 | 0 | 64 | 369 | 173 MSa/s | 424 | 150 MSa/s | 7 | 1 | 11704 | 4052 |
fft_ifft_dit_1ch | cfloat | cfloat | 64 | 1 | 1 | 1 | 64 | 731 | 87 MSa/s | 793 | 80 MSa/s | 7 | 1 | 11832 | 5068 |
fft_ifft_dit_1ch | cfloat | cfloat | 64 | 1 | 2 | 0 | 1024 | 3056 2781 | 20 23 MSa/s | 3165 3166 | 20 20 MSa/s | 12 | 2 | 67440 | 0 0 0 0 2908 1 0 1 0 3008 |
fft_ifft_dit_1ch | cint16 | cint16 | 1024 | 0 | 1 | 0 | 1024 | 1647 | 621 MSa/s | 1718 | 596 MSa/s | 10 | 1 | 43448 | 5742 |
fft_ifft_dit_1ch | cint16 | cint16 | 1024 | 1 | 1 | 0 | 1024 | 1647 | 621 MSa/s | 1718 | 596 MSa/s | 10 | 1 | 43448 | 5742 |
fft_ifft_dit_1ch | cint16 | cint16 | 128 | 0 | 1 | 0 | 128 | 293 | 436 MSa/s | 347 | 368 MSa/s | 8 | 1 | 12728 | 4326 |
fft_ifft_dit_1ch | cint16 | cint16 | 128 | 1 | 1 | 0 | 128 | 293 | 436 MSa/s | 347 | 368 MSa/s | 8 | 1 | 12728 | 4326 |
fft_ifft_dit_1ch | cint16 | cint16 | 16 | 1 | 1 | 0 | 16 | 78 | 205 MSa/s | 132 | 121 MSa/s | 8 | 1 | 10936 | 1944 |
fft_ifft_dit_1ch | cint16 | cint16 | 2048 | 0 | 1 | 0 | 2048 | 3344 | 612 MSa/s | 3425 | 597 MSa/s | 13 | 1 | 79288 | 6054 |
fft_ifft_dit_1ch | cint16 | cint16 | 2048 | 1 | 1 | 0 | 2048 | 3344 | 612 MSa/s | 3425 | 597 MSa/s | 13 | 1 | 79288 | 6054 |
fft_ifft_dit_1ch | cint16 | cint16 | 256 | 0 | 1 | 0 | 256 | 454 | 563 MSa/s | 511 | 500 MSa/s | 8 | 1 | 16824 | 4750 |
fft_ifft_dit_1ch | cint16 | cint16 | 256 | 1 | 1 | 0 | 256 | 454 | 563 MSa/s | 511 | 500 MSa/s | 8 | 1 | 16824 | 4750 |
fft_ifft_dit_1ch | cint16 | cint16 | 32 | 0 | 1 | 0 | 32 | 136 | 235 MSa/s | 198 | 161 MSa/s | 8 | 1 | 11192 | 2562 |
fft_ifft_dit_1ch | cint16 | cint16 | 512 | 0 | 1 | 0 | 512 | 852 | 600 MSa/s | 910 | 562 MSa/s | 9 | 1 | 25528 | 5174 |
fft_ifft_dit_1ch | cint32 | cint16 | 64 | 1 | 1 | 1 | 64 | 519 | 123 MSa/s | 582 | 109 MSa/s | 7 | 1 | 11832 | 6512 |
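
For configurations with CASC_LEN greater than 1, the cycle count and throughput cells hold one space-delimited figure per kernel. The sketch below splits such a cell and picks out the slowest kernel, which is one reasonable way to read a multi-kernel row. The function names are hypothetical, and the sample values are copied from the cint32, POINT_SIZE=256, CASC_LEN=2 row above.

```python
def parse_cell(cell: str) -> list[int]:
    """Split a space-delimited table cell into integers, skipping a trailing unit."""
    return [int(tok) for tok in cell.split() if tok.isdigit()]


def slowest_kernel(cycle_cell: str) -> tuple[int, int]:
    """Return (kernel index, cycle count) for the kernel with the highest cycle count."""
    cycles = parse_cell(cycle_cell)
    idx = max(range(len(cycles)), key=cycles.__getitem__)
    return idx, cycles[idx]


print(parse_cell("300 272 MSa/s"))  # [300, 272]
print(slowest_kernel("852 938"))    # (1, 938)
```

In rows where WINDOW_VSIZE exceeds POINT_SIZE, the published throughput figures match POINT_SIZE divided by the cycle count (for example, 256 samples over 852 cycles gives 300 MSa/s at an assumed 1 GHz clock), so those rows are best compared against other rows with the same window setting.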
Matrix Multiply
The following table gives results for the matrix multiply function across a range of supported parameter configurations. The parameters are defined in L2 Matrix Multiply Configuration Parameters.
Note
cycleCountAvg covers only the matrix multiply kernels and does not include cycle counts for the additional shuffling/tiling widget kernels. initiationInterval and PROGRAM_MEMORY do include the shuffling/tiling widget kernels, so those columns can list more values than cycleCountAvg.
Library Element | T_DATA_A | T_DATA_B | P_DIM_A | P_DIM_AB | P_DIM_B | P_ADD_TILING_A | P_ADD_TILING_B | P_ADD_DETILING_OUT | P_INPUT_WINDOW_VSIZE_A | P_INPUT_WINDOW_VSIZE_B | P_CASC_LEN | NITER | cycleCountAvg | initiationInterval | NUM_BANKS | NUM_AIE | DATA_MEMORY | PROGRAM_MEMORY |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
matrix_mult | cfloat | cfloat | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 643 680 718 738 | 844 859 875 851 890 867 883 898 906 | 29 | 9 | 49080 | 1830 3246 1830 3492 1830 3492 1358 1830 3748 |
matrix_mult | cint16 | cint16 | 8 | 8 | 8 | 1 | 1 | 1 | 64 | 64 | 1 | 100 | 105 | 204 205 206 | 9 | 3 | 9809 | 1894 2460 1390 |
matrix_mult | cint16 | cint16 | 8 | 8 | 8 | 1 | 1 | 1 | 64 | 64 | 1 | 100 | 105 | 204 205 206 | 9 | 3 | 9809 | 1894 2460 1390 |
matrix_mult | cint16 | int32 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 248 263 275 272 | 363 379 394 371 409 386 402 417 425 | 33 | 9 | 33988 | 1830 2814 1830 2910 1830 2910 1830 1358 3172 |
matrix_mult | cint32 | cint16 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 193 212 215 203 | 305 320 335 312 351 328 343 358 366 | 24 | 9 | 42180 | 1918 1756 1918 1798 1918 1798 1934 1246 2032 |
matrix_mult | cint32 | cint32 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 363 373 382 373 | 477 492 508 485 523 500 516 531 538 | 22 | 9 | 46776 | 1894 1964 1894 2136 1894 2136 1894 1246 2300 |
matrix_mult | cint32 | int32 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 193 212 215 203 | 305 320 335 312 351 328 343 358 366 | 24 | 9 | 42180 | 1918 1756 1918 1798 1918 1798 1934 1246 2032 |
matrix_mult | float | cfloat | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 383 395 405 401 | 498 513 505 544 521 529 551 536 559 | 32 | 9 | 40384 | 2726 2406 2726 2532 2572 2742 1246 2710 2768 |
matrix_mult | int16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 16 | 336 | 439 446 454 | 11 | 3 | 16976 | 2246 2472 1678 |
matrix_mult | int16 | cint32 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 167 172 174 162 | 254 269 285 262 300 277 292 308 316 | 25 | 9 | 34244 | 2318 1846 2302 1854 2318 1854 2302 1246 2072 |
matrix_mult | int16 | int16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 16 | 242 | 348 356 363 | 11 | 3 | 12878 | 1894 2076 1406 |
matrix_mult | int32 | cint16 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 248 263 275 272 | 363 379 394 371 409 386 402 417 425 | 33 | 9 | 33988 | 1830 2814 1830 2910 1830 2910 1830 1358 3172 |
matrix_mult | cint16 | cint16 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 113 131 132 126 | 215 230 246 223 261 238 254 269 277 | 30 | 9 | 33604 | 1830 1912 1830 2060 1830 2060 1830 1246 2202 |
matrix_mult | cint16 | cint16 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 1 | 16 | 287 | 561 569 576 | 8 | 3 | 19537 | 1830 2106 1246 |
matrix_mult | cint16 | cint16 | 8 | 4 | 64 | 1 | 1 | 1 | 32 | 256 | 1 | 16 | 289 | 526 534 542 | 10 | 3 | 19281 | 1294 1922 1478 |
matrix_mult | cint16 | cint16 | 8 | 4 | 512 | 1 | 1 | 1 | 32 | 2048 | 1 | 16 | 2079 | 4025 4033 4040 | 15 | 3 | 105297 | 1310 1922 1494 |
matrix_mult | cfloat | float | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 346 354 364 356 | 457 472 488 465 503 480 496 511 519 | 19 | 9 | 44480 | 1990 2384 1990 2456 1990 2456 1990 1246 2688 |
matrix_mult | cint16 | cint16 | 1024 | 4 | 4 | 1 | 1 | 1 | 4096 | 16 | 1 | 100 | 2462 | 4159 4160 4161 | 17 | 3 | 105297 | 1102 1944 1286 |
matrix_mult | cint16 | cint16 | 1024 | 4 | 4 | 1 | 1 | 1 | 4096 | 16 | 1 | 16 | 2462 | 4468 4476 4484 | 17 | 3 | 105297 | 1102 1944 1286 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 0 | 0 | 0 | 256 | 256 | 1 | 16 | 593 | 678 | 7 | 1 | 8329 | 2766 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 615 | 691 692 693 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 16 | 614 | 723 731 739 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 612 | 688 689 690 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 612 | 688 689 690 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 615 | 691 692 693 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 615 | 691 693 694 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 615 | 691 692 693 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | cint16 | cint16 | 24 | 4 | 4 | 1 | 1 | 1 | 96 | 16 | 1 | 16 | 87 | 169 177 184 | 11 | 3 | 9297 | 1968 1286 1102 |
matrix_mult | cint16 | cint16 | 32 | 32 | 32 | 0 | 0 | 0 | 1024 | 1024 | 1 | 16 | 4227 | 4360 | 7 | 1 | 26761 | 3878 |
matrix_mult | cint16 | cint16 | 32 | 32 | 32 | 1 | 1 | 1 | 1024 | 1024 | 1 | 100 | 4486 | 4576 4577 4579 | 10 | 3 | 55889 | 2326 3878 1678 |
matrix_mult | cint16 | cint16 | 32 | 32 | 64 | 0 | 0 | 0 | 1024 | 2048 | 1 | 16 | 8357 | 8554 | 7 | 1 | 43145 | 3878 |
matrix_mult | cint16 | cint16 | 32 | 64 | 32 | 0 | 0 | 0 | 2048 | 2048 | 1 | 16 | 8325 | 8522 | 7 | 1 | 43145 | 3878 |
matrix_mult | cint16 | cint16 | 64 | 64 | 64 | 0 | 0 | 0 | 4096 | 4096 | 1 | 16 | 33069 | 33394 | 13 | 1 | 100489 | 3862 |
matrix_mult | cint16 | cint16 | 8 | 4 | 4 | 1 | 1 | 1 | 32 | 16 | 1 | 16 | 46 | 124 132 139 | 11 | 3 | 7761 | 1694 1230 1102 |
matrix_mult | cint16 | cint16 | 8 | 4 | 512 | 1 | 1 | 1 | 32 | 2048 | 1 | 100 | 2079 | 4091 4092 4093 | 15 | 3 | 105297 | 1310 1922 1494 |
matrix_mult | cint16 | cint16 | 16 | 16 | 16 | 1 | 1 | 1 | 256 | 256 | 1 | 100 | 615 | 691 693 694 | 8 | 3 | 19025 | 2326 2750 1678 |
matrix_mult | int32 | int32 | 8 | 64 | 4 | 1 | 1 | 1 | 512 | 256 | 4 | 16 | 113 131 132 126 | 215 230 246 223 261 238 254 269 277 | 30 | 9 | 33604 | 1830 1912 1830 2060 1830 2060 1830 1246 2202 |
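
Because initiationInterval includes overheads that cycleCountAvg excludes, their difference gives a rough per-iteration overhead for rows where no tiling or detiling widgets are added (P_ADD_TILING_A, P_ADD_TILING_B and P_ADD_DETILING_OUT all 0), since both columns then describe the same single kernel. The sketch below uses three such cint16 rows from the table above. The helper name is hypothetical, and reading the difference as overhead is an assumption based on the metric definitions.

```python
def overhead_cycles(cycle_count: int, initiation_interval: int) -> int:
    """Cycles per iteration spent outside the kernel function itself."""
    return initiation_interval - cycle_count


# (P_DIM_A, P_DIM_AB, P_DIM_B) -> (cycleCountAvg, initiationInterval)
rows = {
    (16, 16, 16): (593, 678),
    (32, 32, 32): (4227, 4360),
    (64, 64, 64): (33069, 33394),
}

for dims, (cycles, interval) in rows.items():
    overhead = overhead_cycles(cycles, interval)
    share = 100.0 * overhead / interval
    print(f"{dims}: overhead {overhead} cycles ({share:.1f}% of the interval)")
```

For these rows the absolute overhead grows with matrix size while its share of the interval shrinks, which suggests that larger windows amortize the fixed cost better.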
Widgets
The following table gives results for the widgets across a range of supported parameter configurations. The parameters are defined in L2 Widgets Configuration Parameters.
Library Element | DATA_TYPE | IN_API | OUT_API | NUM_INPUTS | WINDOW_VSIZE | NUM_OUTPUT_CLONES | cycleCountAvg | throughputAvg | initiationInterval | throughpuInitIntAvg | NUM_BANKS | NUM_AIE | DATA_MEMORY | PROGRAM_MEMORY |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
widget_api_cast | cfloat | 0 | 0 | 1 | 256 | 1 | 88 | 2909 MSa/s | 517 | 495 MSa/s | 5 | 1 | 8980 | 1388 |
widget_api_cast | cfloat | 0 | 1 | 1 | 256 | 2 | 531 | 482 MSa/s | 574 | 445 MSa/s | 3 | 1 | 4884 | 1224 |
widget_api_cast | cfloat | 1 | 0 | 2 | 256 | 3 | 531 | 482 MSa/s | 600 | 426 MSa/s | 7 | 1 | 13076 | 1698 |
widget_api_cast | cint16 | 0 | 0 | 1 | 256 | 1 | 56 | 4571 MSa/s | 263 | 973 MSa/s | 5 | 1 | 4884 | 1388 |
widget_api_cast | cint16 | 0 | 0 | 1 | 256 | 2 | 90 | 2844 MSa/s | 264 | 969 MSa/s | 7 | 1 | 6932 | 1660 |
widget_api_cast | cint16 | 0 | 0 | 1 | 256 | 3 | 124 | 2064 MSa/s | 265 | 966 MSa/s | 9 | 1 | 8980 | 1904 |
widget_api_cast | cint16 | 0 | 1 | 1 | 256 | 1 | 274 | 934 MSa/s | 314 | 815 MSa/s | 3 | 1 | 2836 | 1208 |
widget_api_cast | cint16 | 1 | 0 | 1 | 256 | 1 | 272 | 941 MSa/s | 310 | 825 MSa/s | 3 | 1 | 2836 | 1202 |
widget_api_cast | cint16 | 1 | 0 | 1 | 256 | 2 | 275 | 930 MSa/s | 329 | 778 MSa/s | 5 | 1 | 4884 | 1406 |
widget_api_cast | cint16 | 1 | 0 | 1 | 256 | 3 | 280 | 914 MSa/s | 349 | 733 MSa/s | 7 | 1 | 6932 | 1658 |
widget_api_cast | cint16 | 1 | 0 | 2 | 256 | 1 | 273 | 937 MSa/s | 311 | 823 MSa/s | 3 | 1 | 2836 | 1218 |
widget_api_cast | cint32 | 0 | 1 | 1 | 256 | 2 | 531 | 482 MSa/s | 574 | 445 MSa/s | 3 | 1 | 4884 | 1224 |
widget_api_cast | cint32 | 1 | 0 | 1 | 256 | 4 | 535 | 478 MSa/s | 619 | 413 MSa/s | 9 | 1 | 17172 | 1848 |
widget_api_cast | cint32 | 1 | 0 | 2 | 256 | 4 | 535 | 478 MSa/s | 622 | 411 MSa/s | 9 | 1 | 17172 | 1920 |
widget_api_cast | int32 | 0 | 1 | 1 | 256 | 2 | 275 | 930 MSa/s | 315 | 812 MSa/s | 3 | 1 | 2836 | 1224 |
widget_api_cast | int32 | 1 | 0 | 2 | 256 | 2 | 274 | 934 MSa/s | 329 | 778 MSa/s | 5 | 1 | 4884 | 1390 |