|
| | _transform_gen (func, list inputs, output, *params, tile_size=16) |
| |
| | _transform_parallel_gen (func, list inputs, output, *params, tile_size=16) |
| |
| | transform (func, input, output, *params, tile_size=16) |
| |
| | transform_binary (func, first, second, output, *params, tile_size=16) |
| |
| | transform_parallel (func, input, output, *params, tile_size=16) |
| |
| | transform_parallel_binary (func, first, second, output, *params, tile_size=16) |
| |
| iron.algorithms.transform._transform_gen |
( |
|
func, |
|
|
list |
inputs, |
|
|
|
output, |
|
|
* |
params, |
|
|
|
tile_size = 16 |
|
) |
| |
|
protected |
General tiled transform to apply a function on inputs and obtain a single output.
Assumes all input and output shapes are the same.
Args:
func: Function to apply, either a lambda/callable or ExternalFunction.
For ExternalFunction, arg_types should be [*input_tiles, output_tile, *params]
inputs: List of input tensors (will be tiled automatically)
output: Output tensor (will be tiled automatically)
*params: Additional parameters for ExternalFunction only.
Scalar dtypes (np.int32, etc.) are passed as MLIR constants;
array types are transferred via ObjectFifos.
tile_size: Size of each tile processed by a worker (default: 16)
| iron.algorithms.transform._transform_parallel_gen |
( |
|
func, |
|
|
list |
inputs, |
|
|
|
output, |
|
|
* |
params, |
|
|
|
tile_size = 16 |
|
) |
| |
|
protected |
General parallel transform to apply a function on inputs and obtain a single output.
Distributes work across multiple AIE tiles for parallel execution.
Args:
func: Function to apply, either a lambda/callable or ExternalFunction.
For ExternalFunction, arg_types should be [*input_tiles, output_tile, *params]
inputs: List of input tensors (will be tiled automatically)
output: Output tensor (will be tiled automatically)
*params: Additional parameters for ExternalFunction only.
Scalar dtypes (np.int32, etc.) are passed as MLIR constants;
array types are transferred via ObjectFifos.
tile_size: Size of each tile processed by a worker (default: 16)