Tushar Kaithakkulam
04/25/2021, 11:56 PM@task(name="process_document")
def process_document(document):
processed_document = process_document_pages.map(document)
return processed_document
with Flow(name="Local Test") as flow:
documents = process_document.map(documents)Kevin Kho
Tushar Kaithakkulam
04/26/2021, 1:34 PMKevin Kho
flatten
and then map
on the flattened list and that will parallelize it for the document pages. You would then need some other task to bring it back into a List of List format. Conceptually a map of map doesn’t provide more parallelism because the first map is already parallelizing across available resources.