Prefect Community #ask-community

Chris Hart

04/03/2020, 4:45 PM

I have started using the PostgresExecutor task and after reading the source and trying to understand how it works, I'm not clear on transactions and exactly how lots of separate tasks get rolled into them.. (https://github.com/PrefectHQ/prefect/blob/master/src/prefect/tasks/postgres/postgres.py#L174-L203)

Chris Hart

04/03/2020, 4:46 PM

I noticed that in a mapped insert task I ran, there were quite a few rejected records (on purpose), but all the remaining good ones seemed to be inserted correctly...

Chris Hart

04/03/2020, 4:47 PM

and yet, it seems that commit=False is the default
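
One possible explanation for the behavior described above, assuming the task runs its query inside psycopg2's `with conn:` block (worth confirming against the linked source): that context manager commits when the block exits cleanly and rolls back when it raises, regardless of an explicit commit flag. Since each mapped run opens its own connection, a rejected record only fails its own transaction. The sketch below uses the standard library's sqlite3 as a stand-in for psycopg2, because its connection context manager has the same commit/rollback behavior and it needs no database server. The table, the `insert_one` helper, and the `load` helper are hypothetical and are not part of Prefect.

```python
import os
import sqlite3
import tempfile


def insert_one(db_path, value):
    """Open a fresh connection, run one INSERT, and close it, like one mapped run."""
    conn = sqlite3.connect(db_path)
    try:
        # The connection context manager commits on success and rolls back on error.
        # No explicit conn.commit() is called here.
        with conn:
            conn.execute("INSERT INTO items (value) VALUES (?)", (value,))
        return True
    except sqlite3.IntegrityError:
        # A rejected record only affects its own transaction.
        return False
    finally:
        conn.close()


def load(values):
    """Insert each value through its own connection and report what was stored."""
    fd, db_path = tempfile.mkstemp(suffix=".db")
    os.close(fd)
    try:
        setup = sqlite3.connect(db_path)
        with setup:
            setup.execute(
                "CREATE TABLE items (value INTEGER NOT NULL CHECK (value > 0))"
            )
        setup.close()

        results = [insert_one(db_path, v) for v in values]

        check = sqlite3.connect(db_path)
        stored = [row[0] for row in check.execute("SELECT value FROM items ORDER BY value")]
        check.close()
        return results, stored
    finally:
        os.remove(db_path)
```

Calling `load([1, -5, 2, None, 3])` rejects the two invalid values while the three valid ones are still committed, which matches the "bad records rejected, good ones inserted" outcome.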

Chris Hart

04/03/2020, 4:49 PM

is anyone aware of deeper dive docs on dask and psycopg?

josh

04/03/2020, 6:42 PM

The dask documentation is very good for a deeper dive into it: https://docs.dask.org/en/latest/ As for the psycopg commit behavior, I don’t know of any deeper dive resources, but this is a nice page I found searching around, which starts by explaining transactions: https://www.postgresqltutorial.com/postgresql-python/transaction/

🙌 1

Chris Hart

04/03/2020, 8:22 PM

ah ok so the PostgresExecutor task does not support the cursor.executemany() method.. this would be really helpful in mapped tasks that are doing data loading of fixed-size batches..

Chris Hart

04/03/2020, 8:23 PM

currently each run() call opens a separate connection/transaction per single query.. it would be possible to make a super giant querystring for the whole chunk, but executemany() makes it much friendlier
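
A rough sketch of the batch approach described above: one connection, one transaction, and a single `executemany()` call for the whole chunk. It again uses the standard library's sqlite3 as a stand-in, since both it and psycopg2 expose `executemany()` through the Python DB-API; note that psycopg2 uses `%s` placeholders where sqlite3 uses `?`. The table and the `insert_batch` and `demo` helpers are hypothetical and do not come from the Prefect task library.

```python
import sqlite3


def insert_batch(conn, rows):
    """Insert a whole batch in one transaction; roll back all of it on failure."""
    try:
        # Commits if every row succeeds, rolls back the entire batch otherwise.
        with conn:
            conn.executemany("INSERT INTO items (id, name) VALUES (?, ?)", rows)
        return True
    except sqlite3.IntegrityError:
        return False


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")

    ok = insert_batch(conn, [(1, "a"), (2, "b")])      # valid batch
    bad = insert_batch(conn, [(3, "c"), (4, None)])    # second row violates NOT NULL

    count = conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]
    conn.close()
    return ok, bad, count
```

One trade-off to keep in mind: with a single transaction per batch, one bad row discards the whole chunk, unlike the one-connection-per-query behavior. If throughput is the main goal with psycopg2, `psycopg2.extras.execute_batch` and `execute_values` may also be worth a look.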

Chris Hart

04/03/2020, 8:25 PM

would this be something welcome as a PR? I can pretty easily extend the task class in my code, but might be useful to others... and by extension i wonder what the ambitions/roadmap are for postgres tasks

josh

04/03/2020, 8:26 PM

Absolutely welcome as a PR!

josh

04/03/2020, 8:27 PM

The task library is generally community driven so there are no formal roadmap items (yet) for the tasks in it. The goal is to have a library of easy to use tasks for performing basic functions that can also serve as templates/inspiration for others to use when making their own tasks 🙂

Chris Hart

04/03/2020, 8:33 PM

awesome thanks
