Prefect Community

AnthonyC (8 months ago)
@Marvin I'm setting up a push work pool using Azure Container Instances. I'm following the tutorial and I created the role and identity, but when I configure the ACI work pool in Prefect, it asks me for a credentials block with a password. Do I need to create a new service principal ID with a password?
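
One way to read this, offered as a sketch rather than a definitive answer: the ACI push pool's credentials field is usually satisfied by an AzureContainerInstanceCredentials block from the prefect-azure collection, and the "password" it wants is the client secret of a service principal (app registration), which is separate from the managed identity assigned to the container group. A minimal sketch of registering such a block, with placeholder values:
```python
# Sketch only: assumes the prefect-azure collection is installed and that the work pool's
# credentials field expects an AzureContainerInstanceCredentials block. All values are
# placeholders; the client secret is the "password" created for the service principal.
from prefect_azure.credentials import AzureContainerInstanceCredentials

AzureContainerInstanceCredentials(
    client_id="<service-principal-client-id>",
    tenant_id="<azure-tenant-id>",
    client_secret="<service-principal-client-secret>",
).save("aci-push-pool-credentials", overwrite=True)  # block name is arbitrary
```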

Riya Sinha (8 months ago)
@Marvin how do I call an async flow as a FastAPI background task?
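
A minimal sketch of one way to do this, with a made-up flow (etl) and route (/trigger): FastAPI's BackgroundTasks accepts async callables, and an async Prefect flow is just an async callable, so it can be handed to add_task directly and will run after the response is sent.
```python
# Sketch: trigger an async Prefect flow from a FastAPI endpoint without blocking the response.
# Flow name, route, and parameter are illustrative only.
from fastapi import BackgroundTasks, FastAPI
from prefect import flow

app = FastAPI()

@flow
async def etl(n: int) -> None:
    ...  # task calls go here

@app.post("/trigger")
async def trigger(n: int, background_tasks: BackgroundTasks):
    # add_task accepts coroutine functions; the flow run starts after the response returns.
    background_tasks.add_task(etl, n)
    return {"status": "scheduled"}
```
For longer-running work, triggering a deployment instead (so the flow run happens outside the web process) is another common option.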

Tejas Shah (10 months ago)
@Marvin how do I add a timezone to a cron expression when deploying a flow with the flow.serve method?
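
A sketch of one approach, assuming a Prefect 2.x release in which flow.serve accepts a schedule object rather than only the bare cron string (newer releases take a schedules=[...] list instead); the flow, deployment name, and timezone are placeholders:
```python
# Sketch: attach an IANA timezone to the cron schedule by passing a CronSchedule
# object instead of the plain cron= argument.
from prefect import flow
from prefect.client.schemas.schedules import CronSchedule

@flow
def my_flow():
    ...

if __name__ == "__main__":
    my_flow.serve(
        name="my-deployment",
        schedule=CronSchedule(cron="0 9 * * *", timezone="Europe/London"),
    )
```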

Martin Treacy-Schwartz (10 months ago)
@Marvin describe the prefect architecture

Abuzar Shakikh (over 1 year ago)
Hi @Marvin I am getting the following error while running prefect flows.
```
aiohttp.client_exceptions.ClientConnectorError: Cannot connect to host hooks.slack.com:443 ssl:default [Temporary failure in name resolution]
```

Adam (almost 2 years ago)
@Marvin I am using this yaml to do `prefect deploy --name my_pipeline`. How can I do the same with a python script and not via the CLI?
# File for configuring project / deployment build, push and pull steps

# Generic metadata about this project
name: my-pipeline
prefect-version: 2.12.1

# build section allows you to manage and build docker images
build:
  - prefect_docker.projects.steps.build_docker_image:
      image_name: mycompany.azurecr.io/dir/prefect-image
      tag: latest
      dockerfile: Dockerfile
      push: true
      credentials: "{{ prefect.blocks.docker-registry-credentials.my-cred }}"


# push section allows you to manage if and how this project is uploaded to remote locations
push: null

# pull section allows you to provide instructions for cloning this project in remote locations
pull:
  - prefect.projects.steps.git_clone_project:
      repository: https://bitbucket.org/mycompany/repo.git
      branch: master
      access_token: "{{ prefect.blocks.secret.bb-credentials }}"
  - prefect.projects.steps.set_working_directory:
      directory: /opt/prefect/repo

deployments:
  - name: my_pipeline
    description: TODO
    entrypoint: my_pipeline/my_pipeline.py:my_flow
    parameters: {}
    work_pool:
      job_variables:
        image: "{{ image_name }}"
      name: worker_pool
      work_queue_name: default
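
A sketch of a programmatic equivalent of the YAML above, assuming a Prefect 2.x release where flow.from_source / .deploy and the runner storage classes are available (roughly 2.13+); the exact import paths and credentials handling differ between versions, and registry authentication here is assumed to come from a local docker login rather than the docker-registry-credentials block:
```python
# Sketch: roughly the same build / pull / deploy steps as the prefect.yaml, in Python.
# Class names and arguments reflect the 2.x runner APIs and are assumptions to verify
# against the installed Prefect version.
from prefect import flow
from prefect.blocks.system import Secret
from prefect.deployments.runner import DeploymentImage
from prefect.runner.storage import GitRepository

if __name__ == "__main__":
    source = GitRepository(
        url="https://bitbucket.org/mycompany/repo.git",
        branch="master",
        credentials={"access_token": Secret.load("bb-credentials")},
    )
    flow.from_source(
        source=source,
        entrypoint="my_pipeline/my_pipeline.py:my_flow",
    ).deploy(
        name="my_pipeline",
        work_pool_name="worker_pool",
        image=DeploymentImage(
            name="mycompany.azurecr.io/dir/prefect-image",
            tag="latest",
            dockerfile="Dockerfile",
        ),
        build=True,
        push=True,
    )
```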

Brennan Tolman (about 2 years ago)
@Marvin How do I set up an s3 storage block for a self hosted prefect server using MinIO?
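
A sketch of one common approach: point the built-in RemoteFileSystem block at the MinIO endpoint over the s3 protocol (s3fs must be installed); the bucket name, endpoint, and keys below are placeholders. The prefect-aws collection's S3Bucket plus MinIOCredentials blocks are another option.
```python
# Sketch: a RemoteFileSystem storage block that speaks the s3 protocol to a MinIO server.
# All values are placeholders; s3fs must be installed where flows run.
from prefect.filesystems import RemoteFileSystem

minio_storage = RemoteFileSystem(
    basepath="s3://my-bucket/prefect",
    settings={
        "key": "<minio-access-key>",
        "secret": "<minio-secret-key>",
        "client_kwargs": {"endpoint_url": "http://minio.local:9000"},
    },
)
minio_storage.save("minio-storage", overwrite=True)
```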

Eddie Atkinson (over 3 years ago)
Something I am struggling to get my head around is security in regards to `FargateCluster`. It seems like the cluster is being assigned a public IP address. Ideally I wouldn’t want that to be the case as I don’t want people snooping on my cluster / submitting jobs. However, when I pass `"fargate_private_ip": True` to `cluster_kwargs` my cluster fails to start with the error:
```
Cluster failed to start: Timed out trying to connect to tcp://10.0.1.111:8786 after 30 s
```
That makes sense. Someone somewhere failed to connect to a local IP address, presumably from outside the subnet. What I don’t understand is how I can prevent people from arbitrarily accessing my cluster from the internet whilst allowing all the ‘right’ traffic through.
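
The timeout is consistent with the process that creates the cluster sitting outside the VPC and therefore being unable to reach the scheduler's private address. A sketch of the usual shape of a fix, assuming dask-cloudprovider's FargateCluster accepts vpc / subnets / security_groups keyword arguments (names vary by version): run the flow or agent inside the same VPC, keep the tasks on private subnets, and attach a security group that only allows the scheduler ports from within the VPC.
```python
# Sketch only (Prefect 1.x style): every value is a placeholder, and the FargateCluster
# keyword names should be checked against the installed dask-cloudprovider version.
from prefect.executors import DaskExecutor

executor = DaskExecutor(
    cluster_class="dask_cloudprovider.aws.FargateCluster",
    cluster_kwargs={
        "vpc": "vpc-0123456789abcdef0",
        "subnets": ["subnet-0123456789abcdef0"],      # private subnets only
        "security_groups": ["sg-0123456789abcdef0"],  # allow 8786/8787 from the VPC CIDR only
        "fargate_private_ip": True,                   # as in the question above
    },
)
```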

Ofir (over 2 years ago)
What’s the best practice for a data retention policy on Prefect deployment runs? Just as a reference, here is how it is implemented for Apache Airflow, as yet another garbage-collector DAG: https://stackoverflow.com/questions/66580751/configure-logging-retention-policy-for-apache-airflow I’m sure that Prefect has either a built-in mechanism for that, or encourages a common idiom for rotating / archiving / deleting artifacts from old runs.

Context: we have persistent storage on Azure Blob Storage (the S3 equivalent) where we store artifacts (e.g. output files and images) from a Machine Learning (Kedro) run. The space can pile up pretty quickly across runs and we would run out of storage, rendering our Prefect deployments non-operational.

What kind of policies are recommended to evict data from old runs? I don’t want to run out of space and I want the Prefect pipelines to remain operational. I know that some of you would say “It depends”, so for the sake of this example let’s imagine that I have a dedicated 256 GB of storage. Should I set a threshold (e.g. 70% full) that will act as a trigger for evicting (removing) artifacts from old runs? Also, when should this run? As the first (prerequisite) subflow in my bigger flow, or as yet another deployment in Prefect on a recurring schedule? Thanks!
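
For files written to your own Blob Storage, Prefect does not garbage-collect them for you, so one common idiom is a separate scheduled deployment that evicts old artifacts; whether the trigger is age-based (as sketched below) or a fill-threshold check is a policy choice, and running it as its own recurring deployment keeps the main pipeline independent of cleanup failures. A minimal sketch with the azure-storage-blob SDK; the container name, connection string, and 30-day window are placeholders:
```python
# Sketch: a standalone cleanup flow, scheduled on its own, that deletes blobs older than
# a retention window. All names and values are placeholders.
from datetime import datetime, timedelta, timezone

from azure.storage.blob import ContainerClient
from prefect import flow, get_run_logger

@flow
def evict_old_artifacts(retention_days: int = 30):
    logger = get_run_logger()
    cutoff = datetime.now(timezone.utc) - timedelta(days=retention_days)
    container = ContainerClient.from_connection_string(
        conn_str="<azure-storage-connection-string>",
        container_name="ml-artifacts",
    )
    for blob in container.list_blobs():
        if blob.last_modified < cutoff:
            container.delete_blob(blob.name)
            logger.info("Deleted %s (last modified %s)", blob.name, blob.last_modified)
```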

Egil Bugge (over 3 years ago)
Hey all! I've been playing around with setting up a Kubernetes agent in Google Kubernetes Engine which can spin up an ephemeral Dask cluster on demand. This all seems to work rather smoothly (thanks to the amazing work done by the Prefect team and others), but I'm having some issues getting the autoscaler to remove the nodes after the flow has run. I get the following error messages on my Kubernetes cluster after my flow has run:

"Pod is blocking scale down because it’s not backed by a controller"
"Pod is blocking scale down because it doesn’t have enough Pod Disruption Budget (PDB)"

I'm pretty inexperienced with Kubernetes, so I was wondering if anyone has any pointers on how I might configure the KubeCluster so that it works with autoscaling? We're thinking of using the cluster to hyperparameter-tune a model. We do not use Kubernetes for anything else and have no need for the resources in between training runs, so getting the node pool to autoscale down to zero (the agent will stay in a different node pool) would save us some money. My run code is below:
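
Not the run code mentioned above, but a sketch of the usual fix for the first of those messages: the cluster autoscaler will not evict pods that are not backed by a controller unless they are annotated as safe to evict, so that annotation can be added to the Dask pod template (the classic dask_kubernetes API is assumed here and differs from the newer operator-based one). The PDB message is often about system pods scheduled onto the same node pool rather than the Dask pods themselves.
```python
# Sketch: annotate Dask worker/scheduler pods so the GKE autoscaler may evict them and
# scale the node pool back to zero. Image and resource sizes are placeholders.
from dask_kubernetes import KubeCluster, make_pod_spec

pod_template = make_pod_spec(
    image="daskdev/dask:latest",
    memory_limit="4G",
    memory_request="4G",
    cpu_limit=1,
    cpu_request=1,
)
# Without this annotation the autoscaler reports
# "Pod is blocking scale down because it's not backed by a controller".
pod_template.metadata.annotations = {
    "cluster-autoscaler.kubernetes.io/safe-to-evict": "true"
}

cluster = KubeCluster(pod_template=pod_template)
cluster.adapt(minimum=0, maximum=10)  # scale back down when the cluster is idle
```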