prefect-community #prefect-community

Hi, how can I fix this problem "Sorry, modifying RRule schedules via the UI is currently unsupported; select a different schedule type above or modify your schedule in code."

✅ 1

John Mizerany

10/11/2022, 2:30 PM

I have had a few flows (using Prefect Cloud 1) that have gotten stuck in the middle of a run. When cancelling the flow they are also getting stuck in a “cancelling” state and have a few still cancelling for 20 minutes. We are running our agent on EC2, but this looks like memory issues for the agent?

✅ 1

Kelvin DeCosta

10/11/2022, 3:54 PM

Hey everyone! I managed to write the necessary Infrastructure as Code (via Pulumi) for a light-weight Prefect agent that runs as an ECS Service. I was hoping to get a nice welcome message in the logs. Unfortunately, I get the

prefect

cli help message over and over again. ECS console shows a list of

STOPPED

tasks, which I'm assuming are various attempts to start the service and keep it running.

✅ 2

Nathaniel Russell

10/11/2022, 4:18 PM

I have a flow running in lambda that keeps giving me this warning:

Copy code

/usr/local/lib/python3.9/site-packages/prefect/logging/handlers.py:76: UserWarning: Failed to create the Prefect home directory at /home/sbx_user1051/.prefect

It then runs the flow code correctly, but after the flow is done it crashes, and gives this error:

Copy code

[in thread]

All of my flows perform their intended code but all end with this error and say crashed. How do I fix this?

✅ 1

Jason Bertman

10/11/2022, 4:40 PM

I have k8s cluster executing some pretty large flows on Orion via RayTaskRunner. I have a remote Ray cluster deployed with the KubeRay operator and an autoscaler running on the head node. It scales in and out properly, but this morning after about 23K task runs I'm seeing this:

Copy code

Unhandled error (suppress with 'RAY_IGNORE_UNHANDLED_ERRORS=1'): ray::begin_task_run() (pid=77, ip=10.80.9.219)
  File "/usr/local/lib/python3.8/site-packages/prefect/engine.py", line 1191, in orchestrate_task_run
    state = await propose_state(
  File "/usr/local/lib/python3.8/site-packages/prefect/engine.py", line 1496, in propose_state
    raise prefect.exceptions.Abort(response.details.reason)
prefect.exceptions.Abort: This run cannot transition to the RUNNING state from the RUNNING state.

During handling of the above exception, another exception occurred:

ray::begin_task_run() (pid=77, ip=10.80.9.219)
  File "/usr/local/lib/python3.8/site-packages/prefect/utilities/asyncutils.py", line 212, in wrapper
    return run_async_in_new_loop(async_fn, *args, **kwargs)
  File "/usr/local/lib/python3.8/site-packages/prefect/utilities/asyncutils.py", line 141, in run_async_in_new_loop
    return anyio.run(partial(__fn, *args, **kwargs))
  File "/usr/local/lib/python3.8/site-packages/anyio/_core/_eventloop.py", line 70, in run
    return asynclib.run(func, *args, **backend_options)
  File "/usr/local/lib/python3.8/site-packages/anyio/_backends/_asyncio.py", line 292, in run
    return native_run(wrapper(), debug=debug)
  File "/usr/local/lib/python3.8/asyncio/runners.py", line 44, in run
    return loop.run_until_complete(main)
  File "/usr/local/lib/python3.8/asyncio/base_events.py", line 616, in run_until_complete
    return future.result()
  File "/usr/local/lib/python3.8/site-packages/anyio/_backends/_asyncio.py", line 287, in wrapper
    return await func(*args)
  File "/usr/local/lib/python3.8/site-packages/prefect/engine.py", line 1121, in begin_task_run
    task_run.state.data._cache_data(await _retrieve_result(task_run.state))
AttributeError: 'NoneType' object has no attribute '_cache_data'

It seems like the engine is mistaking a task run for not running yet?

✅ 1

👀 2

Walter Cavinaw

10/11/2022, 5:14 PM

just a quick q: does prefect support Julia now? It's mentioned on the site but can't find any examples

julia 3

✅ 1

Imran Qureshi

10/11/2022, 7:26 PM

question: Prefect seems to stop monitoring a pipeline after four hours. Anyway to change this?

✅ 1

Carlo

10/11/2022, 8:28 PM

We were considering upgrading from prefect 1.1 to 2.5. Given we rely on AWS ECS + Fargate, I'm concerned it might not be fully supported yet. It looks like 2.4 introduced ECSTask w/ a caveat that the api is fluid. Any additional color would helpful. We were only looking to update to stay relevant w/ community support, our 1.1 install has been stable.

✅ 1

David Cupp

10/11/2022, 9:44 PM

Does anyone know how to schedule something in prefect based on an rruleset? According to the docs [1] and source code [2] the

RRuleSchedule

only takes a "rrule string". It seems easy to convert a single rrule to a string, but as far as I can tell there is no standard implementation to convert an rrule set into an rrule string [3]. Any ideas? [1] https://docs.prefect.io/api-ref/orion/schemas/schedules/#prefect.orion.schemas.schedules.RRuleSchedule [2] https://github.com/PrefectHQ/prefect/blob/main/src/prefect/orion/schemas/schedules.py#L322 [3] https://github.com/dateutil/dateutil/issues/856

✅ 1

Nace Plesko

10/11/2022, 10:47 PM

Hi, I'm running Prefect V1 and I'm having problems with passing in parameter. I'm trying to pass in Parameter to flow and use that parameter to concat a

command

for

ShellTask

. But the problem is that I'm not using the

Parameter

directly and it's not showing up in the UI when I register the flow. Is there any way to get around it? Thank you in advance!

Nico Neumann

10/11/2022, 11:29 PM

I use

prefect_aws

to upload/list/download files to s3 and also for some shared AWS Secrets. For some functionality I rely on

boto3

, e.g.

boto3.client("s3", ...).generate_presigned_url(…)

. Prefect 2.5.0 is running on EKS and the flows are deployed to S3 which requires

s3fs

Copy code

To use it in a deployment: prefect deployment […] -sb s3/dev 
You need to install s3fs to use this block.

https://docs.prefect.io/concepts/filesystems/ My problem is that

prefect_aws

and

s3fs

have dependency conflicts. I am using pip-tools to set my requirements and get the following error:

Copy code

# simplified <http://requirements.in|requirements.in> (removed the package versions to might easier find matches)
prefect
prefect_aws
s3fs

Copy code

$ pip-compile <http://requirements.in|requirements.in>
Could not find a version that matches botocore<1.27.60,<1.28.0,>=1.27.53,>=1.27.59,>=1.27.89 (from prefect_aws==0.1.4->-r <http://requirements.in|requirements.in> (line 2))
Tried: 0.4.1, 0.4.2, 0.5.0, 0.5.1, 0.5.2, 0.5.3, 0.5.4, 0.6.0, 0.7.0, 0.8.0, 0.8.1, 0.8.2, 0.8.3, 0.9.0 ... [lists all versions here] 
1.27.87, 1.27.88, 1.27.88, 1.27.89, 1.27.89
Skipped pre-versions: 1.0.0a1, 1.0.0a2, 1.0.0a3, 1.0.0b1, 1.0.0b2, 1.0.0b3, 1.0.0rc1, 1.0.0rc1
There are incompatible versions in the resolved dependencies:
  botocore<1.28.0,>=1.27.89 (from boto3==1.24.89->prefect_aws==0.1.4->-r <http://requirements.in|requirements.in> (line 2))
  botocore>=1.27.53 (from prefect_aws==0.1.4->-r <http://requirements.in|requirements.in> (line 2))
  botocore<1.27.60,>=1.27.59 (from aiobotocore==2.4.0->s3fs==2022.8.2->-r <http://requirements.in|requirements.in> (line 3))

I have found this issue: https://github.com/fsspec/s3fs/issues/615#issuecomment-1094791081 but not a real solution to fix it. How can I use

prefect_aws

and also deploy flows to S3? Does anyone else have the same problem and found a solution?

✅ 1

Adam Green

10/12/2022, 12:50 AM

Is it possible to statically type the code used to deploy Prefect flows without converting it all to async? We have a script we are running to deploy flows

Copy code

from prefect.deployments import Deployment
from prefect.filesystems import S3
s3_block = S3(
    aws_access_key_id=aws_key,
    aws_secret_access_key=aws_secret,
    bucket_path=context["prefect_flows_bucket"],
)
s3_block.save("s3", overwrite=True)

Deployment.build_from_flow(
    name="alpha",
    work_queue_name="alpha",
    flow=healthcheck,
    storage=S3.load("s3"),
    infrastructure=Process(),
    apply=True,
)

When we run mypy on this code, it complains about things not being typed as async. Is it possible to type this code without converting to async?

✅ 1

Nace Plesko

10/12/2022, 1:14 AM

I'm trying to kick off a flow from within a flow in Prefect v1 and I can't figure out why it's not working. At first I had the flows in separate files and I thought that was the issue, then I combined them in the same file, trying

StartFlowRun

and

create_flow_run

, like it's in the docs and for some reason I'm running in a bunch of errors when executing the flow, but none when registering it. Right now I'm getting

Copy code

Failed to load and execute flow run: ValueError('No flows found in file.')

and previously I was getting a bunch of

Copy code

Failed to load and execute flow run: KeyError("'__name__' not in globals")`

I feel like I am missing something extremely obvious about executing a flow from within a flow that it's not even documented?

Nace Plesko

10/12/2022, 2:35 AM

Is there a way to put a tag on a Flow in Prefect V1?

✅ 1

Steph Clacksman

10/12/2022, 7:21 AM

I am having trouble passing a dataframe from a flow to a subflow - it throws a type error. I have made a minimal reproducible example:

Copy code

import pandas as pd
from prefect import flow


@flow
def test_flow() -> None:
    df = pd.DataFrame({"ID": ["123456789", "223456789"]})
    test_subflow(df)


@flow(validate_parameters=False)
def test_subflow(df: pd.DataFrame) -> None:
    print(df)


if __name__ == "__main__":
    test_flow()

✅ 1

Nic

10/12/2022, 8:12 AM

I've followed the helm chart setup, and can't access the ORION Server from the host-machine. However, it has following error, and i can't create any blocks - Error in reploy thread

✅ 1

Chern Hong Poh

10/12/2022, 9:55 AM

Hello guys, currently I am using prefect version 0.13.17 and working environment is Amazon Linux. Then I face one problem when using

ShellTask

that returns

Command failed with exit code 2

when I registered and quick run the prefect flow. I registered the flow using this command

prefect register flow --file testing.py --project staging

. Appreciated if someone can help. This has been bugging me since morning.

Copy code

## print2.py

print("hello")

Copy code

## testing.py

import os
import datetime
from datetime import timedelta
import pendulum

import prefect
from prefect import case
from prefect import Flow
from prefect import Parameter
from prefect import task
from prefect.environments.storage import S3
from prefect.schedules import filters
from prefect.schedules.clocks import IntervalClock
from prefect.schedules.schedules import Schedule
from prefect.tasks.control_flow import merge
from prefect.tasks.dbt import DbtShellTask
from prefect.tasks.shell import ShellTask

import subprocess

@task(name="Logging")
def logging_result(stuff):
    logger = prefect.context.get("logger")
    return <http://logger.info|logger.info>(stuff)

@task(name="Run Python Script", log_stdout=True)
def run_script():
    return ShellTask(command=f"python3 print2.py").run()

with Flow(name="DBT Python daily run") as flow:
    python_run = run_script()
    final = logging_result(python_run)

#flow_state = flow.run()
#shell_output = flow_state.result[python_run].result
#print(shell_output)

✅ 1

Robert Hales

10/12/2022, 11:25 AM

Hi there, seen some undesirable behaviour around 500 handling in flows. A request to create a flow run during a subflow run failed due to a 500 from the (self-hosted) prefect server. The flow in question is still marked as running, which is not the case. Interestingly, this subflows parent is also a subflow which is still marked as running - however the "root" flow is marked as crashed.

✅ 1

Todd de Quincey

10/12/2022, 11:31 AM

Hi all, I’m curious if the Prefect team have any plans to introduce concepts similar to Dagster’s Software-defined-assets or Airflow’s Data-aware scheduling? I haven’t used either of the above, but conceptually, they are very attractive solutions (especially Dagster’s implementation).

✅ 1

➕ 1

👀 1

10/12/2022, 12:40 PM

👋 Is minimal RBAC configuration for agents running on K8s and for K8s jobs documented somewhere? Going by this post some

Roles/RoleBindings

were getting generated at some point in time, but I guess something changed.

✅ 1

Himanshu

10/12/2022, 1:28 PM

Hi i am trying to register prefect flow with a schedule using django API and am getting error {'_schema': 'Invalid data type: None'}. Does anyone have any solution?

Patrick Tan

10/12/2022, 1:43 PM

Hi, This is Prefect 2.0. From time to time prefect flow crashes with time kind of message. Rerun it completed successfully. Any clue?

✅ 1

Emma Rizzi

10/12/2022, 3:17 PM

Hello! I'm stumbing accross a weird error and have trouble debugging (error in thread) I'm migrating from gitlab.com registry to dockerhub for my flows DockerStorage, and a flow that used to work is not failing on dockerhub. It is running on a kubernetes cluster, and I tried to watch logs directly from the pod, the error does not even appear it justs stops the pod when starting the first non-parameter task. How can I get any more details on the error occuring ? I'm using prefect 1.2.4

✅ 1

José Duarte

10/12/2022, 3:25 PM

Hey all, I’m running prefect orion locally and no blocks appear, yet on our servers they do? Both cases are running prefect 2.5

✅ 1

Oluremi Akinwale

10/12/2022, 3:58 PM

Hello team

Oluremi Akinwale

10/12/2022, 4:00 PM

Am new to prefect, please which tutorial can I follow step by step on how to orchestrate dbt using prefect? Am presently using Fivetran, Bigquery and DBT but need Prefect to automate DBT

✅ 1

José Duarte

10/12/2022, 4:45 PM

Hey all, I think there are two FKs missing in the “flow run” family The

flow_run_notification_queue

has both

flow_run_state_id

and

flow_run_notification_policy_id

but doesn’t connect with either table. Example:

Copy code

sqlite> .schema flow_run_notification_queue
CREATE TABLE IF NOT EXISTS "flow_run_notification_queue" (
	id CHAR(36) DEFAULT ((
        lower(hex(randomblob(4)))
        || '-'
        || lower(hex(randomblob(2)))
        || '-4'
        || substr(lower(hex(randomblob(2))),2)
        || '-'
        || substr('89ab',abs(random()) % 4 + 1, 1)
        || substr(lower(hex(randomblob(2))),2)
        || '-'
        || lower(hex(randomblob(6)))
    )) NOT NULL,
	created DATETIME DEFAULT (strftime('%Y-%m-%d %H:%M:%f000', 'now')) NOT NULL,
	updated DATETIME DEFAULT (strftime('%Y-%m-%d %H:%M:%f000', 'now')) NOT NULL,
	flow_run_notification_policy_id CHAR(36) NOT NULL,
	flow_run_state_id CHAR(36) NOT NULL,
	CONSTRAINT pk_flow_run_notification_queue PRIMARY KEY (id)
);
CREATE INDEX ix_flow_run_notification_queue__updated ON flow_run_notification_queue (updated);

Along with

CONSTRAINT pk_flow_run_notification_queue PRIMARY KEY (id)

there should be something like

CONSTRAINT fk_flow_run_notification_queue__flow_state_id__flow_state FOREIGN KEY(flow_state_id) REFERENCES flow_run (id) ON DELETE cascade

as there is on other tables. Although I am probably missing something, could anyone from the Prefect team take a look?

José Duarte

10/12/2022, 4:52 PM

Hey all (x2) Can someone help on how to create a notification policy from the orion client? I can see the Model in the docs but no way to create it https://docs.prefect.io/api-ref/orion/schemas/core/?h=flow_run_notification#prefect.orion.schemas.core.FlowRunNotificationPolicy

✅ 1

Erik Amundson

10/12/2022, 5:23 PM

Hi all, we saw in the 2.3.0 release that Docker storage was added. However it seems that it's only available for the docker infrastructure block. One feature we enjoyed in 1.0 was using Docker storage with a kubernetes agent, so that we could consolidate the run environment and flow storage. Is that available also in 2.0? We would really like to use the docker storage with the kubernetes job block if possible.

✅ 1

10/12/2022, 5:28 PM

Hi folks, on Prefect UI, when a task fails, how can I rerun this task and all the downstream tasks (which also failed, in "TriggerFailed" status). I tried "restart" the task, but realize it only rerun that specific task, not the downstream ones.

✅ 1