show-us-what-you-got
  • h

    Henning Holgersen

    03/16/2022, 8:37 PM
    I have been working a lot on setting up a developer workflow and deployment options lately, and created a demo project with one possible deployment pattern using GitHub Actions. It is very much inspired by the GitHub issue on the same topic. In short, this is one repo containing flows that are deployed automatically and in parallel via the GitHub Action. Everything is dockerized so that the flows can be completely independent. So far this is in the trial stage. I'm looking forward to exploring how to adapt this to Orion. https://github.com/radbrt/prefect_flows
  • f

    Florian Kühnlenz

    03/17/2022, 8:05 AM
    http://www.tharwan.de/mlops-gridsearch.html We used Prefect, sklearn and Dask for easy grid searches in the hyperparameter space of ML models. Overall a very enjoyable toolchain.
  • e

    ep

    03/17/2022, 12:13 PM
    Hey guys, I'm trying to run a dbt job with Prefect, but it always fails with exit code 127. Any suggestions?
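For context on the question above: POSIX shells conventionally return exit code 127 when the command itself cannot be found, so one plausible first check is whether the dbt executable is on the PATH of the environment that runs the flow. The sketch below is only a diagnostic aid; the helper names `is_on_path` and `shell_exit_code` are made up for illustration and are not part of Prefect or dbt.

```python
import shutil
import subprocess


def is_on_path(executable: str) -> bool:
    """Return True if the executable can be located on the current PATH."""
    return shutil.which(executable) is not None


def shell_exit_code(command: str) -> int:
    """Run a command through the system shell and return its exit code."""
    result = subprocess.run(command, shell=True, capture_output=True)
    return result.returncode
```

Calling `is_on_path("dbt")` from inside the same environment (container, agent, virtualenv) that executes the flow shows whether the shell would be able to find dbt at all.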
  • n

    Noah Holm

    03/17/2022, 10:55 PM
    Alright @Marvin, do you like Christmas? 🎄 I set out to make a star with the radar plot. I figured it should be doable and would be a pretty cool visualisation. However, it was difficult because it is unpredictable which task lands where on the same level in the radar circle. Quick turnaround to a new idea: I could use the same idea of an increasing number of dependent tasks to make something a bit more consistent in form. Enter the Christmas tree, to kick off the very first challenge! By increasing the number of dependent tasks at each level in the tree to create more branches, I managed to get the tree structure plotted in Orion. I found that the plot first extends to “3 o’clock” and then goes down, in a clockwise manner. This made my idea of creating a stem at the bottom hard, as it would be located at the upper part of the tree and not in the middle. I then realised that it would be possible to get a star or ball on the top of the tree, which really topped it off, I’d say! Hoping to see some cooler submissions in the week to come. Good luck and enjoy to anyone trying out the challenge 🙌
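The tree idea above (more dependent tasks at each level) can be sketched in plain Python as a dependency map. This is not Noah's code: the function names, the task naming scheme, and the assumption that every task depends on all tasks of the previous level are illustrative guesses only.

```python
from typing import Dict, List


def build_tree_levels(n_levels: int) -> List[List[str]]:
    """Level 0 holds one task (the top of the tree); each later level holds one more."""
    return [
        [f"level{lvl}_task{i}" for i in range(lvl + 1)]
        for lvl in range(n_levels)
    ]


def build_dependencies(levels: List[List[str]]) -> Dict[str, List[str]]:
    """Map each task to its upstream tasks: all tasks of the previous level."""
    deps: Dict[str, List[str]] = {}
    for lvl, tasks in enumerate(levels):
        upstream = levels[lvl - 1] if lvl > 0 else []
        for name in tasks:
            deps[name] = list(upstream)
    return deps
```

In an orchestrator, each entry of the map would become one task whose inputs are the results of its upstream tasks, which is what widens the plot level by level.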
  • s

    Stéphan Taljaard

    03/23/2022, 9:47 PM
    I also took a go at @Marvin's challenge :marvin-duck:. Although my flow's radar plot doesn't really look impressive, the experience leading up to it has been! I hacked together a simple flow that checks for new job postings on https://boards.greenhouse.io/prefect/, keeps track of them, and when new ones come up, sends their links to me via Telegram. I'll post the code in a reply in the thread (again, it was hacked together quickly 😅). Note for the radar diagrams: left was my first flow run, and right the second run. The flow only notifies me when it finds "new to me" job posts. The second run skipped the sending step because it had already notified me of the current jobs. What was really nice: flows are so easy to define in Prefect 2.0! The conditional logic was done using a simple if statement. Not that 1.0's case is difficult, but 2.0's usage is super straightforward, making it even easier and more fun to use!
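The "notify only on new-to-me posts" logic described above can be sketched in plain Python. This is not Stéphan's code: `find_new_posts`, `check_jobs`, and the `notify` callback are hypothetical names, and the Greenhouse scraping and Telegram sending are left out. In Prefect 2.0 the two functions would presumably carry the `@task` and `@flow` decorators, with the conditional staying an ordinary `if`.

```python
from typing import Callable, List, Sequence, Set


def find_new_posts(current: Sequence[str], seen: Set[str]) -> List[str]:
    """Return the posts in `current` that have not been seen before, in order."""
    return [post for post in current if post not in seen]


def check_jobs(
    current: Sequence[str],
    seen: Set[str],
    notify: Callable[[List[str]], None],
) -> List[str]:
    """Notify about new posts and remember them; skip notifying if there are none."""
    new_posts = find_new_posts(current, seen)
    if new_posts:  # plain Python conditional, no special control-flow construct
        notify(new_posts)
        seen.update(new_posts)
    return new_posts
```

Running `check_jobs` twice with the same postings mirrors the two radar plots: the first run notifies, the second skips the sending step.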
  • a

    Anna Geller

    03/24/2022, 8:44 PM
    A great new blog post by @Jamie Zieziula 🔥 It will walk you through how to deploy the Docker agent on AWS EC2 using Terraform https://www.prefect.io/blog/announcing-the-terraform-module-to-deploy-the-prefect-docker-agent-on-aws/
  • b

    Ben Welsh

    03/29/2022, 7:16 PM
    message has been deleted
  • a

    Alyssa Mazzina

    03/30/2022, 6:20 PM
    Some great new content by @Jean Luciano. https://www.prefect.io/blog/from-pdf-to-snowflake/
  • a

    Alyssa Mazzina

    03/31/2022, 4:40 PM
    We've got both a blog and a companion video today on the Prefect and Great Expectations integration, both from our own @alex! Blog: https://www.prefect.io/blog/continuous-data-validation-with-great-expectations-and-prefect Video: https://www.youtube.com/watch?v=8Tw2TLUpQn8

  • p

    Patrick Koch

    04/04/2022, 10:43 AM
    Dear all, I've written a post about how to deploy a Prefect flow as a Kubernetes job on Azure Kubernetes Service. A huge thanks to @Anna Geller: you helped a lot in making this post possible! https://www.patrickkoch.dev/posts/post_15/
  • c

    Chris Reuter

    04/06/2022, 10:09 PM
    Hi all - thanks again to @Henning Holgersen for coming on PrefectLive. You can find PrefectLive recordings on our YouTube channel - check out this playlist!
  • a

    Alyssa Mazzina

    04/08/2022, 6:09 PM
    New content on the Prefect blog this week! Get Integrated to Google Cloud Platform With prefect-gcp
  • a

    Alyssa Mazzina

    04/08/2022, 6:09 PM
    What is the Modern Data Stack?
  • c

    Chris Reuter

    04/19/2022, 4:20 PM
    Hi there! Check out this cool guide on automating dbt exposures published today by @James Sopkin. It demonstrates how to automate the generation of your exposures using an example with dbt :dbt:, Tableau, and GitHub Actions. Thanks James! If you're interested in writing anything about Prefect, we'd love to feature you in our blog! You can reach out directly to me here on Slack or at chrisr@prefect.io.
  • a

    Anna Geller

    04/19/2022, 4:57 PM
    Hi Prefectionists! Today seems to be a :prefect: + :dbt: day! If you are using Prefect with dbt, we have another fantastic recipe you should try out. @alex has just released a full tutorial allowing you to easily rerun failed dbt models from a failed dbt DAG node. You only need to add one more DbtShellTask to your flow to take advantage of this functionality, making dbt DAG orchestration and recovery from failure that much easier! For more details on that, check this Discourse announcement. And to get directly to code, visit the Customer Success Recipes repository. Happy Engineering!
  • a

    Alexander Butler

    04/21/2022, 1:02 AM
    Fully dockerized prefect with preseeded deployments, queues, and an entrypoint that spins up agents and server. Auto deployed in a GitOps fashion to GCR (GAR). Cool!
  • c

    Chris Reuter

    04/21/2022, 12:19 PM
    Ever wonder how we got to the now of modern data tools? Where did dataflow automation come from? Do you want to reminisce about Informatica and Oracle (oh wait, maybe you're still using those...)? Do you yearn to understand why cron tortures you to this day? @justabill offers his creative writing style to A Brief History of Dataflow Automation - an entertaining read to start your morning.
  • j

    Jean-Michel Provencher

    04/21/2022, 1:07 PM
    Hello, here's a quick blog post I wrote about integrating Sentry in Prefect Flows. Enjoy. https://source.coveo.com/2022/04/20/integrating-sentry-with-prefect-flows/
  • c

    Chris Reuter

    04/25/2022, 6:34 PM
    Hi all! I want to highlight our repo that is associated with an O'Reilly course that we helped create called 'Getting Started with Workflow Orchestration'. If you're just looking to schedule a recurring task, or you want to orchestrate your entire stack, this free repo is a good place to start.
  • a

    Anna Geller

    04/28/2022, 6:18 PM
    New blog post by @Andrew Black about our partnership with Snowflake and the roadmap for :prefect: + ❄️ integrations in Prefect 2.0 :prefect2: :cloud2:
  • c

    Chris Reuter

    05/04/2022, 3:26 PM
    There's been a great mix of content lately that this community may find helpful:
    • A guide from @Jean Luciano on how to deploy flows in a CI/CD fashion using GitHub Actions
    • @Anna Geller's workflow orchestration vs data orchestration writeup that can help you explain Prefect's purpose to people who aren't familiar with Prefect!
    Happy reading!
  • c

    Chris Reuter

    05/05/2022, 8:13 PM
    In part 2 of @James Sopkin's social media series, he covers extracting data from YouTube videos using Prefect & the YouTube API. Read it here!
  • e

    Edmondo Porcu

    05/05/2022, 8:16 PM
    Was having this discussion on #prefect-community with @Anna Geller and other folks. Workflow orchestration makes me think about microservices, BPMN, Temporal, etc.
  • c

    Chris Reuter

    05/07/2022, 8:00 PM
    Happy weekend all - wanted to highlight this recent community blog post covering running Prefect flows as Kubernetes jobs on Azure's Managed Kubernetes Service. Thanks for your contribution to the community @Patrick Koch! https://www.patrickkoch.dev/posts/post_15/
  • i

    Ivan Fokeev

    05/09/2022, 2:01 PM
    Hey guys! Recently, I was struggling with cloud infrastructure deployment and had the idea to create a tool. If you know how to code but are not familiar with clouds, then this is a great way for you: just use Cloudy templates, which are actually based on Pulumi. Let's push “infrastructure as code” so it can be used by everyone! https://news.ycombinator.com/item?id=31313931 https://github.com/cloudytool/cloudy
  • k

    Kevin Kho

    05/13/2022, 6:24 PM
    message has been deleted
  • m

    Mathijs Miermans

    05/13/2022, 9:01 PM
    At Pocket (part of Mozilla) we recently open-sourced our Prefect repo: https://github.com/Pocket/data-flows One cool thing we do is set the flow READMEs during continuous deployment: https://github.com/Pocket/data-flows/blob/main/deploy/register_flows.py#L71-L94 (Our continuous deployment strategy is based on an approach from mousetree.)
    šŸ’ŖšŸ½ 1
    šŸ‘šŸ½ 1
    šŸ‘ 7
    šŸ’Ŗ 2
    šŸ‘€ 8
    c
    d
    +3
    • 6
    • 6
  • p

    Paco Ibañez

    05/13/2022, 11:24 PM
    Hello! I have been exploring Orion this week and I have put together a docker-compose setup to run Prefect 2.0 locally. I have tested it on macOS and Linux. The script runs Prefect server, Postgres, a Docker agent, and MinIO to store flows. The start script configures a work queue, and an example flow is included. Sharing in case someone finds it useful: https://github.com/fraibacas/prefect-orion
    šŸ’ŖšŸ½ 1
    šŸ’Æ 4
    šŸ’Ŗ 4
    šŸ‘€ 3
    m
    j
    +4
    • 7
    • 9
  • c

    Chris Reuter

    05/17/2022, 5:22 PM
    Hi all - today we're releasing the 2022 State of Workflow Orchestration report in conjunction with Gradient Flow. This was a survey of ~600 data engineers, data scientists and analysts focusing on the orchestration market. It measures use case, feature importance, job title of users, company size and industry, and more. You can read our blog summary here, and for the community we're posting the report directly in Slack hoping that you can make use of it. This may help you to explain what orchestration is, make a business case for dataflow automation tooling internally at your own organizations, or just be some pleasure reading 😉 Please feel free to share with your friends - they can download from this landing page.
    2022 State of Workflow Orchestration by Gradient Flow.pdf
  • v

    Valentin Baert

    05/19/2022, 11:54 AM
    Hi, I'm evaluating several tools to help build a team that deals with data integration. In the toolbox, we will probably have Prefect Orion. I have conducted a POC where I deploy a flow on a Google Kubernetes Engine cluster, which uses a Google Cloud bucket as the storage and connects to the Prefect Cloud Orion API. The flow then consumes a Kafka topic hosted by Confluent Cloud and, for each Kafka message, starts a @task to process the message (just logging for this POC). I have written up the steps for my POC here in case it helps other people: https://gitlab.com/idkw/prefect-orion-gke-poc
    ā¤ļø 5
    :thank-you: 2
    šŸš€ 5
    :upvote: 2
    a
    k
    • 3
    • 8
a

Anna Geller

05/19/2022, 11:56 AM
This is a fantastic resource, can't wait to see how you progress with the real-time streaming use case - excited to assist you along the way!
v

Valentin Baert

05/19/2022, 12:00 PM
Thanks, in our other thread I was wondering what I should do about this issue: https://prefect-community.slack.com/archives/CL09KU1K7/p1652949937007829?thread_ts=1651566181.776459&cid=CL09KU1K7 I don't know if I should have a @flow wrapping the Kafka infinite polling loop and then run a task for every individual message (but then how do I cleanly stop the infinite flow when I need to update the code?) or whether there is a better way to handle event streaming. I was under the impression Prefect 2.0 was bringing new answers to this, but I'm a bit lost here.
Should I use a sub-flow or is there a better way?
a

Anna Geller

05/19/2022, 12:01 PM
let's continue the discussion in our previous thread, this one is only for sharing blog posts etc - but all great questions
v

Valentin Baert

05/19/2022, 12:01 PM
ok
k

Kevin Kho

05/19/2022, 1:31 PM
That's nice! Good to see Kafka + Prefect, which was awkward in Prefect 1.0
v

Valentin Baert

05/19/2022, 1:49 PM
I have added an additional sample, prefect_2_kafka_kub_no_deployment.py, which is a long-running Kafka consumer that starts a flow when receiving a Kafka message but requires neither a deployment nor an agent. It is based on the advice given by Anna. I think it's more suitable for my streaming use case.
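A minimal sketch of the long-running consumer pattern described here, including one way to address the earlier question about stopping the loop cleanly. It is not the content of prefect_2_kafka_kub_no_deployment.py: an in-memory `queue.Queue` stands in for the Kafka topic, `handle` stands in for whatever starts the flow, and the names `consume` and `poll_timeout` are hypothetical.

```python
import queue
import threading
from typing import Callable


def consume(
    source: "queue.Queue[str]",
    handle: Callable[[str], None],
    stop: threading.Event,
    poll_timeout: float = 0.1,
) -> int:
    """Poll `source` until `stop` is set, calling `handle` once per message.

    Returns the number of messages handled.
    """
    processed = 0
    while not stop.is_set():
        try:
            # A short timeout keeps the loop responsive to the stop signal.
            message = source.get(timeout=poll_timeout)
        except queue.Empty:
            continue
        handle(message)  # e.g. call a flow function for this message
        processed += 1
    return processed
```

In a real process, the stop event could be set from a signal handler, for example `signal.signal(signal.SIGTERM, lambda *_: stop.set())`, so that a redeploy lets the message in progress finish before the loop exits.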
a

Anna Geller

05/19/2022, 1:57 PM
nice work! 🙌