# ask-community
j
Hi Prefect community, for anyone attending this week's Dask Summit online conference, I'll be giving one of the keynote presentations tomorrow morning at 9am ET along with my colleague @Jie Lou on our use of Prefect & Dask for machine learning in healthcare. Prefect (and Dask) have become key parts of our technology stack and have delivered great results for our business. Paid tickets are required to attend live tomorrow, but I think the talks will get posted to the Dask YouTube channel for free access after the conference: https://summit.dask.org/schedule/presentation/62/dask-prefect-for-healthcare-machine-learning-on-aws/
upvote 1
👍 3
k
Could you also post in #CL09KTZPX so it does not get drowned out by the support stuff?
d
Hey, Joe! 👋
👋 1
e
Will this be uploaded somewhere? I missed it by a hair's breadth 😢
k
All the Dask Summit talks will be uploaded to their YouTube channel in a couple of weeks. If you're registered for the Dask Summit, they're pretty good about posting the talks at the end of each day.
e
Hello, did this ever get posted to YouTube? Please share a link if it did! Thanks!
k
As far as I know it hasn't been yet, but they will in the next month or two (they said two months after the event).
e
Ahh! I thought I had read 2 weeks!
k
Did you sign up for it though? Cuz they have the Zoom recordings on their platform if you did
e
@Kevin Kho offhand, do you know of any repos or code examples using Prefect/Dask with ML? I'm curious what people's best practices are regarding Prefect flows and sklearn pipelines.
I didn't. I wish I would have, but I didn't know about it :(
k
Are you using Pandas or Dask DataFrames? I guess Pandas cuz sklearn?
e
Well, that's the interesting part to me: it could be either. I've been looking at the dask_ml docs and have a few flows I've made using Dask as the backend, pointing to a Dask cluster instead of using joblib. I know there's some integration of sklearn with Dask DataFrames, so really examples of either would be awesome.
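For reference, a minimal sketch of that kind of setup, assuming Prefect 1.x flow syntax, a toy sklearn model, and a hypothetical Dask scheduler address (`tcp://dask-scheduler:8786`); sklearn's parallelism is routed to the cluster via the joblib Dask backend:

```python
import joblib
from dask.distributed import Client
from prefect import Flow, task
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier


@task
def train_model():
    # Connect to an existing Dask cluster (hypothetical scheduler address)
    Client("tcp://dask-scheduler:8786")

    # Toy dataset and model purely for illustration
    X, y = make_classification(n_samples=10_000, n_features=20)
    model = RandomForestClassifier(n_estimators=200, n_jobs=-1)

    # Route sklearn's joblib-based parallelism to the Dask cluster
    with joblib.parallel_backend("dask"):
        model.fit(X, y)
    return model


with Flow("sklearn-on-dask") as flow:
    train_model()

# flow.run()  # or register the flow with a project for scheduled runs
```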
k
I only have this
🙌 1
e
Awesome, that's definitely helpful! That's what I was wondering: whether most people are doing something along the lines of what you shared, which is essentially rolling your own pipeline with a Prefect flow.
k
Yes, but it's more common to persist the data between tasks and load it in downstream tasks rather than pass DataFrames around.
e
When you say persist the data between tasks, you mean something like writing to a parquet file somewhere and passing the location of that file instead of passing a df, correct?
k
Yes exactly!
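A minimal sketch of that pattern, assuming Prefect 1.x flow syntax and pandas with a Parquet engine (pyarrow or fastparquet) installed; the local file paths are placeholders, in practice they would often be S3 or similar URIs:

```python
import pandas as pd
from prefect import Flow, task


@task
def extract() -> str:
    df = pd.DataFrame({"record_id": [1, 2, 3], "score": [0.2, 0.5, 0.9]})
    path = "/tmp/scores.parquet"  # placeholder; often an S3/GCS URI in practice
    df.to_parquet(path)
    # Return the location, not the DataFrame itself
    return path


@task
def featurize(path: str) -> str:
    # Downstream task reloads the data from the persisted location
    df = pd.read_parquet(path)
    df["score_scaled"] = df["score"] / df["score"].max()
    out_path = "/tmp/scores_features.parquet"
    df.to_parquet(out_path)
    return out_path


with Flow("persist-between-tasks") as flow:
    featurize(extract())
```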
e
You are awesome, Kevin! Thanks for the help!
🙏 1