So this gif above was part of a larger project (which i finally finished) that was the final task of a course I took in data engineering.
I basically pulled as much data as I could from r/dataengineering subreddit, built pipelines with prefect and dbt, did some NLP shenanigans and displayed the results in a Superset dashboard.
You can find the link to the project here and the dashboard here.
Any feedback is very welcome!
04/26/2023, 7:20 PM
dang this is awesome!! will check out your NLP shenanigans 👀
04/26/2023, 9:15 PM
Thanks for the feedback, glad you enjoyed it !
(tbf the NLP shenanigans are nothing more than playing around with TFIDF and some custom logic i came up with on the fly 😅)