System that builds a dataset containing visually-confirmed equipment losses for Russia and Ukraine. Built on Prefect, AWS, and Terraform. It refreshes daily and publishes the latest data to Kaggle.
Future plans for it include
• Apply OCR algorithms to find dates in timestamped images
• Expand the evidence extraction to handle X (Twitter) posts
• Enhance loss information with specific data about equipment
• Abstract schema documentation generator to own library and improve features
https://github.com/dominictarro/Borderlandshttps://www.kaggle.com/datasets/dominictarro/borderlands
Bring your towel and join one of the fastest growing data communities. Welcome to our second-generation open source orchestration platform, a completely rethought approach to dataflow automation.