System that builds a dataset containing visually-confirmed equipment losses for Russia and Ukraine. Built on Prefect, AWS, and Terraform. It refreshes daily and publishes the latest data to Kaggle.
Future plans for it include
• Apply OCR algorithms to find dates in timestamped images
• Expand the evidence extraction to handle X (Twitter) posts
• Enhance loss information with specific data about equipment
• Abstract schema documentation generator to own library and improve features
https://github.com/dominictarro/Borderlands
https://www.kaggle.com/datasets/dominictarro/borderlands