Issues now are how to scale things up. I adopted a results output strategy which appends but for large numbers of workers that causes conflicts as they all output to the same file.
Also presumably if I use map with larger amounts of data I will run into the same issue.