The use of open source GIS algorithms, big geographic data, and cluster computing techniques to compile a geospatial database that can be used to evaluate upstream bathing and sanitation behaviours on downstream health outcomes in Indonesia, 2000–2008