
Johnwhelan's Diary

Recent diary entries

Sometimes I think I’m a bottom feeder, going through the map and cleaning up the obvious errors. This entry comes from my observations over time and from cleaning up thousands of duplicate buildings.

Because many countries do not have good census data, you can do a rough population calculation by multiplying the number of houses by the average number of people who live in each one. It isn’t perfect, but if you have nothing better then it works.
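As a rough illustration of that calculation, here is a minimal Python sketch. The building counts and the average household size are made-up numbers for the example, not figures from any real area, and the function name is hypothetical.

```python
def estimate_population(building_count, avg_people_per_house):
    """Rough population estimate: number of houses times average occupancy."""
    if building_count < 0 or avg_people_per_house < 0:
        raise ValueError("inputs must be non-negative")
    return building_count * avg_people_per_house


# Hypothetical area: 1000 real houses, assumed 5 people per house.
true_estimate = estimate_population(1000, 5.0)

# The same area with 200 duplicate buildings left on the map.
inflated_estimate = estimate_population(1200, 5.0)

# Each duplicate adds one phantom household to the total.
overcount = inflated_estimate - true_estimate
```

With these assumed numbers the duplicates add 1000 people who do not exist, which is why the cleanup matters for anyone using building counts as a census substitute.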

So duplicate buildings are a problem. When you’re looking to see how many schools you need, duplicate buildings mess the numbers up. Buildings that are incorrectly tagged or not tagged at all also mess the numbers up.
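To make the idea of a duplicate concrete, the sketch below flags pairs of buildings whose centroids sit within a few meters of each other. This is only an illustration under stated assumptions: it is not how the JOSM validator works, the 5 meter threshold is an arbitrary choice, the function names are hypothetical, and the coordinates are invented. It compares every pair, so it only suits small extracts.

```python
import math


def centroid(ring):
    """Mean of the vertices as (lon, lat); drops a repeated closing node."""
    pts = ring[:-1] if len(ring) > 1 and ring[0] == ring[-1] else ring
    lon = sum(p[0] for p in pts) / len(pts)
    lat = sum(p[1] for p in pts) / len(pts)
    return lon, lat


def distance_m(a, b):
    """Approximate distance in meters between two (lon, lat) points.

    Uses an equirectangular approximation, adequate over tens of meters.
    """
    mean_lat = math.radians((a[1] + b[1]) / 2)
    dx = math.radians(b[0] - a[0]) * math.cos(mean_lat)
    dy = math.radians(b[1] - a[1])
    return 6371000 * math.hypot(dx, dy)


def likely_duplicates(buildings, max_distance_m=5.0):
    """Return pairs of building ids whose centroids are close together."""
    cents = {bid: centroid(ring) for bid, ring in buildings.items()}
    ids = sorted(cents)
    pairs = []
    for i, first in enumerate(ids):
        for second in ids[i + 1:]:
            if distance_m(cents[first], cents[second]) <= max_distance_m:
                pairs.append((first, second))
    return pairs


def square(lon, lat, size=0.0001):
    """Closed square ring with its lower-left corner at (lon, lat)."""
    return [(lon, lat), (lon + size, lat), (lon + size, lat + size),
            (lon, lat + size), (lon, lat)]


# Invented example: building 2 is building 1 traced again about 2 m east,
# building 3 is a separate building about 110 m away.
example = {
    1: square(6.50000, 5.20000),
    2: square(6.50002, 5.20000),
    3: square(6.50100, 5.20000),
}
duplicates = likely_duplicates(example)
```

A centroid test like this will miss a small building traced inside a large one and may flag genuinely adjacent small huts, so the results still need a human look.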

Then we get to the imagery used. These days I’m seeing more and more Microsoft BuildingFootprints tags on buildings. They are normally very accurate and align well with Bing imagery. The largest, most accurate satellites are only accurate to 60 meters, so the imagery has to be aligned. My recommendation would be to use Bing imagery and find a building that aligns with it. Then switch to your preferred imagery and align it with the Bing-aligned building.

This helps ensure that buildings are only mapped once. When you add buildings you don’t add one that already exists, and if someone later imports using Microsoft BuildingFootprints then your building won’t be remapped 10 meters away.

Disposable mappers are often used, meaning many will only map a few times. So we don’t really have time to train them.

We want to get the most accurate mapping we can from them and as many buildings as we can. With the JOSM buildings_tools plugin I can highlight one building, then add more by pressing and holding the mouse button on one corner, moving to the opposite corner of the building and releasing the mouse button. If they aren’t in line it takes another mouse button click. You get a rectangular building, correctly labelled. This is far less work than using iD. Plus you don’t need a validator to inspect each one to see if it is correct.


cleaning up after a task manager task

Posted by Johnwhelan on 1 January 2022 in English. Last updated on 3 January 2022.

In an ideal world all tasks would be validated to a high standard, but unfortunately this doesn’t happen.

I’ve seen HOT projects with nearly two thousand duplicate buildings, and I’ve seen some with two or three hundred untagged buildings. Some are well mapped and have no errors.

So this is a method to clean up after the event. Basically you load the area into JOSM and run the duplicate building script and the JOSM validator. It won’t catch everything but it’s a lot faster than validating each tile. The todo and Mapathoner plugins are required.

Select the errors, then add them to the todo list and work your way through them.

The key is "aoiBBOX": [6.473334,5.172193,6.870011,5.730142], which is found here: https://tasking-manager-tm4-production-api.hotosm.org/api/v2/projects/10756 for project 10756. You’ll need to search the text to extract it.
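Rather than searching the text by eye, the value can be pulled out with a few lines of Python. This is a minimal sketch that assumes the API response is a JSON object with a top-level "aoiBBOX" key, as the snippet quoted above suggests. The sample string is a trimmed-down stand-in, not the real response, and the function name is hypothetical.

```python
import json


def extract_aoi_bbox(project_json):
    """Return the four aoiBBOX numbers from a project JSON document."""
    data = json.loads(project_json)
    bbox = data["aoiBBOX"]  # assumed to be a top-level key
    if len(bbox) != 4:
        raise ValueError("expected four coordinates in aoiBBOX")
    return [float(value) for value in bbox]


# Stand-in for the text saved from the project URL (real responses are
# much longer; only the key of interest is shown here).
sample_response = '{"aoiBBOX": [6.473334, 5.172193, 6.870011, 5.730142]}'

bbox = extract_aoi_bbox(sample_response)

# Comma-separated form, ready to paste into a -b= style option.
bbox_text = ",".join(str(value) for value in bbox)
```

In practice you would save the page from the URL above to a file, or fetch it with your preferred HTTP tool, and pass its contents to extract_aoi_bbox.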

You can either feed these coordinates into JOSM and download the area directly, feed them into an Overpass query, or extract the area from an offline version of the map. https://osm-internal.download.geofabrik.de/ is one source. If you work with the offline version you can locate the errors, then just redownload the tiny bit directly to ensure the map is up to date before you correct the error.
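For the Overpass route, one thing to watch is coordinate order. The sketch below assumes the aoiBBOX values are ordered west, south, east, north (the same order given to osmconvert in this entry), while Overpass QL expects south, west, north, east. The query itself is just one plausible way to fetch buildings with their nodes; the timeout value and function name are arbitrary choices for the example.

```python
def overpass_building_query(aoi_bbox, timeout_s=180):
    """Build an Overpass QL query for all building ways inside the bbox.

    aoi_bbox is assumed to be [west, south, east, north]; Overpass wants
    (south, west, north, east), so the values are reordered here.
    """
    west, south, east, north = aoi_bbox
    return (
        f"[out:xml][timeout:{timeout_s}];\n"
        f'(way["building"]({south},{west},{north},{east}););\n'
        "(._;>;);\n"  # also fetch the nodes the ways refer to
        "out meta;"   # include version info so JOSM can edit the data
    )


query = overpass_building_query([6.473334, 5.172193, 6.870011, 5.730142])
```

The resulting text can be pasted into JOSM’s download dialog in expert mode or into Overpass Turbo. For an area this size the download may be large, which is one reason the offline extract can be the more comfortable option.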

An example of a .bat file to extract the area from an offline file is:

osmconvert64 e:\downloads\nigeria-latest-internal.osm.pbf -b=6.473334,5.172193,6.870011,5.730142 -o=f:\maps\nigeria10756.osm

Then just load nigeria10756.osm as a local file into JOSM.

Note this is not validation, since no feedback is given to the mapper in task manager. It is best used when there is no active mapping taking place, so a month or so after the mapping has petered out is a good time.

The story actually goes back more than five years, to when it was realised that some Open Data was more Open than others because of licensing issues. The City of Ottawa gave its bus stops and some other information to Google in GTFS format. Because of the need to announce bus stops for improved accessibility, all the bus stops were very accurately re-calibrated. This made the bus stops a very attractive, high-quality import, but since the City of Ottawa’s Open Data license did not align with OSM’s it couldn’t be done. It did, however, provide the motivation to get the licenses sorted out.

The Canadian Treasury Board is responsible for standards and open data within the federal government in Canada, and they set about consulting with many would-be users to come up with the 2.0 license. They have been working with a number of African governments on Open Data licensing, by the way.

Once this license was in place, Ottawa city council acted to ensure that all users had equal access to their data, i.e. bus stops, by releasing the data under a similar license, and even that took a year or two to do.
