Skip to main content

Transportation Guide

Overview

The Overture transportation theme is the collection of LineString and Point features that describe the infrastructure and conventions of how people and objects travel around the world. The dataset contains two features types: connector and segment. The three subtypes within segment -- road, rail, and water -- contain familiar categories of transportation data: highways, footways, cycleways, railways, ferry routes, and public transportation.

Most of the data in the transportation theme is sourced from OpenStreetMap. In the 2024-09-18.0 release, we began adding data from TomTom to improve coverage in key areas.

You might use the Overture transportation data for:

  • mapping: rendering a map of connected roads and paths.
  • routing: calculating optimal routes from place to place.
  • navigation: generating granular instructions on the maneuvers needed to follow a route.
  • analytics: transportation-related analysis including traffic safety analysis and disaster planning.
  • geocoding: getting the coordinates of street intersections (geocodes) or the street intersection near specific coordinates (reverse geocodes).

This guide is an overview of the transportation data. To dig into the details of the schema, see the schema concepts for transportation and the reference documentation for the segment and connector feature types.

Dataset description

All Overture data, including transportation data, is distributed as GeoParquet, a column-based data structure. Below you'll find a table with column-by-column descriptions of the properties in the transportation feature type.

Schema for GeoParquet files in the transportation theme
column_namecolumn_typeDescription
idVARCHARA feature ID. This may be an ID associated with the Global Entity Reference System (GERS) if—and-only-if the feature represents an entity that is part of GERS.
geometryWKBThe line representation of the segment's location. Segment's geometry which MUST be a LineSting as defined by GeoJSON schema.
bboxSTRUCTArea defined by two longitudes and two latitudes: latitude is a decimal number between -90.0 and 90.0; longitude is a decimal number between -180.0 and 180.0.
versionINTEGERVersion number of the feature, incremented in each Overture release where the geometry or attributes of this feature changed.
sourcesSTRUCT[]The array of source information for the properties of a given feature, with each entry being a source object which lists the property in JSON Pointer notation and the dataset that specific value came from. All features must have a root level source which is the default source if a specific property's source is not specified.
subtypeVARCHARThe broad category of transportation segment.
classVARCHARCaptures the kind of road and its position in the road network hierarchy.
subclassVARCHARSpecifies the usage of a length of road.
subclass_rulesSTRUCT[]Defines the portion of a road that the subclass applies to.
namesSTRUCT[]Properties defining the names of a feature.
connectorsSTRUCT[]Array of connector IDs identifying the connectors this segment is physically connected to linearly referenced with their location. Each connector is a possible routing decision point, meaning it defines a place along the segment in which there is possibility to transition to other segments which share the same connector.
routesSTRUCT[]Routes this segment belongs to.
access_restrictionsSTRUCT[]Rules governing access to this road segment or lane.
level_rulesSTRUCT[]Defines the Z-order, i.e. stacking order, of the road segment.
prohibited_transitionsSTRUCT[]Defines where traveling from the segment to another is disallowed for navigation. This covers things situations prohibited turns or a transition from road to bike lane disallowing cars.
road_surfaceSTRUCT[]Defines the surface material on a road such as paved, asphalt, or unpaved.
road_flagsSTRUCT[]Additional properties relevant to roads such as is_bridge or is_under_construction.
speed_limitsSTRUCT[]Defines the speed limit of the road segment.
width_rulesSTRUCT[]Defines the width of the road segment for rendering.
destinationsSTRUCT[]Describes the transitions from one segment to another on the way to a specified location. This data is primarily used for routing.

Subtypes

Transportation segments are divided into three subtypes: rail, water, and road. The road subtype is then further divided into a variety of different classes based on usage captured in the table below.

Class and subclass feature counts
subtypeclasssubclassFeature count, November 2024 release
rail1,572,582
roadbridleway95,598
roadcyclewaycycle_crossing47,189
roadcycleway1,198,943
roadfootwaycrosswalk1,839,806
roadfootwaysidewalk2,961,562
roadfootway14,555,896
roadliving_street3,053,898
roadmotorwaylink626,752
roadmotorway428,359
roadpath12,727,710
roadpedestrian448,737
roadprimarylink470,293
roadprimary6,511,508
roadresidential123,774,708
roadsecondarylink369,680
roadsecondary10,438,045
roadservicealley1,541,197
roadservicedriveway15,160,138
roadserviceparking_aisle5,978,042
roadservice31,689,642
roadsteps1,691,848
roadtertiarylink283,157
roadtertiary19,199,059
roadtrack23,956,033
roadtrunklink535,332
roadtrunk3,342,592
roadunclassified28,763,594
roadunknown546,181
water27,337

Data access and retrieval

The latest transportation data can be obtained from AWS or Azure as GeoParquet files at the following locations.

ProviderLocation
Amazon S3
s3://overturemaps-us-west-2/release/2024-11-13.0/theme=transportation/type=segment/
Azure Blob Storage
https://overturemapswestus2.blob.core.windows.net/release/2024-11-13.0/theme=transportation/type=segment/

Data usage guidelines

We recommend downloading only the Overture data you need. If you have a particular geographic area of interest, there are several options for using a simple bounding box to extract places data and output a GeoJSON file.

First, follow the setup guide for the Python Command-line Tool.

Set type to either segment or connector and simply alter the bbox value to download a particular area.

overturemaps download --bbox=12.46,41.89,12.48,41.91 -f geojson --type=segment -o rome_segments.geojson

Data manipulation and analysis

Querying by properties in DuckDB

These examples use data properties to filter the data in useful ways using DuckDB.

The class column can be used to pull out subsets of the road data. Similarly, you could use subtype to select only water, rail, or road features. This example extracts only the parking_aisle features within the bounding box.

LOAD spatial; -- noqa
LOAD httpfs; -- noqa
-- Access the data on AWS in this example
SET s3_region='us-west-2';

COPY (
SELECT
*
FROM read_parquet('s3://overturemaps-us-west-2/release/2024-11-13.0/theme=transportation/type=segment/*')
WHERE
class = 'parking_aisle' AND
bbox.xmin > 13.0897 AND bbox.xmax < 13.6976 AND
bbox.ymin > 52.3100 AND bbox.ymax < 52.7086
)
TO 'berlin_parking_aisles.parquet';

Querying by properties in Athena

Athena can allow for faster querying of the transportation layer than DuckDB given the size of the data. These examples are designed for Athena, but could be reworked for DuckDB with some tweaking.

To properly return a linear referenced feature like a speed limit, we will need to query all the possible values of the feature as the queried value may only exist on one portion of the line. In this example, we're extracting roads with any speed limit max_speed value of 27 and unit of mph using the any_match function.

This same general query would also work for querying other similar columns such as prohibited_transitions and access_restrictions.

SELECT
id,
speed_limits,
ST_GEOMFROMBINARY(geometry) AS geometry
FROM v2024_11_13_0
WHERE type = 'segment'
AND ANY_MATCH(
speed_limits,
speed_limit->speed_limit.max_speed.value = 27
AND speed_limit.max_speed.unit = 'mph'
)

Tools and libraries

transportation-splitter

Overture Splitter
Conceptual diagram of the splitter tool output. The numbers following 1234@ represent start_lr and end_lr values.

The transportation-spitter tool transforms Overture road data into simpler sub segments. It will optionally divide features at each connector point and at each change of a scoped based property, depending on configuration. Depending on your needs and map stack, the resulting dataset may be easier to manipulate than the original Overture data as each segment will only have connections at either end and have one set of properties for its entire length.

Since a GERS ID will no longer be unique with this output, the resulting data will have two additional columns: start_lr and end_lr which are linear references describing which section of the orginal feature this new segment comes from.

Splitter Example

To help visualize this process better, here is a real world example of a residential street in OpenStreetMap, Overture, and after being run through the splitter tool.

OpenStreetMap Splitter Example
In OpenStreetMap this residential road is represented by two different features with the same tags with feature 1 having an additional restricted access tag.

More Information and Feedback

The tool requires a Spark environment to run and has been tested using Azure Databricks and AWS Glue. For set up information the transportation-spitter GitHub will contain the most up to date information as the tool is in active development still.

If you have feedback, questions, etc. on the tool you can create an issue on the GitHub.