Identify traffic and safety issues in less time with Urban SDK traffic management software.
Using mobile observation data to estimate traffic volume for travel demand and transportation planning studies.
Transportation planners need accurate and comprehensive data about traffic volumes on a statewide basis. Traditionally, this need has been met by installing permanent and portable traffic counting sensors on state roadways. The paper explores how data collected from mobile devices can supplement installing and maintaining traffic counting sensors.
The existing number of mobile devices (e.g., consumer smartphones, commercial navigation devices, fleet monitoring systems), activation of devices using location based services (GPS, Bluetooth, WiFi), and consumer adoption of applications and web services using location services (mobile apps, mobile web, i.e. Maps, Audio, Weather, Device Tracking) enables planners to estimate traffic flows from mobile devices appended to traffic message channels.
Urban SDK estimates traffic volumes from mobile device samples derived from global positioning system (GPS)-based mobile devices, observations of devices using location based services.
Real-time volume data remains the key missing dimension in operations data that would greatly improve the accuracy of assessing transportation system performance. Although agencies have invested in fixed sensors, volume data remains relatively sparse and of varying quality on the majority of the freeway and major arterial networks. Data on spatial mobility are essential in order to build and use travel demand forecasting models, for transport planning purposes and for the appraisal of transport policies. Anticipated volume does not reflect traffic volume fluctuations during weather events, major incidents, or even normal day to day fluctuations in traffic.
The use of mobile phone data to construct origin-destination matrices in an urban region was first proposed in Italy by Bolla and Davoli (2000) and tested on a small sample in (White and Wells, 2002) with the aim of studying traffic on specific roads. In 2002, Akin and Sisiopiku (2002) selected just 500 individuals in the city of Birmingham in the United States. One of the first studies to use the whole population rather than a sample was carried out in Israel in 2007 (Bar-Gera, 2007). The research in question set out to estimate the traffic and obtain mean speed data on a 14 km road in Israel with 10 interchanges. Calabrese et al. (2011) were the first to produce O-D matrices from a detailed dataset, for the Boston region in Massachusetts. In 2002, two simultaneous research projects attempted to extract origin-destination matrices from mobile phone network data. One of these (Akin, Sisiopiku, 2002), working in the city of Birmingham (USA), developed an algorithm which calculated origins and destinations and divided the day into periods. To compute the subject’s position during each time periods, they took the largest number of connections in a zone.
Our aim is to expand on the historical work to provide cities, counties, state agencies, DOT, and metropolitan planning agencies with the greatest access to mobile data to improve planning and reduce total reliance on surveys and modeling.
Urban SDK evaluated the accuracy of average annual daily traffic (AADT) volume estimates from Urban SDK Traffic Data using actual volume counts from FDOT traffic monitoring sites. The sites were grouped according to traffic volume levels since the magnitude of error appeared to be correlated to traffic volume (i.e., low-volume roads typically had higher estimation error).
The mean absolute percent error for the AADT estimates was 41% for all sites but ranged from 22% at high-volume sites to 79% at low-volume sites. The mean error was strongly influenced by numerous outliers in all volume categories.
Traffic volume estimation from mobile devices can provide accuracy and granularity to estimate traffic volumes. Some of the traffic volume estimates are within acceptable error ranges (10% to 35% absolute percent error), but other estimates on roads of 400-5000 AADT are significantly outside this acceptable error range (greater than 100% absolute percent error). Lower volume roadways will have the highest margin of error due to lower mobile device sample sizes.
A multi-year backfill of all observed devices is recommended to more accurately forecast AADT. To ensure accuracy, traffic-volume estimates should be calibrated locally against permanent, specifically selected traffic count sites with a minimum of 4 months mobile observation data.
Accuracy is impacted negatively when generating traffic volume estimates on a statewide basis. Any comparison should allow for selecting and controlling the characteristics of the comparison sites could have led to lower estimation error and develop a better understanding of where algorithm improvements are most needed.
Young, Stanley E., Kaveh Sadabadi, Przemysław Sekuła, Yi Hou, and Denise Markow.
2018. “Estimating Highway Volumes Using Vehicle Probe Data – Proof of Concept: Preprint.” Golden, CO: National Renewable Energy Laboratory. NREL/CP-5400-70938. URL. https://www.nrel.gov/docs/fy18osti/70938.pdf
This section describes how we generate traffic volume estimates.
Urban SDK provides analytics for estimating traffic volumes. Traffic volume estimation models are the intellectual property of Urban SDK and are considered confidential. Our overall approach to traffic volume estimation is as follows:
1. Combine GPS-enabled, location based services (LBS) and advertising data into “Traffic” data. These are distinct datasets that Urban SDK aggregates from source data providers. 2. Normalize traffic data by US Census population estimates. This provides the first scaling factor that attempts to account for the mobile device sampling.
3. Normalize traffic data by roadway Traffic Message Center (TMC) code for mobile device sampling by roadway.
4. Calibrate the mobile device samples using public agency traffic volume sources. This provides the second scaling factor that attempts to account for the mobile device scaling. The public agency traffic volumes typically come from permanent traffic monitoring sites, where there is greatest confidence in the traffic volume accuracy.
Urban SDK used traffic counts from 1,077 FDOT monitored road segments to calibrate the traffic volume estimates. The locations had originally been identified for the purposes of evaluation/validation of traffic volume levels. Permanent and temporary monitoring sites with annual average daily traffic (AADT) volumes less than 400 vehicles per day are removed from any comparison due to poor prediction results.
Figure 1.1 shows a scatterplot of Urban SDK Mobile Data AADT estimates as compared to actual FDOT AADT values. For the purposes of this analysis, the results were divided into five traffic volume level categories:
Table 1.1 summarizes the accuracy measures for each traffic volume level category, then for all short duration count sites combined.
There are several key findings regarding the comparison and resulting accuracy measures:
Figures 1.2 through 1.6 illustrate the wide range of error values in this initial comparison. These charts illustrate that a small number of comparisons had much higher error values than the majority of comparisons.