Cookie Settings
close

By clicking "Accept", you agree to the storage of cookies on your device to improve site navigation, analyze site usage and assist with our marketing efforts. See our privacy policy for more information.

Replica Data Schema

Replica produces large-scale models that accurately represent mobility, economic activity, people, and land use in detail throughout the United States.

Replica customers have access to a number of datasets, which are described below. Click to see detailed descriptions and schema.

Manhattan
Fall 2021
Weekday

High-fidelity travel model with simulated population and trip-taking activity​

Replica Places is a high-fidelity simulation that accurately represents the population and its travel patterns. Customers use Places data to improve planning and monitoring of transportation and land use systems, and to make decisions related to the ways people interact with the built environment.

Click to read more

Places is the world’s only calibrated travel demand model delivered at nationwide scale offered via software-as-a-service (SaaS). It is delivered in a pioneering privacy-sensitive approach.

The Places dataset is a complete trip and population table for a typical weekday and typical weekend day for the selected season and region. Places data is accessible through Replica’s web-based platform and through direct database access.

Replica Places megaregions cover geographies that include between 10 million and 50 million residents. Each data table is cross-tabular, meaning it is linked to other tables through common IDs, including person_id (the person making the trip), activity_id (the trip), or route_id (transit routes or network links used for a trip). 

Data can be filtered on any single attribute or combination of attributes. Each row of data in the simulation output reflects a single trip, with characteristics about the trip (e.g., origin, destination, mode, purpose, routing, duration) and trip taker (e.g., age, race/ethnicity, income, home location, work/school location).

The Places trip table contains the unique records and associated attributes for the trips between origin and destination points. Trip data represents the modeled output for a typical Thursday or Saturday during a three-month Replica season.

File Name
Content Type
Sample Value
Description
activity_id
String
15323941267251300000

A randomly assigned unique identifier defined for each trip. A trip is defined as travel between two locations with a discrete purpose at the origin and destination. Some intermediate stops with short dwell time may be captured within a single trip. Criteria for this dwell time may vary. Examples:

  • Home > Work
  • Work > Gym
  • Shop > Eat
  • Home > Starbucks drive-thru > Work
person_id
String
4339817784218880000

A randomly assigned unique identifier for each person modeled within the synthetic population.

household_id
String
14408018579505000000

A randomly assigned unique identifier for each household modeled within the synthetic population.

mode
String
PUBLIC_TRANSIT

The primary transportation mode used for the trip.

A trip is often made up of multiple travel modes, such as a journey to work that includes a walk to a rail station followed by a subway ride. In this case two modes were used by the traveler during their trip, walking and public transit. Only the primary mode of travel across a set of trip segments is included.
The primary mode is determined using the following ranking: 1) Public transit, 2) Driving (private auto)/Auto passenger/Taxi/TNC, 3) Biking, 4) Walking.

Valid values are:

  • BIKING: Trips made by people biking. Replica does not model scooter trips and does not separate out e-bike trips
  • CARPOOL: Trips made by passengers in private auto vehicles. Sum Carpool and Private Auto trips to get the total number of people who traveled in private autos
  • COMMERCIAL: Trips made by medium and heavy trucks
  • ON_DEMAND_AUTO: Trips made by passengers in a Taxi or using a Transportation Network Company (TNC) such as Uber or Lyft
  • OTHER_TRAVEL_MODE: Catch-all category for all other trips not assigned any of the modes listed herein
  • PRIVATE_AUTO:Trips made by drivers in private auto vehicles
  • PUBLIC_TRANSIT:Trips that primarily used public transit, such as buses, light rail, and subways
  • WALKING:Trips made by people walking
travel_purpose
String
WORK

The destination activity assigned to a synthetic person. For example, if a person is traveling to work, the purpose of the trip is ‘Work,’ whereas if a person is traveling to a restaurant the activity is ‘Eat.’

Valid values are:

  • COMMERCIAL: Trips by medium and heavy trucks for deliveries and other commercial purposes
  • EAT: Trips to restaurants
  • HOME: Trips where the destination is the person’s home
  • LODGING: Trips by visitors to overnight accommodation such as a hotel
  • MAINTENANCE: Trips to hairdressers, auto shops, banks, and a variety of other locations for the purpose of conducting errands
  • OTHER: Catch-all category for all other trips not assigned any of the purposes listed herein
  • RECREATION: Trips to recreational destinations such as parks and swimming pools
  • REGION_DEPARTURE: Trips by visitors to a port-of-exit, such as an airport, or major train station
  • SCHOOL: Trips to schools such as community colleges and universities
  • SHOP: Trips to stores and other commercial centers
  • SOCIAL: Trips for social activities
  • STAGE: Trips made by non-residents that start and end outside the region
  • WORK: Trips where the destination is the person’s workplace
  • WORK FROM HOME: Trips where a resident is returning home to work (specifically for residents who are working from home on the modeled day). For example, if a person goes out to lunch during the work day, their return trip home will be labeled “work_from_home”
tour_type
String
COMMUTE

Indicates what type of tour that included this trip. Tours are travel events that start at one location and return to that same location. For example, when a person travels to work and returns home, this a home-based work tour. A tour can have two or more trips.

Valid options are:

  • WORK_BASED: Tours that started and ended at work, excluding commute. For example: Work > Eat > Work
  • COMMUTE: Tours from home to work and back home. Also includes intermediate stops at other destinations. For example: Home > Recreation > Work > Shop > Home
  • OTHER_HOME_BASED: Tours that started and ended at home, excluding commutes. For example: Home > Shop > Eat > Home
previous_activity_type
String
HOME

The activity assigned to a synthetic person prior to starting travel.

Valid values are:

  • COMMERCIAL: Trips by medium and heavy trucks for deliveries and other commercial purposes
  • EAT: Trips to restaurants
  • HOME: Trips where the destination is the person’s home
  • LODGING: Trips by visitors to overnight accommodation such as a hotel
  • MAINTENANCE: Trips to hairdressers, auto shops, banks, and a variety of other locations
  • OTHER: Catch all category for all other trips not assigned any of the purposes listed herein
  • RECREATION: Trips to recreational destinations such as parks and swimming pools
  • REGION_DEPARTURE: Trips by visitors to a port-of-exit, such as an airport, or major train station
  • SCHOOL: Trips to schools such as community colleges and universities
  • SHOP: Trips to stores and other commercial centers
  • SOCIAL: Trips for social activities
  • STAGE: Trips made by non-residents that start and end outside the region
  • WORK: Trips where the destination is the person’s workplace
timezone
String
America/Los_Angeles

Timezone of the trip’s start location, using standard tz naming conventions.

start_time
Time
2019-01-10 06:08:00America/Los_Angeles

Date and 24-hour time of trip start, reported as yyyy-mm-dd hh:mm:ss timezone.

start_local_hour
Integer
12

Hour of day that the trip started, reported in the local time zone in 24-hour time format. Values in this field range between 0 and 23.

end_time
Time
2019-01-10 07:11:04
America/Los_Angeles

Date and 24-hour time of trip end, reported as yyyy-mm-dd hh:mm:ss timezone

end_local_hour
Integer
14

Hour of day that the trip ended, reported in the local time zone in 24-hour time format. Values in this field range between 0 and 23.

duration_minutes
Integer
63

Duration of trip in minutes, calculated as the difference between the trip start_time and end_time.

distance_miles
Float
10.7999733493896

Distance in miles measured along the trip route.

transit_route_ids
String
Capital Metro:803

The ID of the transit route used for a trip. Only applies to trips with mode as “PUBLIC_TRANSIT.”Transit Agency name and Route ID from agency’s GTFS feed.

network_link_ids
String
10541610554341416382

Unique ID assigned to the link (road segment) that the trip is associated with.

vehicle_type
String (Len 20)
PASSENGER_CAR

The vehicle type assigned to the vehicle of the synthetic trip.

Valid values are:
• PASSENGER_CAR
• MEDIUM_COMMERCIAL: Medium-duty commercial vehicles (14,000 – 26,000 lbs)
• HEAVY_ COMMERCIAL: Heavy-duty commercial vehicles (>26,000 lbs)
• MOTORCYCLE

Note: If values are NULL, this means that this data is not available for the region yet but we hope to have it soon.

vehicle_fuel_type
String
DIESEL

The fuel type assigned to the vehicle of the synthetic trip.

Valid values are:

  • GASOLINE
  • DIESEL
  • ELECTRIC
  • NATURAL_GAS
  • HYDROGEN
  • UNKNOWN_FUEL_TYPE

Note: If values are NULL, this means that this data is not available for the region yet but we hope to have it soon.

vehicle_fuel_technology
String
ICE

The fuel technology assigned to the vehicle of the synthetic trip.

Valid values are:

  • ICE
  • BEV
  • FCV
  • PHEV
  • UNKNOWN_FUEL_TECHNOLOGY

Note: If values are NULL, this means that this data is not available for the region yet but we hope to have it soon.

origin_bgrp
String (Len 12)
410510010004

The US Census Bureau-assigned FIPS code of the block group from which the trip originated.

origin_bgrp_lat
Float
45.492294

Latitude of the geometric centroid for the origin block group, reported in decimal degrees, WGS 84

origin_bgrp_lng
Float
-122.65334

Longitude of the geometric centroid for the origin block group, reported in decimal degrees, WGS 84.

destination_bgrp
String (Len 12)
410510079001

US Census Bureau-assigned FIPS of the block group in which the trip ended.

destination_bgrp_lat
Float
45.557407

Latitude of the geometric centroid for the destination block group, reported in decimal degrees, WGS 84.

destination_bgrp_lng
Float
-122.54931

Longitude of the geometric centroid for the destination block group, reported in decimal degrees, WGS 84.

origin_land_use_l1
String
residential

The primary land use category of the trip origin.

Valid values are:

  • residential
  • commercial
  • mixed_usel
  • industrial
  • civic_institutional
  • transportation_utilities
  • open space
  • agriculture
  • other
  • unknown
origin_land_use_l2
String
single_family

The secondary land use category of the trip origin.

Valid values are:
• single_family
• multi_family
• office
• retail
• non_retail_attraction
• education
• healthcare
• military
• civic_institutional
​​• transportation_utilities
• open_space
• agriculture
• other
• unknown

destination_land_use_l1
String
Commercial

The primary land use category of the trip destination.

Valid values are:

  • residential
  • commercial
  • mixed_use
  • industrial
  • civic_institutional
  • transportation_utilities
  • open space
  • agriculture
  • other
  • unknown
destination_land_use_l2
String
non_retail_attraction

The secondary building use category of the trip destination.

Valid values are:

  • single_family
  • multi_family
  • office
  • retail
  • non_retail_attraction
  • education
  • healthcare
  • military
  • civic_institutional​​
  • transportation_utilities
  • open_space
  • agriculture
  • other
  • unknown
origin_building_use_l1
String
residential

The primary building use category of the trip origin.

Valid values are:

  • residential
  • commercial
  • industrial
  • civic_institutional
  • transportation_utilities
  • open space
  • agriculture
  • other
  • unknown
origin_building_use_l2
String
single_family

The secondary building use category of the trip origin.

Valid values are:
• single_family
• multi_family
• office
• retail
• non_retail_attraction
• education
• healthcare
• military
• civic_institutional
​​• transportation_utilities
• open_space
• agriculture
• other
• unknown

destination_building_use_l1
String
commercial

The primary building use category of the trip destination.

Valid values are:

  • residential
  • commercial
  • industrial
  • civic_institutional
  • transportation_utilities
  • open space
  • agriculture
  • other
  • unknown
destination_building_use_l2
String
non_retail_attraction

The secondary building use category of the trip destination.

Valid values are:
• single_family
• multi_family
• office
• retail
• non_retail_attraction
• education
• healthcare
• military
• civic_institutional
​​• transportation_utilities
• open_space
• agriculture
• other
• unknown

The Places population table contains records for each person modeled within the synthetic population produced by Replica. Each record includes a set of attributes describing the person’s demographic and socioeconomic characteristics and the block-group assignment for their work, home, and school locations.

Population data represents the modeled output for each three-month Replica season.

Model inputs include the following products from the U.S. Census Bureau: American Community Survey 5-year estimates, TIGER/Line data, LEHD Origin-Destination Employment Statistics Data, and ACS Public Use Microdata Sample (PUMS), the Census Transportation Planning Products Program (CTPP) as well as data from the  National Center for Education Statistics, US Department of Education, building data and proprietary parcel data and points of interest data.

File Name
Content Type
Sample Value
Description
household_id
String
14102589208980400000

Unique identifier of a household.

person_id
String
14408018579505000000

Unique identifier of a person.

BLOCKGROUP
String
410050201001

The US Census Bureau-assigned FIPS code of the census block group containing the housing unit based on the decennial census 2010.

BLOCKGROUP _work
String
410050227021

The US Census Bureau-assigned FIPS code of the census block group containing the person’s workplace (for employed persons only).

BLOCKGROUP _school
String
410510063001

The US Census Bureau-assigned FIPS code of the census block group containing the person’s school (for students only).

TRACT
String
41005020100

The US Census Bureau-assigned FIPS code of the census tract containing the housing unit.

TRACT_work
String
41005022702

The US Census Bureau-assigned FIPS code of the census tract containing the person’s workplace (for employed persons only).

TRACT_school
String
41051006300

The US Census Bureau-assigned FIPS code of the census tract containing the person’s school (for students only).

age_group
Text
18_24

Age range a person falls within.Value ranges include:

  • lte_4
  • 5_14
  • 15_17
  • 18_24
  • 25_34
  • 35_64
  • 65_plus
age
Integer
19

Age, in years old, assigned to the person.

sex
Text
M

Sex assigned to the person, male (M) or female (F).

race
Text
white

Race assigned to the person.

Valid values include:

  • american_indian_alaska_native
  • asian
  • black_african_american
  • hawaiin_pacific
  • other_race_alone
  • two_or_more_races
  • white
ethnicity
Text
not_hispanic_or_latino

Ethnicity of a person.

Valid values include:
• not_hispanic_or_latino
• hispanic_or_latino

individual_income_group
Text
0_20000

Range a person’s annual income falls within.

Value ranges include:

  • lte_0
  • 0_20000
  • 20000_40000
  • 40000_80000
  • 80000_100000
  • 100000_plus
individual_income
Integer
7500

Total annual income of a person.

employment
Text
employed

Employment status of a person.Valid values include:• employed• notinlf• under_16• unemployed

education
Text
k_12

Education level of a person.

Valid values include:
• advanced_degree
• bachelors_degree
• high_school
• k_12
• no_school
• some_college
• under_3

school_grade_attending
Text
not_attending_school

Current grade level assigned to a person.

Valid values include:
• graduate
• kindergarten
• not_attending_school
• school
• undergraduate

industry
Text
naics51

Industry sector a person works within, reported in NAICS code. For a list of NAICS codes see:

Search NAICS Codes by Industry

household_role
Text
child

The role of the person in the household.

Valid values include:

  • child
  • GQ
  • head_of_household
  • immobile_GQ
  • non_relative
  • relative
  • spouse
subfamily_number
Integer
1

Number assigned to the subfamily that the person belongs to in a household.

subfamily_relationship
Text
parent_with_child

Relationship of a person within the subfamily.

Valid values include:
• child
• husband/wife_no_children
• husband/wife_with_children
• parent_with_child
• N/A

commute_mode
Text
driving

Typical commute mode used by a person.

Valid values include:

  • biking
  • carpool
  • driving
  • not_working
  • transit
  • walking
  • worked_from_home
tenure
Text
owner

Tenure of household.

Valid values include:

  • owner
  • renter
  • GQ
migration
String
same_house

Indicates mobility status (if the household has lived in the same unit 1 year ago).

Valid values include:

  • N/A (Lived in house <1 year; Also applies to group quarters)
  • same_house (non-movers)
  • outside_us
  • within_us
household_size
Text
3_person

Income range that the total income of the household falls within.

Valid values include:

  • 1_person
  • 1_person_group_quarters
  • 2_person
  • 3_person
  • 4_person
  • 5_person
  • 6_person
  • 7_plus_person
household_income_group
Text
125000_plus

Income range that the total income of the household falls within.

Valid values include:

  • lte_10000
  • 10000_40000
  • 40000_75000
  • 75000_125000
  • 125000_plus
household_income
Integer
408500

Total income of the household.

family_structure
Text
married_couple

Household family structure.

Valid values include:
• family_single
• GQ
• living_alone
• married_couple
• nonfamily_single

vehicles
Text
3_plus

Number of vehicles owned by a household.

Valid values include:

  • zero
  • 1
  • 2
  • 3_plus
  • GQ
building_type
String
mobile

Indicates the unit in the household’s structure.

Valid values include:

  • mobile (mobile home, trailer, boat, RV, or van)
  • single_family (one-family home, attached or detached)
  • several_units (2-4 apartments)
  • multiple_units (5+ apartments)
  • GQ_structure (group quarters)
resident_type
Text
core

Indicates whether or not the person is a resident of the region.

Valid values include:

  • Core: A resident of the core region
  • Donut: A resident of the surrounding counties who work or go to school in the core region
  • Visitor: A person who does not normally reside in the core or donut region who enters and/or exits the region via a ‘port of entry’ (airport, major train station)
language
String
english

Indicates the household language.

Valid values include:

  • english
  • spanish
  • indo-european
  • asian-pacific
  • other
  • GQ_language
lat
Latitude
37.434211

Latitude of the block group centroid representing the home location, reported in decimal degrees, WGS 84.

lng
Longitude
-122.16801

Longitude of the block group centroid representing the home location, reported in decimal degrees, WGS 84.

lat_work
Latitude
37.434211

Latitude of the block group centroid representing the work location, reported in decimal degrees, WGS 84 (for employed persons only).

lng_work
Longitude
-122.16801

Longitude of the block group centroid representing the work location, reported in decimal degrees, WGS 84 (for employed persons only).

lat_school
Latitude
37.434211

Latitude of the block group centroid representing the school location, reported in decimal degrees, WGS 84 (for students only).

lng_school
Longitude
-122.16801

Longitude of the block group centroid representing the school location, reported in decimal degrees, WGS 84 (for students only).

The Places network volumes table contains total estimated trip counts by mode for each network link within a given region. The trip counts represent the amount of traffic across the link on a typical Thursday during the given Replica season.

Network links (or, edges) are the streets or pathways upon which vehicles or people can travel. A single street is typically split into multiple segments, with the split points set at street intersections. Two way streets are modeled with separate network links, one for each direction of travel.

Arterial network links include freeways, highways, primary, secondary, and tertiary segments. The functional classification defined for the segments (freeways, highways, primary, secondary, and tertiary) are derived from OpenStreetMap street data. Most residential streets and service lanes are excluded from this dataset.

File Name
Content Type
Sample Value
Description
stableEdgeId
String
16380650950925800000

A unique identifier for the network link.

startLat
Float
34.369077929163232

The latitude of the starting point of the network link, stored in decimal degrees.

startLon
Float
-101.76221098964388

The longitude of the starting point of the network link, stored in decimal degrees.

endLat
Float
34.368660324111438

The latitude of the end point of the network link, stored in decimal degrees.

endLon
Float
-101.765637511737

The longitude of the starting point of the network link, stored in decimal degrees.

streetName
String
Sunrise Passage

The common name of the network link if available. Matches the name assigned by OpenStreetMap.

highway
String
secondary_link

The classification of the link based on OpenStreetMap data.

volume
Integer
323175

Total estimated trip counts on network link within the selected region.


The Places land use table contains a nationwide snapshot of aggregated parcels, their land use, their total built square footage, and their dwelling unit count for the Fall 2021 season. Please note that the built square footage is a modeled representation of total three-dimensional building space. There could be multiple buildings per parcel.

File Name
Content Type
Sample Value
Description
geo_fips
Integer
15323941267251300000

A randomly assigned unique identifier defined for each trip. A trip is defined as travel between two locations with a discrete purpose at the origin and destination. Some intermediate stops with short dwell time may be captured within a single trip. Criteria for this dwell time may vary. Examples:

  • Home > Work
  • Work > Gym
  • Shop > Eat
  • Home > Starbucks drive-thru > Work
name
String
141 (Bronx, NY)

The name of the corresponding row’s geography, as defined by the US Census Bureau or specified by the user for custom geography uploads.

dwelling_units_[category]
Integer
3

The number of dwelling units in the corresponding row’s geography, assigned to single_family, multi_family, or mixed_use buildings.

land_area_total
Integer
1259327

The total area for all parcels in your selected geographies in square feet.

land_area_[category]
Integer
15374

The total parcel area for each land use, in square feet.

Mixed-use parcels are assigned to their estimated primary use.

Primary land use categories are:

  • residential single family
  • residential multifamily
  • commercial retail
  • commercial office
  • commercial non-retail attraction
  • mixed use residential
  • mixed use commercial
  • mixed use industrial
  • mixed use other
  • industrial
  • civic healthcare
  • civic education
  • civic other
  • transportation utilities
  • open space
  • agriculture
  • other
  • unknown
building_area_total
Integer
2965985

The total area for all buildings in your selected geographies in square feet.

building_area_[category]
Integer
75022

The total building area for each building use, in square feet. There are no mixed-use fields – mixed-use buildings have their built area split across all estimated uses.

Building use categories are:
• residential single family
• residential multifamily
• commercial retail
• commercial office
• commercial non-retail attraction
• industrial
• civic healthcare
• civic education
• civic other
• transportation utilities
• open space
• agriculture
• other
• unknown

Win Federal Infrastructure Dollars for your Community

With Replica, public agencies leverage valuable data and insights to support grant funding applications. Federal grant applications almost always require information on equity and existing conditions. WIth Replica data, this insight is easily accessible.

Learn More

Replica data has been used to:

  • Enhance SS4A applications with novel insights about mobility conditions for disadvantaged communities
  • Support NYS DOT Raise Grant applications by providing mode-specific network link volumes and O/D data to inform cost/benefit analyses
  • Enable EV Charging NEVI applications by identifying crucial areas of impact for infrastructure upgrades