The average commute time within each U.S. census division has a large impact on its economy, productivity, infrastructure and environment. Longer commute times cause lost wages for workers with longer commute times, additional wearing of highway infrastructure and environmental impacts. As a result, this study evaluates commute patterns with the NHTS dataset and whether public transportation or additional transportation planning could reduce commute times based on data analysis.
This study outlines the data, methods and results used to identify commute patterns. Specifically, it seeks to answer the following questions:
- Which are the census divisions with the most trips per household?
- What are the average commute distance and time within those divisions?
- Could public transportation or transportation planning reduce commute times?
- What are some recommendations for improving commute times based on demographic data?
This study analyzes the households, trips and vehicles tables of the NHTS dataset to evaluate commute trends by census division. Specifically, the tables were analyzed to evaluate average commute distance and time.
The tools and process listed below were used to analyze data and provide recommendations:
- Jupyter Notebook - Exploratory data analysis and visualization were completed using this notebook.
- Python Modules - The modules listed below will need to be installed in order to run this notebook:
- Pandas
- NumPy
- SciPy
- Seaborn
- Matplotlib
- StatsModels
- Scikit-Learn
Clone repository, then run notebook with Python and Jupyter Notebook.