Research
Demographic and Geographic Characteristics of Green Stormwater Infrastructure Investments in Five U.S. Cities: A Machine Learning–Based Analysis
Dec 4, 2023
Findings from a Machine Learning–Based Analysis of Five U.S. Cities
Data VizPublished Dec 4, 2023
Over the past few decades, stormwater managers across the United States have increasingly turned to green stormwater infrastructure to mitigate flooding and improve the quality of stormwater runoff. Green infrastructure also offers a variety of co-benefits to the surrounding community compared with traditional gray infrastructure, including reduced urban heat island, improved water quality, and enhanced aesthetics.
Many cities have invested in green stormwater infrastructure directly or established incentive programs for property owners and developers to reduce stormwater runoff while providing these co-benefits to their communities. However, after several decades of green infrastructure investment and incentives, the question remains: Has green infrastructure been placed in areas where its residents stand to benefit the most?
A rain garden in Brooklyn, New York.
Photo by Alyson Youngblood/RAND Corporation
This study used an exploratory machine learning–based approach to evaluate green stormwater infrastructure investment across five cities in the United States—Boston, Detroit, New York City, Pittsburgh, and Washington, D.C. We compared the location of installed green stormwater infrastructure with demographic and land use characteristics of the surrounding area to understand whether green stormwater infrastructure is located in areas that, in addition to stormwater reduction, stand to gain from the co-benefits these investments provide.
A summary of the local characteristics that were frequently found in places with more green stormwater infrastructure assets (by number) for each city is shown in the graphic below.
The following tables include a subset of characteristics that had a meaningful association with green stormwater infrastructure according to our model.
Association strength is a normalized measure relative to the strongest obvserved association between any of the characteristics and the amount of green stormwater infrastructure for that city. A value of zero represents no low association, and a value of one represents the strongest association.
Characteristic | Association Strength |
---|---|
Poorer air quality | 1.000 |
More residents with coronary health challenges | 0.699 |
Higher percentage of residents with asthma | 0.457 |
Less tree canopy | 0.349 |
Higher percentage of Hispanic or Latino residents | 0.345 |
Owners that moved in more recently | 0.251 |
Higher redlining score | 0.162 |
Higher median household income | 0.154 |
Older average age of residents | 0.147 |
More residents in poverty | 0.097 |
Higher percentage of residents reporting as other race | 0.088 |
Higher percentage of White residents | 0.068 |
Higher percentage of American Indian and Alaska Native residents | 0.068 |
More residents with mental health challenges | 0.066 |
Higher percentage of residents reporting as two or more races | 0.027 |
Higher percentage of area at risk of future flooding | 0.027 |
Newer residences | 0.018 |
Higher percentage of Native Hawaiian and Other Pacific Islander residents | 0.010 |
Greater economic inequality | 0.003 |
Characteristic | Association Strength |
---|---|
Higher average housing costs | 0.602 |
Greater population density | 0.180 |
Higher percentage of Black or African American residents | 0.126 |
Renters that moved in more recently | 0.075 |
Higher percentage of Asian residents | 0.072 |
Larger number of housing units per building | 0.063 |
Warmer summer mean temperature | 0.051 |
Higher percentage impervious land | 0.017 |
Characteristic | Association Strength |
---|---|
Larger number of housing units per building | 1.000 |
Higher percentage of area at risk of future flooding | 0.451 |
Higher percentage of Hispanic or Latino residents | 0.306 |
Higher percentage of American Indian and Alaska Native residents | 0.288 |
More residents with coronary health challenges | 0.107 |
Higher median household income | 0.097 |
Higher percentage of residents reporting as two or more races | 0.087 |
Higher percentage of residents reporting as other race | 0.087 |
Higher percentage of Black or African American residents | 0.057 |
Higher percentage of Native Hawaiian and Other Pacific Islander residents | 0.049 |
Higher percentage of Asian residents | 0.032 |
Poorer air quality | 0.009 |
Higher redlining score | 0.006 |
Characteristic | Association Strength |
---|---|
Higher percentage impervious land | 0.514 |
Greater population density | 0.501 |
Greater economic inequality | 0.240 |
Owners that moved in more recently | 0.223 |
Newer residences | 0.204 |
Renters that moved in more recently | 0.193 |
Warmer summer mean temperature | 0.188 |
Higher average housing costs | 0.187 |
More residents with mental health challenges | 0.164 |
Less tree canopy | 0.112 |
Higher percentage of White residents | 0.034 |
More residents in poverty | 0.013 |
Higher percentage of residents with asthma | 0.008 |
Older average age of residents | 0.005 |
Characteristic | Association Strength |
---|---|
Warmer summer mean temperature | 1.000 |
Higher percentage of Hispanic or Latino residents | 0.384 |
Higher percentage of residents reporting as two or more races | 0.195 |
Higher percentage of Black or African American residents | 0.153 |
Higher percentage of American Indian and Alaska Native residents | 0.081 |
Higher percentage of Native Hawaiian and Other Pacific Islander residents | 0.064 |
Higher median household income | 0.057 |
Higher percentage of Asian residents | 0.047 |
More residents with mental health challenges | 0.041 |
Higher percentage of residents reporting as other race | 0.036 |
Higher percentage impervious land | 0.035 |
Older average age of residents | 0.029 |
Greater economic inequality | 0.028 |
Renters that moved in more recently | 0.020 |
Higher redlining score | 0.018 |
Higher percentage of area at risk of future flooding | 0.014 |
Higher average housing costs | 0.010 |
Less tree canopy | 0.001 |
Characteristic | Association Strength |
---|---|
Poorer air quality | 0.438 |
More residents with coronary health challenges | 0.362 |
Higher percentage of residents with asthma | 0.284 |
Larger number of housing units per building | 0.277 |
Greater population density | 0.086 |
More residents in poverty | 0.056 |
Owners that moved in more recently | 0.053 |
Higher percentage of White residents | 0.046 |
Newer residences | 0.006 |
Characteristic | Association Strength |
---|---|
Newer residences | 1.000 |
Higher percentage of Asian residents | 0.425 |
Larger number of housing units per building | 0.321 |
Higher percentage of American Indian and Alaska Native residents | 0.172 |
Less tree canopy | 0.156 |
Higher percentage of area at risk of future flooding | 0.151 |
Owners that moved in more recently | 0.148 |
Higher percentage of residents reporting as two or more races | 0.109 |
Higher redlining score | 0.073 |
Older average age of residents | 0.035 |
Higher average housing costs | 0.021 |
Greater population density | 0.016 |
Higher percentage of Black or African American residents | 0.012 |
Higher percentage impervious land | 0.011 |
More residents with mental health challenges | 0.009 |
Higher percentage of residents with asthma | 0.008 |
Higher percentage of White residents | 0.003 |
Greater economic inequality | 0.003 |
Higher percentage of Native Hawaiian and Other Pacific Islander residents | 0.003 |
Higher percentage of residents reporting as other race | 0.002 |
Poorer air quality | 0.001 |
Characteristic | Association Strength |
---|---|
Renters that moved in more recently | 0.063 |
More residents with coronary health challenges | 0.051 |
Warmer summer mean temperature | 0.038 |
Higher percentage of Hispanic or Latino residents | 0.033 |
More residents in poverty | 0.004 |
Higher median household income | 0.001 |
Characteristic | Association Strength |
---|---|
Newer residences | 1.000 |
Higher percentage of residents reporting as other race | 0.091 |
Warmer summer mean temperature | 0.068 |
Higher median household income | 0.064 |
Owners that moved in more recently | 0.054 |
Older average age of residents | 0.052 |
Higher percentage of Black or African American residents | 0.050 |
Higher percentage of residents reporting as two or more races | 0.043 |
Higher percentage of Hispanic or Latino residents | 0.022 |
Poorer air quality | 0.019 |
Higher percentage impervious land | 0.011 |
Higher average housing costs | 0.011 |
Higher percentage of residents with asthma | 0.011 |
Higher percentage of Native Hawaiian and Other Pacific Islander residents | 0.006 |
Higher percentage of area at risk of future flooding | 0.006 |
Characteristic | Association Strength |
---|---|
Greater population density | 0.790 |
Greater economic inequality | 0.257 |
Larger number of housing units per building | 0.174 |
Renters that moved in more recently | 0.116 |
Less tree canopy | 0.063 |
More residents with coronary health challenges | 0.061 |
Higher percentage of White residents | 0.043 |
More residents with mental health challenges | 0.035 |
More residents in poverty | 0.019 |
Higher percentage of Asian residents | 0.005 |
Higher percentage of American Indian and Alaska Native residents | 0.004 |
This graphic presents the findings of our machine–learning based analysis. To carry out this work, we collected city-level data on the types, sizes, and locations of existing investments to understand how much and where cities had installed green stormwater infrastructure. We also collected information to understand local characteristics—or the socioeconomic, geographic, and physical landscape—surrounding these investments in each city.
This information included data, for example, on income, race, climate, and infrastructure condition. These datasets were cleaned, combined, and used as inputs to a variety of popular machine learning methods. Using seven different machine learning methods,[1] a model was trained to quantify the relationship between green stormwater infrastructure and local demographic and geographic characteristics. With this approach, we were also able to compare the goodness of fit for each model and examine the strength of the association between green stormwater infrastructure and each local characteristic.
The graphic above shows the results for the Random Forest model, which generally performed well across all cities. More information on the study, datasets, and our quantitative approach can be found in our full report.
Alyson Youngblood (design) and Shawna Templeton (production)
This visualization is based on research by Michelle Miro and Susan Resetar.
Funding for this research was provided by gifts from RAND supporters and income from operations and conducted in the Community Health and Environmental Policy Program within RAND Social and Economic Well-Being.
This publication is part of the RAND visualization series. RAND visualizations present graphical or interactive views of data and information from a published, peer-reviewed product or a body of published work.
This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited; linking directly to this product page is encouraged. Permission is required from RAND to reproduce, or reuse in another form, any of its research documents for commercial purposes. For information on reprint and reuse permissions, please visit www.rand.org/pubs/permissions.
RAND is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.