Thursday, February 11, 2021

New York City/New Jersey subway average PM2.5 levels are 65 times greater than EPA standards, worse than the worst Chinese city

PM2.5 Concentration and Composition in Subway Systems in the Northeastern United States. David G. Luglio, Maria Katsigeorgis, Jade Hess, Rebecca Kim, John Adragna, Amna Raja, Colin Gordon, Jonathan Fine, George Thurston, Terry Gordon, and M.J. Ruzmyn Vilcassim. Environmental Health Perspectives, February 10 2021. https://doi.org/10.1289/EHP7202

Abstract

Objectives: The goals of this study were to assess the air quality in subway systems in the northeastern United States and estimate the health risks for transit workers and commuters.

Methods: We report real-time and gravimetric PM2.5 concentrations and particle composition from area samples collected in the subways of Philadelphia, Pennsylvania; Boston, Massachusetts; New York City, New York/New Jersey (NYC/NJ); and Washington, District of Columbia. A total of 71 stations across 12 transit lines were monitored during morning and evening rush hours.

Results: We observed variable and high PM2.5 concentrations for on-train and on-platform measurements during morning (from 0600 hours to 1000 hours) and evening (from 1500 hours to 1900 hours) rush hour across cities. Mean real-time PM2.5 concentrations in underground stations were 779±249, 548±207, 341±147, 327±136, and 112±46.7 μg/m3 for the PATH-NYC/NJ; MTA-NYC; Washington, DC; Boston; and Philadelphia transit systems, respectively. In contrast, the mean real-time ambient PM2.5 concentration taken above ground outside the subway stations of PATH-NYC/NJ; MTA-NYC; Washington, DC; Boston; and Philadelphia were 20.8±9.3, 24.1±9.3, 12.01±7.8, 10.0±2.7, and 12.6±12.6 μg/m3, respectively. Stations serviced by the PATH-NYC/NJ system had the highest mean gravimetric PM2.5 concentration, 1,020 μg/m3, ever reported for a subway system, including two 1-h gravimetric PM2.5 values of approximately 1,700 μg/m3 during rush hour at one PATH-NYC/NJ subway station. Iron and total carbon accounted for approximately 80% of the PM2.5 mass in a targeted subset of systems and stations.

Discussion: Our results document that there is an elevation in the PM2.5 concentrations across subway systems in the major urban centers of Northeastern United States during rush hours. Concentrations in some subway stations suggest that transit workers and commuters may be at increased risk according to U.S. federal environmental and occupational guidelines, depending on duration of exposure. This concern is highest for the PM2.5 concentrations encountered in the PATH-NYC/NJ transit system. Further research is urgently needed to identify the sources of PM2.5 and factors that contribute to high levels in individual stations and lines and to assess their potential health impacts on workers and/or commuters.

Discussion

Our measurements and analyses reveal variable and, in places, very high PM2.5 exposures of commuters and transit workers in the underground subway systems of northeastern U.S. cities. The most extreme exposure, identified in a subway station on the PATH system (serving NJ and NYC), was higher than the previously published values for any subway station in the world (Martins et al. 2016Moreno et al. 2017Qiu et al. 2017Van Ryswyk et al. 2017Xu and Hao 2017Lee et al. 2018Minguillón et al. 2018Mohsen et al. 2018Moreno and de Miguel 2018Choi et al. 2019Loxham and Nieuwenhuijsen 2019Pan et al. 2019Shen and Gao 2019Velasco et al. 2019Smith et al. 2020), with a mean gravimetric PM2.5 concentration greater than 1,000μg/m3PM2.5 (Figure 1). The MTA-serviced subway stations in Manhattan also had poor air quality, with an adjusted real-time mean±SDPM2.5 concentration of 548±207μg/m3.

Our particle measurements were similar to those measured previously in the MTA-NYC stations with high PM2.5 levels (Vilcassim et al. 2014) and much greater than aboveground ambient PM2.5 levels [it must be noted that the MTA-NYC subway stations monitored in the present study were a biased sample and chosen based on the high PM2.5 levels in the Vilcassim study (2014)]. Thus, during rush hour, the underground subway stations targeted in the NYC/NJ’s MTA and PATH subway systems had significantly worse air quality, in terms of PM2.5, than the targeted subway stations in Boston, Philadelphia, and Washington, DC. Philadelphia’s subway stations, for example, had better air quality, although the mean real-time PM2.5 concentration was still several fold greater than the mean ambient PM2.5 concentration measured outside the Philadelphia subway stations. In addition, we cannot rule out spurious differences due to uncontrolled sources of variation related to sampling. However, our findings clearly indicate that PM2.5 concentrations in underground stations and measured on subway trains are much greater than aboveground ambient PM2.5 levels, at least during rush hour periods. In addition, we measured extremely high concentrations in individual underground stations in the MTA (NYC) and PATH (NYC/NJ) subway systems that, even if they represent extreme levels for these stations, raise serious health concerns and warrant additional investigation. In addition, underground PM2.5 concentrations were consistently higher than mean ambient PM2.5 concentrations. Thus, our findings suggest that, at least in the northeastern U.S. transit systems included in our study, commuters are exposed to poor air quality during their time spent in underground subway stations. Moreover, exposures in at least some underground stations may be high enough to increase the risk of the adverse health effects associated with PM2.5, even if they occur for relatively short periods of time.

It should be noted that most subway air pollution studies have relied on real-time data collected with light scattering instruments (Xu and Hao 2017) that have been factory calibrated, in the traditional manner, with Arizona road-dust (Curtis et al. 2008Wang et al. 2016). Despite their many advantages (e.g., real-time data, autocorrection for temperature and RH), the output of real-time PM2.5 instruments can be affected by particle composition, shape, and water content, all physical factors that will variably affect light scattering. In the present study, we compared real-time and gravimetric PM2.5 concentrations during simultaneous 30- to 60-min sampling sessions conducted in the targeted subway systems (except MTA-NYC or the LIRR) and found, overall, that gravimetric values were 2 4 times greater than what was measured with the real-time light scattering device. This ratio is much higher than what has been reported for other environments and dust types (Wu et al. 2005Wang et al. 20162018Patts et al. 2019), and this difference is most likely due to the large (e.g., as high as 60% of the total PM2.5 mass) contribution of iron, a dense metal, to the airborne PM2.5 in the targeted subway systems. Therefore, we adjusted our real-time PM2.5 data with a correction factor. Thus, this real-time/gravimetric ratio issue should be considered when interpreting health risks using published data from air quality studies of subway systems conducted throughout the world. Note that most of the samples collected at underground stations in the present study were selected because they had the highest estimated real-time PM2.5 concentrations in each system.

One of the highest unadjusted real-time mean subway system concentrations previously reported was 265μg/m3 in Suzhou, China (Cao et al. 2017), whereas Seaton et al. (2005) and Smith et al. (2020) observed real-time, dust-type calibrated PM2.5 concentrations in a few stations in London, UK, that approached what was observed in PATH stations (Table S2), with a maximum 30-min mean concentration of 480μg/m3 at one London station. Notably, Smith et al. (2020) observed a single 1-min peak of 885μg/m3. The high pollution levels measured in London’s subways did not reach the upper range of the PM2.5 levels in the PATH subway stations and particularly in the Christopher Street Station, which had a maximum 1-h gravimetric PM2.5 concentration of 1,780μg/m3 during rush hour. The gravimetric PM2.5 concentrations measured at Newport Station, however, were more consistent with the peak values estimated in Smith et al. (2020). Comparison of our underground and ambient PM2.5 data strongly suggests that ambient PM2.5 is not a likely source of the high PM2.5 levels observed in NYC’s underground subway stations and that other sources such as the continual grinding of the train wheels against the rails, the electricity-collecting shoes, and diesel soot emissions from maintenance locomotives are important sources.

The contribution of TC to the PM2.5 mass concentration varied considerably among the two underground stations sampled at each of three transit systems (Table 3). TC constituted 6% of the particle composition in the PATH-NYC/NJ stations, whereas it composed 39% and 22% of PM2.5 in Boston and Washington, DC, respectively (Figure S4). Even within a single urban transit system, the TC concentrations varied between stations as was observed in Boston’s Government Center–Blue Line (177μg/m3) and Broadway stations (70.6±24.7μg/m3), albeit based on one and two samples, respectively. Broadway is an older station on the Red line, and Government Center is a much larger station with separate Blue and Green Line platforms and was renovated in the summer of 2016. Notably, TC, made primarily of the estimated OC component, dominated the Government Center–Blue Line aerosol, although the significance of this is unclear and further investigation into the sources of PM2.5 and the role of the mechanical design (e.g., ventilation) of each station is needed. Notably, there was relatively little EC (or the roughly equivalent BC2.5) present in any of the six underground subway stations, an unexpected finding given the emphasis that multiple papers (Vilcassim et al. 2014Choi et al. 2019) have placed on inorganic carbon species. A plausible source of EC would be diesel combustion in subway systems, for example, from diesel maintenance trains that operate in the MTA system. However, these trains are typically active only at night, and therefore their contribution to the composition of subway PM2.5 is unclear.

Iron accounted for the largest fraction of PM2.5 in the targeted subway stations, and frictional forces between the train wheels and rails and collection shoes and the third rail may account for this finding. The relative concentration of other elements was observed to vary among subway systems, suggesting that other sources (e.g., silicon as a marker for crustal material; arsenic as a marker for rodenticides) contribute to the airborne particles encountered by transit workers and commuters in subway stations. A previous report on PM composition in MTA stations in Manhattan agrees with the present findings. Chillrud et al. (2004) found similar ratios of iron/manganese, and chromium/manganese concentrations (i.e., components of different grades of steel), although some of the trace element concentrations in the present study are many times higher than those reported by Chillrud et al. Although other studies have documented low concentrations of noniron and carbon elements in subways (Minguillón et al. 2018Lee et al. 2018), results in Shanghai, China, found that aluminum, silicon, and calcium made up more than 30% of PM2.5 (Lu et al. 2015a), suggesting that ambient soil particles can contribute to subway PM. Similarly, in Beijing subways, the iron concentration was outweighed by aluminum, potassium, sodium, calcium, magnesium, zinc, and barium (Pan et al. 2019). Thus, significant differences in PM composition exist among the underground subway systems across the globe, and it is likely that these differences are a result of source contributors that vary among systems.

Our results demonstrate considerable variability regarding the air quality that transit workers and commuters may encounter in the subway stations of the major cities in northeastern United States. Not only does the PM2.5 concentration vary among stations and cities, but the elemental composition of PM2.5. Previous studies have demonstrated that underground depth (Vilcassim et al. 2014Figueroa-Lara et al. 2019), station volume, age (Van Ryswyk et al. 2017), and ventilation (Martins et al. 2016) all affect aerosol loading. Therefore, these subway system- and station-dependent differences were not unexpected in the present study. It is interesting to note that Martins et al. showed that more recently built stations do not necessarily have better air quality: The stations established in 2002 and 2009 in Oporto, Portugal, and Athens, Greece, had higher PM2.5 aerosol concentrations than a station built in 1983 in Barcelona, Spain. Nevertheless, there is evidence that common methods of reducing airborne PM are effective (Park et al. 2019), such as cleaning stations more often (Chen et al. 2017), improving ventilation (Moreno et al. 20142017), using particle removal systems, and installing shields that confine track-generated particles from the boarding passengers (Guo et al. 2014).

Study Limitations

In the present study, there were relatively few interline differences in air quality among the subway lines within each city (Figure S2 and Table S1). One exception was the Blue line, which exhibited the highest PM2.5 concentration of the three targeted Boston subway lines. Because the Blue line is the most recently built of the three targeted Boston subway lines and would presumably have the best ventilation design (based on our subjective observations), this finding was unexpected. A limitation of this observation, however, is that we sampled air quality in two Blue line stations that were high train-traffic areas. A similar intrasubway system difference was, however, also observed in NYC/NJ’s PATH system, where the mean PM2.5 level on the 33rd Street line was significantly greater than the World Trade Center line serviced by the PATH. As noted above, we did not collect information on all potential factors that might explain differences in air quality among different stations, subway lines, or transit systems.

We compared several transit systems using data collected at similar times of the day and generally within the same season. Thus, we have not cataloged the total potential variation of PM2.5 in each system. In particular, we sampled during a small number of days in Boston and Washington, DC, and our data generally represent only summer conditions. Thus, we do not know if the subway air quality changes significantly over season or time, although Van Ryswyk et al. (2017) have shown that the Toronto, Ontario, Canada, metro had higher PM2.5 concentrations than Montréal, Quebec, regardless of season. Another study limitation is that our PM2.5 and BC2.5 sample sizes for each station were relatively small (n=4 for most stations). Although this study design allowed us to compare subway systems, we lacked sufficient power to compare PM2.5 concentrations among individual station platforms within a city’s transit system. Nonetheless, certain stations were clearly more polluted than others. PM2.5 concentrations in the underground Christopher Street (PATH-NYC/NJ) were the highest among the 71 northeastern U.S. underground stations included in our study, and to our knowledge, were higher than any levels reported for any subway system across the globe (Martins et al. 2016Moreno et al. 2017Qiu et al. 2017Van Ryswyk et al. 2017Xu and Hao 2017Lee et al. 2018Minguillón et al. 2018Mohsen et al. 2018Moreno and de Miguel 2018Choi et al. 2019Loxham and Nieuwenhuijsen 2019Pan et al. 2019Shen and Gao 2019Velasco et al. 2019Smith et al. 2020).

In addition, it must be noted that the methodologies used to assess BC, OC, and EC were developed for measurement of PM composition collected under ambient outdoor conditions. The measurement of these carbon components was hampered, however, by the presence of large amounts, relative to ambient PM, of iron compounds. As demonstrated by other investigators, the dark color of some iron compounds can interfere with the reflectance measurement of BC, and the chemistry of the iron compounds found in subway PM shifts the transition demarcation of OC and EC in the thermal ramp used by the Sunset Instrument analyzer. Therefore, we chose to present our data as TC (total carbon) concentrations to avoid the latter limitation. Regardless, the potential for underestimation of OC (i.e., caused by high levels of iron in PM2.5 collected on quartz filters in the subways; Figure S4) does not lessen the importance of OC as a major component of the PM2.5 collected in several subway stations.

Implications

The key issue with underground subway exposures is whether there is a significant increase in the risk for adverse cardiovascular and respiratory outcomes, given the well-documented association between PM2.5 and adverse health effects. With one notable exception, the PM2.5 concentrations measured in subway stations during morning and evening rush hours were generally 2 to 7 times the U.S. EPA’s 24-h ambient air standard of 35μg/m3. The one exceptional station (Christopher Street Station) on the PATH subway line connecting NJ to lower Manhattan had a maximum 1-h PM2.5 concentration of 1,780μg/m3, with a mean gravimetric concentration of 1,254μg/m3 (n=4) (Table S3). If we assume that commuters are exposed to this level of PM2.5 for a typical 15-min total time (from/to home) spent on a subway platform and at 100μg/m3 for two 20-min rides on the PATH subway trains each day, then a commuter’s 24-h mean PM2.5 exposure concentration would increase from an assumed daily mean of 7.7μg/m3 (for NYC metropolitan area; U.S. EPA 2020b) to 26.1μg/m3. Given an association of a 6% increase in relative risk for each 10μg/m3 increase in long-term (e.g., annual average) PM2.5 (Pope et al. 2004), this exposure scenario suggests that a typical commuter would be at an 11% increase in risk for cardiovascular mortality. However, this calculation assumes that the toxicity of underground subway PM2.5 is similar to that of ambient PM, which is uncertain in the absence of much-needed subway–health studies. It must be emphasized that this increase in individual risk for daily commuters differs in comparison to that for transit workers who spend considerably longer periods of time on the subway platforms (e.g., 8-h work shifts). The impact of exposure on transit workers is unclear because although they would be exposed to significantly greater PM2.5 accumulated doses (i.e., increased exposure time and breathing rates), workers are often considered “healthy,” and the most relevant applicable occupational exposure guidelines are for larger-diameter respirable dust, defined as PM4.0 (OSHA’s Permissible Exposure Limit of 5mg/m3 and ACGIH’s threshold limit value of 3mg/m3). In conclusion, these findings of poor air quality in subway systems should prompt further investigation as to the levels, sources, composition, and human health effects of the PM2.5 pollution in subway systems. However, even in the absence of such data, the results of our study already indicate that the Precautionary Principal (Science for Environment Policy 2017) would call for mitigation efforts, such as improved ventilation to protect the tens of thousands of subway workers and millions of daily commuters from potentially unwarranted health risks.

What people “see” with retinal prostheses? Fundamentally, qualitatively different than natural vision, all invoked electrical stimuli to describe the appearance of their percepts; the process never ceased to be cognitively fatiguing

Erickson-Davis C, Korzybska H (2021) What do blind people “see” with retinal prostheses? Observations and qualitative reports of epiretinal implant users. PLoS ONE 16(2): e0229189. https://doi.org/10.1371/journal.pone.0229189

Abstract

Introduction: Retinal implants have now been approved and commercially available for certain clinical populations for over 5 years, with hundreds of individuals implanted, scores of them closely followed in research trials. Despite these numbers, however, few data are available that would help us answer basic questions regarding the nature and outcomes of artificial vision: what do recipients see when the device is turned on for the first time, and how does that change over time?

Methods: Semi-structured interviews and observations were undertaken at two sites in France and the UK with 16 recipients who had received either the Argus II or IRIS II devices. Data were collected at various time points in the process that implant recipients went through in receiving and learning to use the device, including initial evaluation, implantation, initial activation and systems fitting, re-education and finally post-education. These data were supplemented with data from interviews conducted with vision rehabilitation specialists at the clinical sites and clinical researchers at the device manufacturers (Second Sight and Pixium Vision). Observational and interview data were transcribed, coded and analyzed using an approach guided by Interpretative Phenomenological Analysis (IPA).

Results: Implant recipients described the perceptual experience produced by their epiretinal implants as fundamentally, qualitatively different than natural vision. All used terms that invoked electrical stimuli to describe the appearance of their percepts, yet the characteristics used to describe the percepts varied significantly between recipients. Artificial vision for these recipients was a highly specific, learned skill-set that combined particular bodily techniques, associative learning and deductive reasoning in order to build a “lexicon of flashes”—a distinct perceptual vocabulary that they then used to decompose, recompose and interpret their surroundings. The percept did not transform over time; rather, the recipient became better at interpreting the signals they received, using cognitive techniques. The process of using the device never ceased to be cognitively fatiguing, and did not come without risk or cost to the recipient. In exchange, recipients received hope and purpose through participation, as well as a new kind of sensory signal that may not have afforded practical or functional use in daily life but, for some, provided a kind of “contemplative perception” that recipients tailored to individualized activities.

Conclusion: Attending to the qualitative reports of implant recipients regarding the experience of artificial vision provides valuable information not captured by extant clinical outcome measures.


Discussion

We undertook ethnographic research with a population of retinal prosthesis implant recipients and vision rehab specialists, documenting the process of getting, learning to use and living with these devices.

We found that the perceptual experience produced by these devices is described by recipients as fundamentally, qualitatively different than natural vision. It is a phenomenon they describe using terms that invoke electric stimuli, and one that is ambiguous and variable across and sometimes within recipients. Artificial vision for these recipients is a highly specific learned skillset that combines particular bodily techniques, associative learning and deductive reasoning to build a “lexicon of flashes”—a distinct perceptual vocabulary—that they then use to decompose, recompose and interpret their surroundings. The percept does not transform over time; rather, the recipient can better learn to interpret the signals they receive. This process never ceases to be cognitively fatiguing and does not come without risk nor cost to the recipient. In exchange recipients can receive hope and purpose through participation, as well as a new kind of sensory signal that may not afford practical or functional use in daily life, but for some provides a kind of “contemplative perception” that recipients tailor to individualized activities. We expand on these findings below to explore what they mean in terms of the development and implementation of these devices, as well as for our understanding of artificial vision as a phenomenon.

What does it mean that the recipients describe artificial vision as being fundamentally, qualitatively “different” than natural vision? We believe that acknowledging that artificial vision is a unique sensory phenomenon might not only be more accurate, but it may also open up new avenues of use for these devices. Artificial vision may be considered as “visual” in terms of being similar to what recipients remember of the experience of certain kinds of light, as well as by offering the possibility of being able to understand features of the environmental surround at a distance. That being said, artificial vision was also described as both qualitatively and functionally different than the “natural” vision the recipients remember. It is in this way that the sensory experience provided by these devices could be viewed as less a restoration or replacement and more as a substitution; that is, as offering an entirely different or novel sensory tool. By shifting from the rhetoric of replacement or restoration to substitution we believe it could widen the bounds in which researchers and rehabilitation specialists think and operate with regard to how these devices are designed and implemented, potentially liberating a whole new spectrum of utility through the novel sensations these devices produce. Likewise, this shift could change the expectations of individuals receiving these devices, including addressing the initial disappointment that was expressed by many of our recipients when they encountered just how different the signals were to what they were expecting.

Second, acknowledging artificial vision as a unique sensory phenomenon also helps us understand the importance of qualitative description. The process of learning to use the device is a cooperative process between the rehabilitation specialist and the recipient, with the specialist guiding the recipient to attend to their perceptual experience and interpret it in specific ways. This process begins with the recipient learning to recognize how the basic unit of artificial vision—the phosphene—appears for them, and then describe that to the rehab specialist. The specialist then uses this information to guide the recipient in learning how the phosphenes correspond to features of the environment. It is a continuous and iterative communicative practice between the recipient and specialist that evolves over many months, during which stimuli are encountered, the recipient responds, and the specialist gives corrective or affirming feedback (with more or less description by the recipient and guidance by the specialist depending on the dynamic and need). The process is so specific to the dynamic between recipient and specialist that it can be considered to be “co-constructed” within their interactions.

Because each recipients’ qualitative experience is so distinct (phosphenes differ significantly between recipients so that no recipients’ perceptual experience is alike [60]) each process is tailored to the individual recipient by specific specialists. We found that certain vision rehabilitation specialists inquire in more depth about a recipient’s qualitative experience than others, using different methods, styles and techniques, and this can result in a different experience—and thus outcome—for the recipients. Our findings are based on reports captured either by directly asking the recipients about their experience or observing descriptions that were part of the rehabilitation process but that were by and large not recorded by the specialists nor relayed back to the companies, early stage researchers nor the individuals being implanted. That is, we found that there is no protocol in place for capturing or sharing recipient’s qualitative reports, including within the companies (between various clinical sites). Yet these kinds of data are essential to understanding these devices as well as in learning about artificial vision more generally, and thus deserve careful consideration by both researchers and clinicians who are developing and implanting these and similar devices, as well as to individuals and their families who are considering receiving them.

The better vision rehabilitation specialists are able to understand the recipient’s qualitative experience, the more they are enabled to assist them in learning to use the device. The more that early-stage researchers understand about how the parameters of the device correspond to perceptual experience, the better they are able to optimize design and implementation strategies. Finally, communicating these data to individuals and their families who are considering being implanted is essential. It would contribute to a more accurate understanding of the qualitative experience and process they are signing on for, and thus is an important part of informed consent. It would also help to address certain psychosocial difficulties we found recipients to experience. For instance, we found that the recipient’s percept does not change over time—that instead the recipient becomes better able at interpreting the signal received using cognitive techniques. It is a subtle distinction, but a profound one in terms of conditioning expectations around these devices—both of the researchers and the recipients. We found that current rhetoric employed by researchers and vision rehab specialists regarding neuroplasticity and the ability for recipients to transform the signal with enough practice has created a situation in which failure of the recipient to significantly transform the signal over time is perceived as a failure of the recipient (behaviorally, where the recipient is deemed to have insufficiently practiced using of the device, and/or physiologically, where the problem is located within the recipient’s eye or visual system). By shifting the expectation that it is not the percept itself, but the recipient’s ability to use the percept over time that can improve, one can potentially avoid and address the psychosocial distress that we found some recipients experienced as a result.

This study had several limitations, first and foremost the number of recipients limited by small study populations and availability of recipients. Future studies of these devices would do well to include similar qualitative reports from recipients, either as primary focus or as supplement to other outcome measures (i.e. as “mixed methods” studies that combine qualitative and quantitative methodologies). In addition, qualitative reports are only one type of data and are not meant to replace other forms of data being collected on these devices. Rather, we believe they deserve special attention because they have been heretofore neglected in the literature despite their potential to provide valuable information not captured by normative functional outcome measures. Qualitative data about recipients’ perceptual experience can both inform device design and rehabilitative techniques, as well as grant a more holistic understanding of the phenomenon of artificial vision. In addition to contributing to the larger body of work on visual prostheses, this study serves as a case example of the kind of data mobilized by qualitative, ethnographic methodology—in particular phenomenological inquiry—in study of brain machine interface devices.