E91: Saving federal data with Jonathan Gilmour - podcast episode cover

E91: Saving federal data with Jonathan Gilmour

May 30, 202531 minEp. 91
--:--
--:--
Listen in podcast apps:
Metacast
Spotify
Youtube
RSS

Summary

Molly Wood talks with Jonathan Gilmour of Harvard about the urgent effort to save crucial federal climate and environmental data being dismantled by the US government. They discuss why this data is essential for forecasting extreme weather, guiding public health, informing policy, and underpinning climate tech startups. Gilmour details the work of the Public Environmental Data Partners in preserving data, rebuilding tools, and advocating for this vital public resource.

Episode description

This week on Everybody in the Pool, we're taking a deep dive into the unsung hero of climate solutions: federal data. This crucial data helps predict extreme weather, guide public health responses, and by the way, underpins a whole lot of existing and future climate tech startups. But the US government is pulling the plug on long-standing climate infrastructure, and all kinds of groups are scrambling to protect it. Molly talks with Jonathan Gilmour, a data scientist from Harvard, who’s on a mission to protect this precious resource. He's part of a team working tirelessly to keep these datasets safe and accessible for all the researchers, startups, and policymakers who need them. If you're curious about how federal data impacts everything from insurance rates to groundbreaking tech innovations, this episode is a must-listen. Discover why protecting this data is not just important—it's essential for our future.


LINKS:


Please subscribe and tell your friends about Everybody in the Pool! Send feedback or become a sponsor at [email protected]!


Hosted on Acast. See acast.com/privacy for more information.

Transcript

Really? Come on, let's go. Hear that? That's the sound of my customers leaving. When your broadband doesn't work, neither does your business. Will Sky Business keep me up and running? No matter what. With 4G backup and our Stay Connected guarantee, better believe it. Let our dedicated Sky Business team handle your switch today. That's not like it.

Welcome to Everybody in the Pool, the podcast where we dive deep into the innovative solutions and the brilliant minds who are tackling the climate crisis head on. I'm Molly Wood. This week something a little different. We're digging down to the foundational layer of what makes a lot of climate solutions possible. It's the thing that lets us adapt and evolve into a warming world, literally forecasts what's coming in terms of extreme weather exacerbated by climate.

That thing is data, specifically federal data. As we tape this, the federal government here in the US is dismantling much of the infrastructure built to deal with tracking climate change. Also built to deal with understanding its impacts on various populations, responding to disasters, even in some cases forecasting the actual weather. Leaving aside the fact that none of this will actually make climate change go away.

The thing is, researchers, scientists, nonprofits, city and state governments, and yes, entrepreneurs all need this data to set policy, allocate money, build infrastructure, and create new companies. Like, imagine... If Uber, the prototype of the startup that every investor wishes they could invest in, had had to build Uber without taxpayer funded satellites and the GPS technology that was invented by the federal government.

let alone like the internet and roads. But no data, no Uber. And if you're in the climate tech world, that's what's happening now. But forewarned is forearmed, and groups of people who saw these moves coming have mobilized to save the data. Here's one of them. My name is Jonathan Gilmore. I'm a data scientist at the Harvard T.H. Chan School of Public Health. I work in a lab that does research on climate and environmental impacts to health, to human health.

And I'm also one of the coordinators of the Public Environmental Data Partners, which is a group that we stood up in November, December 2024 to safeguard. data and tools that the federal government has created with taxpayer money to benefit all of us that were at risk following the election of Donald Trump. Great. Amazing.

Let's back up a little bit to the, let's start with safeguarding the data. And then I'm going to ask you more about you. Talk about identifying this threat. You know, you said we stood this organization up. in november december right after the election because we knew we would have to safeguard this federal data so there's a bunch of components there maybe let's start with which data is at risk yeah absolutely so initially

We knew from the first Trump administration that climate and environmental data would be at risk. We saw threats to climate and environmental data and research and communication to the public. back from 2017 to 2021. And so we knew that it was coming. Also, Project 2025 was pretty explicit in how... climate and environment work might be might be treated under a trump administration there are questions about how close was project 2025 to the trump campaign but we see now um that the campaign

regardless of distancing itself to the project has, the administration has instituted a whole bunch of the aims of the project, especially around climate and environment. And what were those? If you wouldn't mind backing up just to, I mean, it was. Project 2025 was very explicit on these topics in a way that I think people don't necessarily realize.

Absolutely. I mean, it was a roadmap for dismantling our climate and environmental protections. In addition to, it was this absolutely... giant plan for completely reformulating the way that our government works and dismantling lots of the protections for Americans, not just in terms of climate and environment, but um also health protections um it it represented a thesis for how government should work that was a

huge departure from the way that the government had worked for Americans up through January of this year. Right. So as part of that, certainly there was... You talk about the dismantling of environmental protections and agencies, and we've seen huge cuts, of course, to NOAA and reshaping of the EPA. But what is it about the data specifically? Did it also call for – what can you tell us in terms of the plans for the information? It's sort of like –

So then it was like, delete it all or rewrite it, which we did see during the first Trump administration. Yeah. So there were some calls to privatize certain functions of government. an effort to privatize the work of the National Weather Service. This has been sort of a rallying cry in far-right libertarian circles for a while. That the federal government should not pay for weather forecasting? Yeah, and that the functions of the National Weather Service should be privatized. And this...

is sort of absurd. I mean, every weather forecast in the states relies on government data, on federal data. And it's one of the most common ways that Americans interact with... federal data. There was a really interesting poll that came out a couple weeks ago that showed that 90% Americans use federal data on a weekly basis, but only 10% are concerned that removals of scientific data and weather data and food safety data

employment data, that that might impact them. And I'm really curious about how those questions were formulated. I don't think that each function was enumerated, but that's a huge, huge discrepancy. And points to the fact that a lot of this data is sort of silent, invisible infrastructure that we rely on, that our society is built on, but that no one's running ads for this data.

It has sold itself because it's so good and because it's largely free and it benefits us all. But I think there's a risk to the fact that it has been chronically, well, it's been underfunded. Our data infrastructure has had cracks for a long time, just like our physical infrastructure. And what we've seen in the past couple of months under this administration is just dynamiting that infrastructure.

So and all right, so then let's get more specific about dynamiting. Like, I mean, it is my understanding that the work you're doing is will preserve data that would otherwise just be gone. Deleted data that researchers rely on, you know, like walk us through the importance of this information and what might happen to it if not for the work that you and others are doing. Yeah, absolutely. So.

We at the Public Environmental Data Partners are trying to safeguard data that falls under the umbrella of environment. So we're using a pretty big umbrella there. There's a lot that we are... counting as environmental data, data that affects our health and the spaces around us. And we have identified what data is important to the broader community, to researchers, to folks who have deep knowledge about...

where these data sets were created and how at risk they might be, how politically vulnerable they might be. And we've identified a priority. list and based on those interactions and that feedback. And we've gone after all the data sets that were identified. So that list started out at about 300 after a bunch of input from folks who have deep expertise with the data. 300 data sets? 300 data sets. Okay.

in that environmental umbrella. And so far, we've safeguarded almost half of that. We have about 170 data sets in the repository that we're... um saving things too we have a whole bunch that are in progress. And we have about 100 that are really gnarly and need some coding, or they're just giant. Or in some cases, they've already been removed. And so to your question,

There's this really interesting and difficult dynamic where we're saving things because they are critical for the research and for our understanding of the world. But... we don't actually know for sure if anything in this list uh when things were identified We're using that as an estimate that this is something that is important. It's important and it's useful to someone. And so we're going to go after it.

This administration is so mercurial. There's so much happening all the time. We've seen cases where data sets go down and then because of suits, they come back up.

have safeguarded things that remain up to this day. There are things that were not on that list that have been taken down. So it's really mixed bag. And it's a really chaotic, difficult landscape to be working in. But at the end of the day... our goal is to ensure that what is identified to us as important remains freely accessible to the public and to researchers.

And it's important, I say free there, that we've seen some folks who are saving this data and then paywalling it. But this data was developed with taxpayer money. for the benefit of all of us, and we think that it should remain freely accessible. Right. You're like in the Internet Archive business, effectively. Absolutely, which is not much of a business. Which is not a business. Yeah, exactly. It is an academic.

Nonprofit undertaking for the most part. Is it all volunteer? Like, how is it? You know, who's all? We have some grant funding. We've received a huge outpouring of support. We launched a Get Involved forum, and in a couple of weeks, we'd had... 600 responses folks offering to help out with every aspect of our work and offering to donate money in addition to time and it's been we've really

been floored by the groundswell of support for this work. Good. Everybody, the links will be in the show notes and the newsletter. So please add either your skills or your money to this project. Give us a sense if you would. Because I think to your point, this is invisible infrastructure and people don't necessarily understand the value of this. Give us a sense of like, what might happen if this...

data goes away, if it has disappeared, if it is unavailable to the various services and researchers that rely on it. Yeah. So... I want to take a minute to talk about the main pillars of our work at the Public and Problems of Data Partners, which I think will help with this. So one of the pillars that we've talked about is the data sets themselves.

We've saved about 170 of those. Then another critical pillar is rebuilding tools. And there were nine federal... environmental justice screening tools um we have done our best to rebuild some of the the screening tools that have so these what did they do yeah Some of the first on the chopping block when the administration came in, there was the White House Council on Environmental Equality's Climate and Economic Justice Screening Tool. There was EPA's EJ Screen. Those were the two most popular.

served about 100,000 users together on a monthly basis and helped them make informed decisions and helped them identify. areas that had faced environmental injustices and helped users make investment decisions. identify areas that need extra protection, extra resources. So these were used by local governments and nonprofits and concerned citizens to identify areas of the country that had overlapping risks like

Where are there lots of lead pipes? Where is there high unemployment? Where is there high air pollution or other pollution from industrial activities? where can we invest to improve public health and address these environmental injustices right and so this is where we should i'm going to interject and say none of this was philosophical like we know that

Power plants and freeways have historically been built in poor areas. We know that Cancer Alley exists. We know that Flint, Michigan, I believe, still does not have clean water. Like it isn't, you know, none of these were. philosophical, it was about identifying real risk and harm and trying to mitigate real risk and harm. Right. Absolutely. To humans in America. To humans in America. Yep. And so... There's this overlapping quality of these resources. And to protect Americans, you need...

a few things. First, you need policies like the Clean Air Act and the Clean Water Act and EPA's National Ambient Air Quality Standards, which, by the way, my lab at Harvard provided a lot of the research that was the basis of the most recent revision of the...

National Ambient Air Quality Standards to better protect Americans from air pollution, which kills more people annually than car crashes. Then after the policies, you need the data. So you need air quality sensors and you need to test for... and identify where there are lead pipes and you need data about unemployment in the federal workforce and the civilian workforce. And then on top of the data, you have tools that help people make sense of the data.

A data set with no interpretation is really difficult to obtain conclusions from. But when you have a tool that allows someone to explore that data, it can be actionable. And that helps us figure out where to invest to protect Americans and where cities should design, where they should add trees to address urban heat island effects.

where folks need clean water. And then that can help direct the grants and investments and additional attention. And we're seeing... attacks from the government on all four of those pillars, policy, data, tools, and investments and grants.

Time for a quick break. When we come back, more on the pushback against the deletion of these data sets. And we'll talk about the impact that all of this could have on the next generation of climate entrepreneurs who rely on federal data, tools and information to build. the companies that could save us from the climate crisis. Really?

Come on, let's go. Hear that? That's the sound of my customers leaving. When your broadband doesn't work, neither does your business. Will Sky Business keep me up and running, no matter what? With 4G backup and our stay connected guarantee, better believe it. Let our dedicated Sky Business team handle your switch today. That's my like it. Subject to availability. For Ts and Cs, visit skybusiness.com. Welcome back to Everybody in the Pool.

We're talking with Jonathan Gilmore, a data scientist at the Harvard T.H. Chan School of Public Health. Before we continue our conversation about climate... Data, I do want to say that Gilmore has done this kind of thing before. He was a key contributor to the Atlantic's COVID tracking project.

During the COVID-19 pandemic, when the CDC under the first Trump administration at first wouldn't release any public information about the spread of the virus and then release data that was wrong or incomplete. OK, back to our interview. So you're preserving the data, rebuilding the tools, it sounds like. Those are the two primary pillars of your work. Do you touch investment and policy at all? We aren't touching investment.

we are advocating for policy so advocacy is another pillar and say the third pillar probably of our work so we've written op-eds about the importance of our federal digital digital infrastructure and the how critical this data is, how critical these tools are. And there are hundreds of organizations and cities and local governments that use these tools that in some cases exist now only.

in the copies that we've stood up. And so we are, in a perverse way, what the government is doing is also bringing a little bit more attention to some of these things, these resources, the data sets and the tools, which...

is important because they are the way that we make sense of the world and where people need additional help. And so while we're not going to see... progressive policy or any of that help coming from the federal government most likely this is an opportunity for private money and foundations to step up and for local governments to step up to protect those who are most vulnerable amongst us.

In no way is this a comfort exactly, but I am a big believer in the Newtonian principle of an equal and opposite reaction. And it has been somewhat heartening to see people... rally to this cause. Like I have seen a lot more concern than I would have expected about repositories of data and preserving data sets. But it is sort of like, it's just sort of wild to think that we would have compiled all of this.

life-saving information and that someone would just come along and either disappear it for political reasons or worse alter it right like i think like this feels like a good time to talk about How some – if the tools are a way to interpret the data, what is the follow-on concern about corruption of the data? There's huge concern. We –

I haven't seen many instances of corruption of the data or, I mean, there have been some changes to some data sets. One of the big nonprofits that are one of the nonprofit. partners that formed the public environmental data partners is the Environmental Data and Governance Initiative. And they were started during... the first trump term um to safeguard environmental data and track changes to environmental data and policy and they they have a data set tracker and stat also uh

ran a piece looking at changes to federal data sets. We've seen some things like changing gender to sex in public health data sets. I'm not aware of any big... you know, concerted efforts to manipulate data and sort of silently change things in federal data sets. The biggest thing we're addressing is just complete wholesale deletion of the data sets.

What you raise is really important. And there is, I think, a huge loss of trust in data and science that comes out of the federal government. There's still... incredibly talented public servants doing excellent science and trying to keep their heads down amidst the reductions in force and firings and reorganizations and doge cuts.

I think given the sheer amount of political pressure that is so evident coming from the top, you know, climate change isn't real. It's the green news scam. This, you know. IRA environmental justice grants and EPA's grants, they're handouts to- these communities that are grifting. All of these things that are coming out of the administration are definitely changing the public perception of everything that will come out of government, whether it's...

politically tainted or not. And I think that's a huge concern. We deal with some of the provenance. issues with the datasets that we're saving. So there's a little bit of, there's a loss of trust when you take something from one place to another. And so when we're saving these datasets and when we're reproducing these tools, we try to provide really rich

information and metadata about where it came from, how we got it, who made it, how did they make it, try to grab any data dictionaries, any context on the web pages. But it's really difficult to... assuage any concerns that a potential visitor might have when you are taking something from one location and putting it in another.

We briefly talked about the equal and opposite reaction and the idea that state and local governments could step in here and institutions could step in. The fact is that for decades, we have relied on federal data. backed by the full faith and credit of the United States, if you will, right? It's sort of like this is the gold standard of information. You can trust it for your weather forecasts. You can trust it for your food safety.

You can trust it for understanding the safest place to put new infrastructure. And there's just no question that without that kind of stamp, it doesn't have the same. Absolutely. I think about the carbon dioxide concentration data that comes out of the Mauna Loa Observatory. That's been the gold standard for... carbon dioxide in the atmosphere for decades and is at risk of going down completely. And it's the self-sabotage we're seeing in terms of...

U.S. science, the entire scientific establishment, we haven't even talked about the attacks on academia, is stunningly short-sighted and really horrific to see. It's going to result in less... fewer advances, fewer biomedical advances, fewer cancer treatments, fewer solutions that address the problems that we're talking about, the air pollution, the lead pipes, it's going to affect.

the entire way we live our lives. And it's going to be a very slow process to understand just how bad things will get, even if we stopped the bleeding, if you will, now. There's been such an impact on the entirety of the establishment. That is a good place to put a slightly, you know, normally on the show I am talking to startups, about startups, with investors.

We're talking about public data, but I think it's an important thing to point out that this is also the data that plenty of startups have been built upon. And, you know, I mean, and that and that that has always kind of been the history, right? Like Uber couldn't exist without the taxpayer funded GPS satellites that provide the mapping data that allow for a multibillion dollar startup to exist.

I think it like it's a if if nothing motivates us, let it be money. Like put a fine point on the fact that this data is also the data you might want to use to build your climate startup or some other kind of startup. Absolutely. So. That's a great point. Thank you for raising it. I keep waiting for private industry to step in and say, hey, hey, hey, this is madness. Let's not.

destroy our federal data infrastructure because we rely on it. I think especially of insurance companies that rely on climate risk data and environment and weather data from... from the federal government to build their models and price insurance. A topic that fascinates me and that I haven't had enough time to dig into lately is climate risks to insurance. And this...

this industry is facing existential risks. And the data that we're talking about informs their understanding of the world and their business direction. And so I... keep waiting for these insurance giants that have quite a bit of sway to step in and do something. And I haven't really seen that. There might be some back-channeling going on. I imagine there probably is. I also imagine that a lot of these companies probably have their own in-house data preservation efforts. I think that it's...

It's lamentable that those will probably never see the light of day for the public because, again, this data was, we paid for this data. We deserve to have it and to benefit from it. But I think that we may... I keep hoping that we may see some pushback from private industry because this data does form the backbone of a lot of the models that we're building, the ways that we understand the world.

Yeah, I mean, it's true. Maybe like scoot over to the economics department and sort of be like, all right, let's put out a report on the – I mean, I would imagine that an attempt to quantify the economic value of the data that we're talking about. would be fascinating because I have to imagine that those are numbers in the hundreds of billions, if not trillions of dollars globally. When you add up health data, climate data, like the data that underpins so many solutions.

I don't know. I try not to be like a mercenary VC about stuff. But honestly, all we talk about is tariffs when we are all about to die in 75 different ways. Whether it's measles, ticks. asteroid okay now i'm just now i have to bleep my own show anyway dengue fever so many of these risks that are coming down the pipe

And literally not knowing what the weather is. Like, you guys, we are literally talking about blinding ourselves to the weather because the word climate exists somewhere in the description. of what these organizations do. Anyway, I like this quantify the economic value thing. Maybe we should team up on this. I'd love to do that. Yeah, I think that's fascinating. I don't know where to start, but let's start it.

I know some people. I know some economist types. Yeah. I know a couple of economics journalists. Like, we could make this a business story. Yeah. This is some Wall Street Journal stuff right here. All right. We're slightly off topic now. But more importantly, Jonathan, where can people contribute, find out more, and hopefully help?

Yeah. So I'll send you our website. It's screening-tools.com. As you'll see, if you visited the website, we've been focused on preserving data and not making a beautiful website. So we've got all the content there. It's up there, but it's pretty bare bones. We also have links to a form to identify data that you think is at risk if you have information or domain expertise.

We especially love to hear from you. And we also have our Get Involved form there. We have a donation link there. We haven't prioritized sort of a GoFundMe. public funding drive we think that well the government should pay for this this work so we're we're talking with with philanthropy um we we think that you know

We're hoping that we can mobilize some large-scale support for this. But there's a link to donate if you so choose. And we have a bunch of... stories linked on the site about who we are and what we've done and um what motivates us amazing jonathan gilmore thank you so much for the good work we appreciate you thanks for having me

That's it for this episode of Everybody in the Pool. A couple things I do want to mention before we end. First, other people and organizations are fighting back against the deletion of federal data with some success. In May, farmers won a lawsuit against the USDA over deleted climate data. Because it turns out climate really matters to farmers. And they won.

And several data sets and tools were reinstated, including the U.S. Forest Service's Climate Risk Viewer. This is also happening with some health data. And Jonathan also pointed out that some of this data collection was actually mandated by Congress. And again, we taxpayers paid for it. So arguably, this wholesale deletion is not within the executive branch's power at all. And in case you're still wondering why it matters, as I have reminded you many times on this show and others,

You can't manage what you don't measure. Email me your thoughts and suggestions to in at everybodyinthepool.com. Find all the latest episodes and more at everybodyinthepool.com, the website. And if you want to become a subscriber and get an ad free version of this show, hit the link in the description in your podcast app of choice. Together, we can get this done. See you next week.

Come on, let's go. Hear that? That's the sound of my customers leaving. When your broadband doesn't work, neither does your business. Will Sky Business keep me up and running, no matter what? With 4G backup and our stay connected guarantee, better believe it. Let our dedicated Sky Business team handle your switch today. That's my like it. Subject to availability. For T's and C's visit skybusiness.com.

Every great business starts as a dream, but turning that dream into reality, that's where Shopify comes in. Shopify is proud to power many sources of UK e-commerce from brands like Mattel and Gymshark to new entrepreneurs. Thank you. Need a hand with content? Shopify's AI tools generate product descriptions, headlines and even enhance your images. Struggling to get noticed? Shopify's marketing tools let you launch email and social campaigns with ease.

And if you ever get stuck, Shopify's award-winning 24-7 support is always there to help. Turn those what-ifs into why-nots. Turn those dreams into... And give them the best shot at success with Shopify. Sign up for your £1 per month trial and start selling today at shopify.co.uk slash try. Go to shopify.co.uk slash try. shopify.co.uk slash try.

This transcript was generated by Metacast using AI and may contain inaccuracies. Learn more about transcripts.
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast