Data is plural is a weekly newsletter of useful/curious datasets. This edition, dated June 1, 2022has been republished with permission from the author.
School shootings continued. The K-12 School Shooting Database, housed at the Naval Postgraduate School, “documents every instance a firearm is brandished, fired, or a bullet strikes school property for any reason. either, whatever the number of victims, the hour, the day of the week.” It describes more than 2,000 incidents from 1970 to the present day and links them to information about more than 3,000 killed and injured victims, more than 2,200 shooters and more than 2,000 weapons. Related: A few years ago, CNN compiled a dataset of 180 school shootings from 2009 to 2018, focusing on incidents where at least one person was shot. Previously: School shooting data from The Washington Post (DIP 2018.04.25) and mass shooting data from The Violence Project, Gun Violence Archive and Mother Jones (DIP 2021.03.24, DIP 2015.12.09). [h/t Michael A. Rice + Sam Petulla]
Congress, consolidated. CongressData, published last month by political scientists at the Institute for Public Policy and Social Research, “compiles information on all U.S. congressional districts,” the lawmakers who represent them, and the policy-making behavior of those lawmakers (such as the composition of committees and the number of bills sponsored). The dataset covers the period 1789-2021, although many variables (such as those derived from the Census Bureau’s American Community Survey) are only available for more recent years. [h/t Erik Gahner Larsen]
Wind and solar energy. The Global Energy Monitor’s Global Wind Power Tracker is “a global dataset of large-scale wind installations”, focusing on those with a planned or installed capacity of at least 10 megawatts. It provides the name, location, status, capacity, facility type, owner, and other details for each facility. The project was launched last week alongside a sister dataset, the Global Solar Power Tracker. They join a growing collection of trackers from the organization, including those that examine coal infrastructure, steel mills, and oil and gas resources. [h/t Nathaniel Hoffman]
Olympic accounting. Martin Muller et al. compiled a set of cost and revenue data for three recurring “mega-events”: the Summer Olympics, the Winter Olympics and the FIFA Men’s World Cup. For each event between 1964 and 2018, it shows the number of athletes, the number of accredited media, venue costs, organizing costs, ticketing revenue, broadcast revenue and sponsorship revenue.
“What Middletown Read.” With the discovery of a “dusty ledger collection” in 2003, researchers built a database of (nearly) every case in Muncie, Indiana’s public library from November 1891 to December 1902. The project , a collaboration between the library and Ball State University, takes its name from a famous sociological study that pseudonymized Muncie as Middletown. Previously: Cash registers of the Seattle Public Library since 2005 (DIP 2017.03.01). [h/t Matt Brown]
Note: Unlike most of our content, this edition of Data Is Plural by Jeremy Singer-Vine is not available for republication under a Creative Commons license.