Sharing Data
In our work comparing different approaches to pathogen detection and piloting a biosurveillance system, we collect metagenomic sequencing data. We aim to share as much of this data as we can, while respecting access restrictions requested by our partners.
Zephyr (Boston Nasal Swabs)
We collect nasal swabs at public locations in greater Boston. Sequences we identify as potentially human-infecting viruses are linked from our sample log in FASTQ format. We have also published our data via the SRA (PRJNA1379685).
CASPER (Wastewater Sequencing)
In collaboration with partners across the CASPER network, we are monitoring wastewater from multiple metropolitan areas. Our collaborators at the Universities of Missouri and Wisconsin-Madison, along with others in the Lung.Fish collaboration, maintain a dashboard tracking pathogen abundance over time.
As of March 2026, the CASPER collaboration has publicly shared 1.2B read pairs from 1,206 samples collected from December 2023 through December 2025. These are available on the SRA under accession PRJNA1247874, with a small amount of earlier data available at PRJNA1198001. While this represents the majority of the data we have sequenced to date, if you would like access to data that has not yet been made public please send us a brief description of your research.