Documentation & Data
The Underlying Data
We are incredibly grateful to the researchers who have generated and shared these data. This project would not be possible without their commitment to open science. Below is a summary of the datasets currently integrated into the Open Pathogen Kinetics Commons.
| Citation Key | Pathogen | Data Points | DOI |
|---|---|---|---|
| Eales2025_Baker | Flu | 485 | 10.1101/2025.02.01.636082v1 |
| Eales2025_Caserta | Flu | 167 | 10.1101/2025.02.01.636082v1 |
| Eales2025_Halwe | Flu | 74 | 10.1101/2025.02.01.636082v1 |
| hakki2022 | SARS2 | 1,050 | 10.1016/S2213-2600(22)00226-0 |
| jones2021 | SARS2 | 18,136 | 10.1126/science.abi5273 |
| ke2022 | SARS2 | 2,802 | 10.1038/s41564-022-01105-z |
| kissler2023 | SARS2 | 21,554 | 10.1038/s41467-023-41941-z |
| puhach2022 | SARS2 | 565 | 10.1038/s41591-022-01816-0 |
| russell2024 | SARS2 | 1,451 | 10.1371/journal.pbio.3002463 |
| savela2022 | SARS2 | 544 | 10.1128/JCM.01785-21 |
| wagstaffe2024 | SARS2 | 1,368 | 10.1126/sciimmunol.adj9285 |
| waickman2022 | Dengue | 2,826 | 10.1126/scitranslmed.abo5019 |
| waickman2024 | Dengue | 2,430 | 10.1038/s41564-024-01668-z |
| wongnak2024 | SARS2 | 24,559 | 10.1016/S1473-3099(24)00183-X |
Data Selection
The datasets included in OPKC were identified through a comprehensive literature search focusing on studies with longitudinal pathogen load measurements. We are continuously working to expand this library by bringing in new papers and datasets as they become available.
All underlying data are either open access or used with specific permission from the authors. Our inclusion criteria prioritize studies that meet the "bare minimum" requirement of providing pathogen load measurements over time (see our Data Standard for details). We filter and clean these datasets to ensure they can be meaningfully compared, but we strive to preserve the original structure and intent of the data wherever possible.
Data Visualization
The Sample Explorer is a tool designed to provide immediate, visual insight into the contents of the database. It allows users to:
- Filter by study, pathogen, sample source, and more.
- Visualize kinetic trajectories to understand the density and duration of sampling.
- Compare data across different studies and units (e.g., Ct values vs. pathogen load).
This visualization is intended as a "helper" tool—a way to quickly assess which datasets might be relevant for your specific research question before downloading the full dataset for detailed analysis.
Web Development
The Open Pathogen Kinetics Commons is built using open-source tools to ensure transparency and reproducibility.
- Backend: Django (Python)
- Frontend: Tailwind CSS and Plotly.js
- Code Assistance: Google Gemini was used to assist with writing the underlying code for this platform.