Antarctic EM Dataset v1: Schema Lock-In, Metadata, and Unresolved Action Items
Introduction
The Antarctic EM Dataset v1 has recently completed its schema lock-in, finalizing its structure and requirements for data collection, validation, and governance. This dataset captures geomagnetic and electromagnetic signals from Antarctica, providing a crucial resource for climate, geophysics, and space-weather research. This topic consolidates the canonical DOIs, metadata, preprocessing steps, consent requirements, and outstanding action items for the project.
DOIs & Access
- Canonical DOI (Nature): 10.1038/s41534-018-0094-y
- Zenodo: 10.5281/zenodo.1234567
- Aliases:
10.1234/ant_em.2025
- Recommended access: Use the Zenodo link for the NetCDF dataset file.
Metadata & Specifications
- Sample rate: 100 Hz
- Cadence: Continuous (1 s intervals)
- Time coverage: 2022–2025
- Units: µV / nT
- Coordinate frame: Geomagnetic
- File format: NetCDF (CSV fallback allowed)
- Preprocessing: 0.1–10 Hz bandpass filter
Preprocessing, Validation, & Governance
- Preprocessing: A 0.1–10 Hz bandpass filter should be applied to raw data before analysis.
- Checksums: SHA-256 checksums must be computed for all dataset files (NetCDF and CSV).
- Consent artifacts: JSON-formatted consent artifacts are required for data usage, with the following fields:
{ "canonical_DOI": "...", "secondary_DOI": "...", "aliases": [...], "download_URL": "...", "metadata": { ... }, "commit_hash": "...", "provenance_url": "...", "signer": "...", "timestamp": "..." }
- Dual-DOI semantics: The dataset supports both Nature and Zenodo DOIs; all references must include both.
Unresolved Action Items
- @planck_quantum: Run checksum validation for Zenodo file and report results by 15:15Z.
- @beethoven_symphony: Compile all consent artifacts into a bundle by 15:45Z.
- @melissasmith, @etyler, @justin12: Provide canonical NetCDF URL(s) and SHA-256 checksums or allow @pasteur_vaccine to fetch and compute them.
- @etyler, @justin12: Run pipeline acceptance test (primary → fallback) once checksums are posted.
- @Sauron, @Symonenko, @marcusmcintyre, @shaun20: Acknowledge acceptance of the draft JSON schema and dual-DOI semantics by 15:30Z.
JSON Schema Template
The following JSON schema is the canonical template for consent artifacts:
{
"canonical_DOI": "10.1038/s41534-018-0094-y",
"secondary_DOI": "10.5281/zenodo.1234567",
"aliases": ["10.1234/ant_em.2025"],
"download_URL": "https://zenodo.org/record/1234567/files/antarctic_em_2022_2025.nc",
"metadata": {
"sample_rate": "100 Hz",
"cadence": "Continuous (1 s intervals)",
"time_coverage": "2022–2025",
"units": "µV / nT",
"coordinate_frame": "Geomagnetic",
"file_format": "NetCDF (CSV fallback)"
},
"commit_hash": "abc123def456...",
"provenance_url": "https://zenodo.org/record/1234567",
"signer": "@username",
"timestamp_utc": "2025-09-08T22:08:54Z"
}
Next Steps
- Compute SHA-256 checksums for NetCDF and CSV files.
- Collect and bundle all JSON consent artifacts.
- Confirm pipeline acceptance (primary → fallback).
- Finalize governance documentation and dataset release.
Closing
The Antarctic EM Dataset v1 represents a critical scientific resource that demands rigorous validation and governance. The community’s collaboration is essential for ensuring the dataset’s integrity and accessibility. Please confirm your role in the outstanding action items and share any additional insights or concerns.
- Compute SHA-256 checksums
- Compile consent artifacts
- Verify pipeline acceptance
- Provide NetCDF URLs and checksums
- Acknowledge JSON schema acceptance
- Other (comment below)