OAE Data Protocol, Version 1.0.0 - Carbon to Sea Initiative

OAE Data Management Protocol

The OAE Data Management Protocol outlines recommendations for producing consistent data and metadata for Ocean Alkalinity Enhancement (OAE) research projects.

With broad adoption, this protocol will make it possible to compare and interpret field research and more quickly advance our understanding of the field.

In collaboration with Submarine Scientific, NOAA and dozens of ocean researchers, we present this protocol after several rounds of feedback.

Please use the following when citing this protocol:
Long, J., et al. (2025). Ocean Alkalinity Enhancement Data Management Protocol v.1.0.0. https://doi.org/10.5281/zenodo.17154988

If you have questions or concerns, please email data@carbontosea.org.

Column Header Names
Controlled Vocabularies
Deprecated Standards and Practices
Acknowledgements
References
Revision History

Background

Introduction

This document outlines recommendations for producing consistent data and metadata for Ocean Alkalinity Enhancement (OAE) field trial projects. Its first iteration was produced in partnership with the OAE community (see Acknowledgements), with the intention to remain a living document and continually improve to reflect best known scientific practice. Initially the document will be revised and updated in line with the latest OAE research approximately once per year, incorporating learnings and feedback from the OAE community and led by the Data Management Coordination Team. All major updates will undergo review and all previous versions will be maintained for transparency (see Deprecated Standards and Practices for a detailed description of versioning).

Objectives & Guiding Principles

The objective of the OAE Data Management Guidelines and this OAE Data Management Protocol document are to enable marine Carbon Dioxide Removal (mCDR)-OAE data collected from academia, government, non-profit, and industry to be documented in a consistent way, and make them findable and discoverable from shared data repositories to facilitate future data synthesis efforts. The guidelines here support FAIR data sharing principles, in effort to make data findable, accessible, interoperable, and reusable.

OAE introduces unique challenges and opportunities for data standardization. Traditional oceanographic data standards, while robust, require updates to address the specific needs of OAE projects.

The main updates and recommendations are driven by these Guiding Principles:

Project Comparability: Developing guidelines that ensure data from different OAE projects are intercomparable, enabling meta-analyses and large-scale assessments of OAE effectiveness and environmental impact.
Minimal Burden on Data Providers: Establishing streamlined protocols and tools that simplify data submission while ensuring high-quality, standardized outputs.
Flexibility for Innovation: Allowing for innovation in project designs by creating standards that accommodate diverse methodologies and intervention scales without imposing restrictive requirements.
Transparency and Accessibility: Promoting open and transparent data sharing, with appropriate metadata, to facilitate peer review, collaboration, and public trust in OAE efforts.

By building on existing standards and addressing these updates, the goal is to create a system that supports rigorous science while remaining practical and adaptable for data providers. This ensures that OAE projects contribute meaningfully to the collective understanding of marine carbon dioxide removal while fostering collaboration across the community.

Methodology

This protocol was developed by the OAE community in a multi-process method, starting with a workshop during OCEANS 2024 with participants representing academia, government, non-profit, and industry to gather initial feedback and input for sensor output, model output, and discrete carbonate and nutrient data. This feedback was developed into an outlined draft of recommendations, which were reviewed by attendees and developed into the first draft OAE Data Management Protocol 0.1.0.

Working Groups were formed to capture input from biological sciences, sediment processes, and social science data, each hosting virtual meetings to gather community feedback, which further informed the initial draft. The recommended variable column header names and controlled vocabularies are made to mirror existing naming conventions, and long-standing recommendations by the Ocean Acidification Community. The draft was then provided to the Steering Committee members for internal review. The resulting draft was presented at a second workshop during AGU in December 2024 for additional mCDR-community feedback, followed by additional internal revisions to create the draft that was presented during an open public review period. This document represents the final conclusions developed by the OAE community to ensure projects will be standardized by data providers, and findable, openly accessible, and intercomparable by data end-users.

This work was performed under a Cooperative Research and Development Agreement (CRADA) between NOAA and Carbon to Sea. However, the views expressed herein are not necessarily those of NOAA, the Department of Commerce or the U.S. Government.

We are grateful for the contributions of the workshop participants, working group members, the Steering Committee and those who provided comments during the open review period. For a list of contributors and contributing authors, please see the Acknowledgements section.

Intended Users

This protocol is designed to support data producers, granting agencies, institutions, registries, regulatory and verifying bodies working on OAE field projects that generate model output and/or field data from various sources, including commercially available sensors, discrete observations, social sciences, sediment processes, and biological processes. It provides a standardized approach for documenting datasets while offering guidance on selecting appropriate repositories, submission timelines, controlled vocabularies, and best practices to facilitate field data intercomparison. The level of detail provided to guide various fields of research and observing systems varies, as protocols and recommendations have been better defined in some areas than others by the general oceanographic community. Future work to develop these areas pertinent to the scope of this protocol are outlined in Emerging Standards. Data from field projects include any observations made by measuring the natural environment as well as its response to alkalinity addition, versus controlled laboratory experiments.

The protocol provides best practices around data management that may be adopted by any user to help develop their data management plan. Projects referencing compliance to this protocol must meet the minimum requirements outlined below, though additional recommendations provided here are strongly encouraged. While developed for OAE, these guidelines may also be relevant to other marine Carbon Dioxide Removal (mCDR) methods. Please reach out to data@carbontosea.org with any questions, comments, or suggestions regarding the protocol.

The protocol is organized into five chapters:

Metadata & Templates – Building on existing metadata guidelines that accompany each repository, this protocol requires mCDR-specific Metadata & Templates to provide essential context unique to OAE field trials. It also includes Model Metadata to introduce and define key fields for documenting model and model output data details.
Guidelines for Data Management – This chapter outlines how to set up, manage and submit data across a range of data types.
Column Header Names – Recommendations are provided for column header names for use in data files.
Controlled Vocabularies – Definitions are provided for controlled vocabularies, including OAE-specific fields.
Deprecated Standards and Practices – Versioning control for the Data Protocol is outlined here.

mCDR	Marine carbon dioxide removal
OAE	Ocean alkalinity enhancement
DOI	Digital object identifier

Metadata	Metadata is structured information that describes and provides context for a data resource, helping to ensure that the dataset remains discoverable and usable in the future.
Data standards	Data standards are a set of agreed-upon rules, formats, and conventions used to define and structure data, ensuring consistency, interoperability, and clarity across different systems, datasets, and organizations. They help maintain data quality, facilitate data sharing, and enable effective analysis by establishing uniformity in how data are managed.
Column header names	Standardized column header terms describing a parameter, these may be an abbreviation of the measured parameter.
Controlled vocabulary	Controlled vocabularies are standardized lists of terms and definitions used to ensure consistency in the naming and classification of concepts within a specific domain. By limiting the use of predefined terms, controlled vocabularies help avoid ambiguity, enhance data interoperability, and improve the accuracy of data retrieval and analysis across different systems and datasets.
Ocean alkalinity enhancement	Ocean alkalinity enhancement (OAE) is a climate change mitigation strategy that involves increasing the alkalinity of seawater to enhance its capacity to absorb and store atmospheric carbon dioxide.
Platform	Any physical structure or system used to support and deploy instruments, sensors, or other equipment for collecting data in the ocean. Platforms can include research vessels, ships of opportunity (SOOP), profiling floats, buoys, underwater vehicles, and moorings.
Sensor data	Sensors refer to instruments or devices used to measure and collect data on various oceanographic parameters and are typically deployed on ships from rosettes or underway systems, buoys, underwater vehicles, or autonomous, aerial, or space-born platforms. Data collected from these systems are considered sensor data and do not refer to data from autosampling devices.
Quality control	Methods or procedures involving validating and verifying collected data to identify and correct errors, inconsistencies, or outliers, ensuring that the measurements are accurate and suitable for analysis. This typically includes tasks such as instrument calibration, data validation checks, and cross-referencing with other datasets to maintain the integrity of scientific results.
Model data	The model data referenced in this protocol refers to code, configuration and output from mathematical simulations that discretise the equations for fluid motion and energy transfer and integrate these over time on a realistic three-dimensional grid. This encompasses model output relevant to OAE projects on nearfield and regional scales, as well as global circulation models (GCMs) and Earth System Models (ESMs). This could include ocean circulation models with or without coupling to biogeochemical, sediment, sea ice, or atmospheric models. This does not currently cover data standards for conceptual, process models, 1D or 2D models, or simplified plume mixing zone models.
Data file	Refers to a file containing values of some measurements. File format type may vary (e.g., NetCDF, xlsx, xml), however all data files will contain quantitative values with associated column header names.
Baseline	Baseline refers to the initial set of data or conditions that are representative of the marine environment without interventions or modifications made. This baseline field data serve as a reference point for comparing intervention measurements, allowing for the assessment of the effectiveness and impacts of the interventions over time, such as changes in ocean alkalinity, CO₂ absorption, or ecosystem health.
Intervention	An intervention refers to the intentional action or process applied to the ocean to alter its chemical or physical properties in order to enhance its capacity for carbon dioxide removal. This could include adding alkaline substances to the water or implementing other methods aimed at increasing ocean alkalinity and improving the ocean’s ability to absorb and store atmospheric CO₂.
Control	A control site refers to a designated area in proximity to an intervention site, with shared characteristic waters, but that remains unaffected by the intervention, serving as a ‘control’ for comparison during and following intervention. The purpose of a control site is to isolate and account for natural variability in oceanographic conditions, biogeochemical processes, and carbon fluxes, enabling the evaluation of changes directly attributable to intervention activities.
Counterfactual	A counterfactual model experiment simulates what would have happened in the absence of a particular intervention, i.e. assuming baseline conditions.

mCDR Project Metadata	The project metadata contains high level project information that stays constant across all experiments and datasets within a project
Fields All fields are required if applicable to your project, unless noted as recommended.	Input descriptions for each metadata field are provided below, inputs shown in ‘quotes’ denote controlled vocabularies and must match the exact vocabulary provided. If a required field is not applicable to your project, put ‘NA’.	Illustrative example (not based on a real study).
Project ID:	The project to which the submitted data belong. A unique project identifier that can be used to link project data across data submissions, and link baseline data to intervention data, for example. If no Project ID has been assigned, one may be generated by combining the following fields, as described in Cross-linking Data Sets with Common Identifiers. Any method that creates a unique ID that will link all project data (e.g., a project’s baseline data to intervention data, and various data submissions within an experiment type) is acceptable.	Lead organizer: Carbon Dive Project beginning August 05, 2025 Hvalfjordur, Iceland Carbondive_20250805_Hvalfjordur
Temporal coverage:	Start date and end date (if unknown put ‘NA’) of the project in ISO-8601 format YYYY-MM-DD	[2023-04-28, 2025-02-19]
Spatial coverage:	Latitude/longitude bounds of project site (e.g., boundary domain of observations or relevant activities) provided in decimal degrees as westernmost longitude, southernmost latitude, easternmost longitude, northernmost latitude. [S, W, N, E]	[64.227, -22.190, 64.411, -21.350]
Vertical coverage:	Minimum and maximum depths of observation in meters.	[0, 76]
Sea names (recommended):	Names of the seas where the data collection takes place, See Controlled Vocabularies section for definitions.	North Atlantic Ocean
Project description:	A narrative description of the project. For example, what were the goals of the project? What were the research questions? What were the processes to achieve these goals and answer these questions? Who were the key stakeholders, organizers, project leaders? Was this building off a previous or ongoing project, or is this a new region/experiment/mechanistic study? If there are relevant regulatory parameters and/or limits to dosing trials at this location, these may be described here.	Hvalfjordur MRV System Pilot Study A baseline study beginning in 2024 captured the physical, biogeochemical, atmospheric and biological data of the site over the course of a year. It included autonomous and vessel-based samples as well as public data sources. Building on this year-long baseline study, Dual Tracer study, and Dye Tracer Study, an interdisciplinary project team under the leadership of Dr. Jane Doe conducted research to establish and test a prototype MRV System Pilot. The research questions were: 1) Can adding NaOH effectively increase seawater alkalinity, 2) Can increased alkalinity reduce surface ocean pCO₂, 3) Does reducing surface ocean pCO₂ result in CDR, and 4) is there an impact on local species and natural communities as a result?
Physical site description:	Provide information to help characterize the field site and provide context when interpreting the data. For example, descriptions of tidal patterns, climatological conditions, notable geological characteristics, the geographical and marine setting (coastal, intertidal, island region, sheltered environment), and characteristic meteorological events. If possible based on the file type of this submission, please include useful maps or figures here. Links to relevant datasets, cruise reports, etc may be provided here.	Hvalfjordur, Iceland The proposed field site is in Hvalfjordur, Iceland. The fjord is approximately 35 km long, 3.5 km wide and 15 – 50 m deep. The site has a sheltered physical environment with predictable circulation and water residence time. The flow in the fjord is characterized by inflow at depth and outflow at the surface, with primarily counterclockwise circulation. Water temp ranges from 0° C in winter and 10° C in summer. Hvalfjordur experiences a subpolar oceanic climate characterized by strong downslope winds, increased rainfall due to its fjord-mountain landscape, and maritime temperature moderation from the North Atlantic current.
Social context site description:	Details may include: -Commercial, recreational, ecological, and cultural uses of study site -Industrial site history -Demographics of site area -Notable events that may impact local sentiment to mCDR (for example: site had significant toxic spill in past decade, local positive support for offshore wind farming, frequent HAB site) -Ecologically protected species, economically significant operations in the marine environment -In study areas with nearby state or federal jurisdiction borders, potential conflicts with other countries or permits from foreign governments should be described. -Links to relevant social science surveys, engaged community groups, etc.	The local community is represented in project governance (board) and is engaged actively via town halls, information sessions, a website (www.communityexample.earth) and newsletter (www.exampleletter.com). Cultural activities in the fjord include mussel harvesting, though the toxicity of the mussels is monitored by the food agency in Iceland and is not always permitted. Global Attitude Surveys were conducted in 2010 and 2020 and reported in the Bjarnadóttir et al report What do Icelanders think about the environment and climate change? Economic activity in the fjord includes several areas zoned for sediment mining, a port in Grundertangi and an aluminum smelting plant, among others. Moderate harbour seal population has rebounded in recent years. Usually no pelagic fish in fishable quantities with the exception of winter 1947 – 1948 where large schools of summer spawning herring led to high catches.
Social research conducted to date:	A description of any social research conducted to date. If provided as a separate file, list filename here. Information may include: -Description of Community engagement research approach conducted and results -Stakeholder mapping method and link to output	Stakeholder mapping is provided in file icealand_stakeholders.pdf
mCDR pathway:	Please select any of the following that describe the submitted data: ‘ocean alkalinity enhancement’, ‘biomass sinking’, ‘direct ocean capture’, ‘ocean nutrient fertilization’, ‘artificial upwelling and downwelling’, ‘marine ecosystem recovery’, ‘other’ See Controlled Vocabularies section for definitions.	ocean alkalinity enhancement
Previous or on-going mCDR research in the area:	This field is required for co-located operations that potentially impact the project results. If previous or on-going mCDR field operations have occurred in the study domain by any project developer, they may be mentioned here either as a description, and/or if a reference to the study exists in the form of a data set, publication, etc, the DOI or other identifying information should be provided. Please provide direct links to data when available.	mCDR company Algae Lock was headquartered near the fjord and conducted some proprietary carbonate chemistry and algae farming research in the fjord (http://doi.org/xx.xxx). Algae farming operations were active from August 2022 to December 2023.
Co-located operations:	A description is required if any nearby operations exist that may influence the waters over the time period covered by this data. This might be a nearby mCDR project, a facility that discharges water with different characteristics than the inflow (e.g., a desalination plant), frequent boating operations, etc.	Aluminum smelting plant co-located in fjord at latitude/longitude 64.5°N, -21.3°W, activities and plans unknown.
Permit number:	Associated permit number(s).	permit #XYZ (permit pending, example is illustrative)
Permit approval document:	Link to permit or document reference.	permit #XYZ (permit pending, example is illustrative)
Permitting authority:	Name of organization or authority related to permitting, if applicable.	Ministry for Foreign Affairs (Utanríkisráðuneytið) Environmental Agency of Iceland (Umhverfisstofnun)
Public comments (recommended):	Provide the link to any public comments that were generated in response to this study (e.g., for permitting), or if uploaded separately as a pdf, provide the file name here.	Public comment files have been compiled into one pdf titled ‘Carbondive_20250805_Hvalfjordur_public_comments.pdf’ and available at http://doi.XXX.
Research project:	Project, which the data collection is part of. For example, West Coast Ocean Acidification (WCOA) Project.	NA
Funding info:	Include the name of the funder, funder country, project title, project ID, the project start and end dates, and whether the funding is public vs private. If there is no funding source (e.g., in the case of commercial projects), put ‘NA’.	NA
Datasets and experiments that should be cross linked to this one:	All datasets submitted under this Project ID should be listed here. For example, if some data were submitted to other repositories, this will link the data. If all data for this Project ID are included in the resulting DOI from this submission, please indicate that “All data for the current experiment are provided in this submission”. Also include links to any other experimental data (e.g., Experiment IDs) produced within this Project ID. For model experiments, project datasets that are directly used to force, inform or validate the model are required. References to other associated datasets are recommended but not required.	All data for the current experiment (CarbonModel_20250805_Hvalfjordur_intervention01) are provided in this submission, with the current draft doi: http://doi.org/xx.xxx Additional experimental project data include: Baseline study: Carbondive_20250805_Hvalfjordur_baseline01 http://doi.org/xx.xxx Dye Tracer Study: CarbonModel_20250805_Hvalfjordur_intervention02 http://doi.org/xx.xxx Counterfactual model run: CarbonModel_01012024_iceland_modeloutput01 http://doi.org/xx.xxx Earlier biological research relevant to mCDR operations can be found via the Marine and Freshwater Research Institute.
Additional details:	Open text area to include additional information. These may include information for sediment processes data, biological data, or any other required information if not included in the main metadata or data files; see Guidelines for Data Submission for relevant sections of your data. Additional informational files, such as digitized laboratory notebooks, blogs, etc., may be linked here.	See https://samplewebsite.is/data for a field blog and additional data from this site.

Experiment Metadata	Experiment metadata applies to a specific study but remains consistent across datasets.
Fields All fields are required if applicable to your project, unless noted as recommended.	Input descriptions for each metadata field are provided below, inputs shown in ‘quotes’ denote controlled vocabularies and must match the exact vocabulary provided. If a required field is not applicable to your project, put ‘NA’.	Illustrative example (not based on a real study).
Project ID:	Project ID must be included in the Experiment-level metadata in order to link all experiments to the main project. The project to which the submitted data belong. A unique project identifier that can be used to link project data across data submissions, and link baseline data to intervention data, for example. If no Project ID has been assigned, one may be generated by combining the following fields, as described in Cross-linking Data Sets with Common Identifiers. Any method that creates a unique ID that will link all project data (e.g., a project’s baseline data to intervention data, and various data submissions within an experiment type) is acceptable.	Lead organizer: Carbon Dive Project beginning August 05, 2025 Hvalfjordur, Iceland Carbondive_20250805_Hvalfjordur
mCDR experiment type:	‘baseline’, ‘control’, ‘intervention’, ‘model’, ‘other’ See Controlled Vocabularies section for definitions.	intervention
Experiment ID:	The experiment to which the data belong. Any naming convention that produces a unique ID is usable. The recommended naming convention is: Project ID Experiment type Optional numerical indicator to differentiate between various experiments of the same type for a project. A two digit consecutive number beginning with 01	First intervention (experiment type = intervention) for this project Carbondive_20250805_Hvalfjordur_intervention01
Investigators:	Provide details for each investigator including: Name, institutional information (name, address), phone, email, ID type (e.g., ORCID, etc), researcher ID, and role.	Jane Doe, University of Europa, 2345 Galactic Way, Centauri, CA, 98750, (+019-091-346-2938), ORCID, 9x938429x, Chief scientist for cruise data, data submitter.
Start date and time:	Start date and time of experiment in UTC ISO-8601	2025-08-05T01:20:30Z
End date and time:	End date and time of experiment in UTC ISO-8601	2025-08-07T11:00:00Z
Spatial coverage:	Latitude/longitude bounds of observed data in experiment, provided in decimal degrees as westernmost longitude, southernmost latitude, easternmost longitude, northernmost latitude. [S, W, N, E]	[64.227, -22.190, 64.411, -21.350]
Vertical coverage:	Minimum and maximum depths of observations in meters.	[0, 76]
Experiment description:	A narrative description of the experiment. For example, what part of the project do these data represent (e.g., baseline, intervention, control) and what do they contribute to the overall project? Are all project research questions listed in Project description relevant? What were the processes to achieve these goals and answer these questions? Data submitters are encouraged to note any significant changes to the original experimental plan due to unforeseen circumstances here.	On August 5, 2025 the project team released 23 tons of diluted NaOH solution over 96 hours and observed the results for 14 days. This experiment represented the first intervention conducted in the region and for this project. All project research questions above are relevant as the intervention will allow these questions to be answered. To effectively monitor these study regions in order to answer these questions, 10 repeat ship surveys were conducted to collect grab samples, underway sensor data, and profile data from rosettes. 12 buoys were deployed with sensor arrays including temperature, salinity, oxygen, chlorophyll, particulate information, pH, and pCO₂. These data will be used to monitor the change in seawater pCO₂ and local species impact due to aqueous alkalinity addition.
Data conflicts and unreported data:	If data exist that are or have been used by the project but are not provided due to conflicts (e.g., geopolitical or other), data availability (e.g., a dataset is no longer available), it may be noted here.	Data from a citizen-based water quality effort were available between the years 2021 – 2023 and are informed by Carbon Dive project planning, but are no longer accessible.
Meteorological and tidal data:	Include links to relevant open datasets if referenced in the experiment but not provided in the submission.	Wind data: Vindatlas.vedur.is Bathymetry data: Coast Guard Data from Atlas.lmi.is Land and water usage map: Vefsja.is Tide & weather data (sea level, wind, air pressure, temperature, salinity: Vedur.mogt.is)
Additional details:	Open text area to include additional information. These may include information for sediment processes data, biological data, or any other required information if not included in the main metadata or data files; see Guidelines for Data Management for relevant sections of your data. Additional informational files, such as digitized laboratory notebooks, blogs, etc., may be linked here.	See https://samplewebsite.is/data for a field blog and additional data from this site.
Experiment Metadata for OAE Interventions	OAE Intervention Metadata are additional Experiment Metadata that apply to experiments where an intervention, such as an alkalinity addition, was conducted.
Alkalinity feedstock processing:	Select all that apply: ‘electrochemistry’, ‘synthetically derived’, ‘mineral mining’, ‘blended’, ‘other’ See Controlled Vocabularies section for definitions.	Synthetically derived
Alkalinity feedstock form:	The phase upon delivery to the ocean: ‘solid’, ‘aqueous’, ‘slurry’ See Controlled Vocabularies section for definitions.	aqueous
Alkalinity feedstock:	Examples may include: olivine, potassium hydroxide, magnesium hydroxide, lime, portlandite, calcium carbonate, anorthite, dolomite, periclase, brucite, magnesite, forsterite, sodium hydroxide, natrite, nahcolite, akermanite, akermanite, alunoakermanite, etc. See Controlled Vocabularies section for selected examples (this list is not exhaustive, you may need to include your unique feedstock).	sodium hydroxide
Alkalinity feedstock CO₂ removal potential:	Maximum CO₂ removal potential of a mineral/rock feedstock material. We recommend using an adjusted version of the Steinour equation (Gunning et al., 2010), which uses bulk elemental oxide composition to estimate the maximum CO₂ removal potential of a feedstock material. The calculation output is in the form of kg of CO₂ per tonne of feedstock and represents the quantitative hypothetical potential of the material to capture CO₂ as bicarbonate or carbonate. See Isometric’s CO2 removal potential module for details.	N/A
Alkalinity feedstock description:	Information such as feedstock source, characteristics, impurities, dilution prior to dosing, and concentration. For feedstock other than NaOH: trace metal composition and particulate grain size. Any variable information must be provided in the dosing data file, in this case include the data file and column header names here provided as variables. See Intervention Data for details.	30% NaOH solution (commercially acquired) mixed with freshwater to achieve 1050 kg/m3 density. Tagged with 32g of inert gas SF6, dissolved in 1000 liters of freshwater.
Equilibration:	Pre-equilibrated or Unequilibrated	unequilibrated
Dosing location:	Provide latitude and longitude in decimal degrees. Depending on your method of dispersal, this information may be provided as a point source, vector, or bounding box. If provided as a vector, the latitude and longitude values should be included in the dosing data file.	[64.394426, -21.465808]
Dosing dispersal hydrologic location:	Descriptive dosing location, select from the following: ‘coastal surface’, ‘offshore surface’, ‘river’, ‘wetland’, ‘seafloor’	coastal surface
Dosing delivery type:	‘static point source’, ‘variable point source’, ‘static distributed’, ‘variable distributed’. See Controlled Vocabularies for definitions.	static point source
Dosing depth(s):	Depth(s) in meters. If this is variable, please include the schedule of depth changes and depths, or as a vector in meters with the data, named ‘dosing_depth’. Please note here that ‘dosing depth is provided as a variable’.	3 meters below surface
Dosing description:	Please be descriptive. Information about the dosing mechanism must be included (e.g., outflow from pipe, diffuser, doser, manual placement) E.g., outflow from existing facility pipe directly to ocean, manual riverine introduction, coastal distribution at three separate 30 meter long sections, pier-based diffuser to intercoastal bay, distributed from stationary barge 10 miles offshore.	Static pier-based diffuser. Three meter long steel diffuser oriented parallel to shore.
Dosing regimen:	At a minimum, please provide the schedule and timeline of dosing, including the time between doses, the duration of treatment and the amount used each time. More optimally, this information would be provided as a vector of binary data in the data file where 1 = dosing ‘on’ 0 = dosing ‘off’, using the column header name ‘dosing_onoff’. If provided as a vector state here as ‘dosing regimen is provided as a variable’ and include the file name. See Intervention Data for details.	August 5, 2025: 3 IBC Test 09:00 – 12:00 August 6, 2025: 7 IBC dosing 09:00 – 16:00 August 7, 2025: 13 IBC dosing 09:00 – 22:00 August 8, 2025: 7 IBC dosing 09:00 – 16:00 Dosing regimen is also provided in variable ‘dosing_onoff’ in data file: Carbondive_20250805_Hvalfjordur_intervention01_dosing.csv
Dosing data:	Dosing data include: flow rate, dosing rate, alkalinity dosing effluent density, and mineral mass addition rate. If any of these are constant, values may be provided here rather than in the dosing data file. To link dosing data that have been provided as vector data provide the source or filename. Please include whether dosing effluent density is directly measured or a derived value. See Intervention Data for details. Provide a description of the units for dosing rate provided as this will vary depending on the method.	Alkalinity dosing effluent density = 1050 kg/m3 density, measured directly. All other dosing data are provided in the current data submission in file: Carbondive_20250805_Hvalfjordur_intervention01_dosing.csv Variables include: flow_rate (L/s), and dosing_rate (mol/L). Dosing rate is provided in moles NaOH per L of effluent water.

Fields	Input descriptions for each metadata field are provided below, inputs shown in ‘quotes’ denote controlled vocabularies and must match the exact vocabulary provided. If a required field is not applicable to your project, put ‘NA’.	Illustrative example (not a real project).
Project ID:	The project to which the submitted data belong. A unique project identifier that can be used to link project data across data submissions, and link baseline data to intervention data, for example. If no Project ID has been assigned, one may be generated by combining the following fields, as described in Cross-linking Data Sets with Common Identifiers. Any method that creates a unique ID that will link all project data (e.g., a project’s baseline data to intervention data, and various data submissions within an experiment type) is acceptable.	Lead organizer: Carbon Dive Project beginning August 05, 2025 Hvalfjordur, Iceland Carbondive_20250805_Hvalfjordur
mCDR experiment type:	‘Baseline’, ‘control’, ‘intervention’, ‘model’, ‘other’ See Controlled Vocabularies section for definitions.	model
Experiment ID:	The experiment to which the data belong. Any naming convention that produces a unique ID is usable. A The recommended naming convention is: Project ID Experiment type Optional numerical indicator to differentiate between various experiments of the same type for a project. A two digit consecutive number beginning with 01	One of three model runs (nearfield, regional, global) output data provided (experiment type = model) for this project Carbondive_20250805_Hvalfjordur_model01
Investigators:	Provide details for each investigator including: Name, institutional information (name, address), phone, email, ID type (e.g., ORCID, etc), researcher ID, and role. The data submitter must provide contact information.	Jane Doe, University of Europa, 2345 Galactic Way, Centauri, CA, 98750, (+019-091-346-2938), ORCID, 9x938429x, data submitter.
Model configuration:	Links to model configuration files (e.g. roms_application.h, roms.in, and build_roms.sh files for a ROMS simulation)	https://github.com/parkermac/LO_roms_user/tree/main/upwelling
Model Physics Component
Name:	Name of model (e.g. ROMS, Oceananigans)	ucla-roms
Version:	Model release version	tag-1
Codebase:	Link to model code repository	https://github.com/CESR-lab/ucla-roms
Model physics description:	A description of the physical model characteristics, including version of equations being solved (hydrostatic vs non-hydrostatic), tracer advection scheme, how bottom drag is represented, mixed layer parameterizations, sub-grid mixing parameterizations if applicable, etc. Associated links to data, DOIs, or publications can be noted here, but should be supplemental.	The circulation model is a regional implementation of the Regional Ocean Modelling System (ucla-roms) Configured for the North Atlantic, centered on Iceland. The outer grid has a 3.3 km horizontal resolution and 100 vertical layers, while an inner nested grid has 40 m resolution and 100 vertical layers. ROMS is a free-surface, terrain-following, primitive equations ocean model, the hydrostatic primitive equations for momentum are solved using a split-explicit time-stepping scheme. All 2D and 3D equations are time-discretized using a third-order accurate predictor (Leap-Frog) and corrector (Adams-Molton) time-stepping algorithm. The primitive equations are discretized over variable topography using stretched terrain-following coordinates. The circulation model uses the 3rd-order upstream-biased (horizontal) and 4th-order centered differences (vertical) advection schemes for temperature and salinity. The model includes 12 freshwater inputs and is forced by the ERA5 atmospheric product (https://doi.org/10.1002/qj.3803) at the surface and by GLORYS at the boundaries. Vertical mixing is parameterized using the K-profile parameterization (KPP) from Large et al. 1994, and the air-sea interaction boundary layer in ROMS is based on the bulk parameterization of Fairall et al. (1996). Bathymetry is from GLORYS, the model T and S are initialized from GLORYS, and the model includes tides from the TPXO atlas.
References:	Reference for model physics description	https://doi.org/10.1016/j.ocemod.2004.08.002
Model BGC/Ecosystem Component
Name:	Name of BGC/Ecosystem component	MARBL
Version:	Version of BGC/Ecosystem component used	Cesm2.2-n00 (imbedded in C-Star)
Codebase:	Url link to where code can be found, the link to the specific commit (GitHub) or version should be provided.	https://github.com/marbl-ecosys/MARBL.git
Model BGC description:	A description of the biogeochemical/biological model characteristics, including which parameters are modeled explicitly, derived carbonate system parameters, advection scheme for biological tracers, CO₂ solver protocol (e.g., CO₂SYS), links to data/code with biological model parameters (e.g., growth and mortality rates), etc. Equations for each explicitly modeled parameter should be provided (can be links to publications), and it should be noted if any equations or parameter values (e.g. growth rates) were modified. Description and/or references of air-sea CO₂ flux parameterization used, gas transfer velocity formulation and atmospheric CO₂ details (e.g., fixed or time varying, and if time varying which data were used). Also include details on whether dissolution and precipitation of calcium carbonate are considered, how exchanges between sediment and overlying water are represented (if applicable), and whether active feedbacks between biological processes and the carbonate system are represented. Associated links to data, DOIs, or publications can be noted here, but should be supplemental.	The Marine Biogeochemistry Library (MARBL) is a prognostic ocean biogeochemistry model that simulates marine ecosystem dynamics and the coupled cycles of carbon, nitrogen, phosphorus, iron, silicon, and oxygen and is a component of the Community System Earth Model 2 (CESM2). The ecosystem includes multiple phytoplankton functional groups (diatoms, diazotrophs, small phytoplankton, and coccolithophores) and multiple potentially growth limiting nutrients (nitrate, ammonium, phosphate, silicate, and iron. There is one zooplankton group, dissolved organic material (semi-labile), sinking particulate pools and explicit simulation of the biogeochemical cycling of key elements (C, N, P, Fe, Si, O, plus alkalinity) (Moore et al. 2004). The ecosystem component is coupled with a carbonate chemistry module based on the Ocean Carbon Model Intercomparison Project (OCMIP) (Doney et al. 2009) allowing dynamic computation of surface ocean pCO₂ and air-sea CO₂ flux. Photoadaptation is calculated as a variable phytoplankton ratio of chlorophyll to nitrogen based on Geider et al. 1998. Phytoplankton N/P ratios are fixed at the Redfield value of 16, but the diazotroph group has a higher N/P atomic ratio of 50. The model parameterizes a prognostic phytoplankton calcifier in MARBL that is modeled on coccolithophore physiology (Krumhardt et al., 2019). The ratio of calcification to photosynthesis by the coccolithophore functional type is responsive to environmental conditions, where the calcification to photosynthesis ratio is a function of temperature, nutrients, and CO₂. Carbonate chemistry is explicit and there are two parallel carbonate systems including DIC and alkalinity tracers; applying fixed-preindustrial and time-evolving atmospheric CO₂ to these parallel systems enables cleanly computing anthropogenic CO₂ concentrations. MARBL computes burial and denitrification losses of material at the seafloor according to empirical relationships. Particulate organic carbon burial is computed using a relationship between burial efficiency and POC flux from Dunne et al. (2007), with an imposed maximum burial efficiency of 80%. Burial of SiO2 at the seafloor is based on observations in Ragueneau et al. (2000). In MARBL, 4% of Si incidents on the seafloor are buried, except where the incident flux of Si to the seafloor exceeds 2 mmol m−2 d−1; then, 20% of Si is buried. As described above, sedimentary denitrification depends on the incident POC flux and is computed based on an empirical relationship from Bohlen et al. (2012). Burial of CaCO3 on the ocean floor occurs where Ω > Ωcrit in the model’s bottom layer; where Ω < Ωcrit, all CaCO3 reaching the model’s bottom layer is dissolved. All CaCO3 is assumed to be calcite, thus ignoring the distinction between the mineral forms calcite and aragonite, which may be important in modulating dissolution depths (Gangstø et al., 2008). Air-sea CO₂ gas exchange is parameterised as a function of temperature (T) and wind speed (u10), and the concentration of the gas in the air (Ca) and in the surface water (Cw) in the form: F = k(u10 T)(Cw-Ca), where k is the gas transfer velocity. Gas transfer velocity is parameterized using the 4th order polynomial formulation of Wanninkhof (2014). Quadratic k₆₆₀ parameterisation is calibrated to give 16.5 cm/hr global average (recommended Naegler, 2009) for the ERA5 wind product by SeaFlux/Luke Gregor et al. (2023). Atmospheric CO₂ is assumed fixed and spatially uniform at 428 ppm.
References:	Links or DOIs to any reference(s) relevant to the model components/development, specific model configuration, model validation etc.	https://doi.org/10.1029/2021MS002647
Other model components:	Additional model components such as sea ice, sediment, atmosphere, etc., if applicable. These fields should repeat the same structure as for physics and biogeochemical model components (e.g., Name, Version, Codebase, Description, References).	Not applicable
Grid Details:
Grid type:	Descriptive structure of grid (e.g., latitude-longitude grid, unstructured triangular, tripolar). Georeferencing information must be included here.	Rectangular x-y grid with rotation: Central longitude: -19 Central latitude: 65 Grid Rotation: 20
Model region:	A description of the region modeled.	North Atlantic centered on Iceland
Spatial coverage:	Latitude/longitude bounds of observed data in experiment, provided in decimal degrees (negative for southern and western hemispheres) as westernmost longitude, southernmost latitude, easternmost longitude, northernmost latitude. [S, W, N, E]	[-20, 70, -17, 60]
Arrangement:	The grid arrangement of orthogonal physical quantities (e.g. Arakawa A, Arakawa B, Arakawa C)	Arakawa C-grid
Nx:	Number of x grid points	800
Ny:	Number of y grid points	800
Nz:	Number of vertical coordinate levels	100
N nodes:	Number of grid nodes (if using an unstructured grid)	5285
Horizontal resolution range:	Range of horizontal resolution (in m or km)	3.3 km (for the outer nest)
Vertical resolution range:	Range of vertical resolution (in meters)	Max. 4 m (topography following vertical grid)
Input Details:
Bathymetry:	Data source for bathymetry used (including links to data if available)	GLORYS bathymetry data (Copernicus Marine Service Product ID: GLOBAL_MULTIYEAR_ PHY_001_030) https://doi.org/10.48670/moi-00021
Initial conditions:	Data sources for initial conditions of all model state variables (including links to data if available)	Initial conditions from GLORYS (Copernicus Marine Service Product ID: GLOBAL_MULTIYEAR_ PHY_001_030) https://doi.org/10.48670/moi-00021
Boundary conditions:	Data source for boundary conditions for all model state variables (including links to data if available)	GLORYS (Copernicus Marine Service Product ID: GLOBAL_MULTIYEAR_ PHY_001_030) https://doi.org/10.48670/moi-00021
Atmospheric forcing:	Data sources for atmospheric forcing if applicable (including links to data if available). Examples of atmospheric forcing for physics models include wind fields, shortwave and longwave radiation, air temperature, and humidity. Examples of atmospheric forcing for biogeochemical models include atmospheric CO₂ and dust deposition.	ERA5 hourly (https://doi.org/10.24381/cds.143582cf)
Tidal forcing:	Data source for tidal forcing (including links to data if available)	TPXO atlas (https://www.tpxo.net/global/tpxo10-atlas)
River & sediment flux details:	Description of river and sediment flux data used to force the model (including links to data if available)	River fluxes for the inner nest sourced from the Icelandic Met Office (https://en.vedur.is/) for 12 rivers, no river fluxes used for the outer nest. No sediment fluxes applied.
Processing of input data:	If applicable, describe any processing of raw forcing and input data listed in the fields above.	NA
Experiment Details:
Spin-up protocol:	A description of the spin up process chosen for the model initiation, including an explanation for how appropriate spin up was defined to be achieved.	2 weeks per nest
Start date and time:	Start date and time of model experiment in UTC ISO-8601	2024-01-01T01:20:30Z
End date and time:	End date and time of model experiment in UTC ISO-8601	2034-01-23T01:20:30Z
Output frequency:	Time frequency at which model fields are saved (e.g. hourly mean, daily mean)	Monthly means
Time stepping scheme and parameters:	Method used to discretize time domain (e.g., Euler, Runge-Kutta, leapfrog) and time step used	Runge-Kutta scheme, 10 second for spin-up – up to 3 minutes per timestep for outer nest
Description of alkalinity addition:	A description of how alkalinity perturbation was applied in the model	Applied over multiple grid cells in initial conditions to ALK_ALT_CO₂ variable in MARBL (only in inner nest, no Alk experiment in outer nest)
Hardware Configuration (all recommended):
Machine:	Machine name of hardware used to run model	Perlmutter
Operating system:	Operating system of hardware used to run model	Linux
CPU/GPU details:	Details on CPU or GPU hardware	Details here: https://docs.nersc.gov/systems/perlmutter/architecture/#cpu-nodes
Memory:	Memory capacity of machine	512 GB of DDR4 memory total
Storage:	Storage capacity of operating system	44 PB
Parallelization:	Description of processors used in parallel, including number or processors and MPI version if used.	3 nodes and 108 ntasks per node

Trigger	Sector	Deadline to archive data
A peer-reviewed publication is accepted	Academic, non-profit, private	Upon publication (in compliance with FAIR data standards)
An Experiment has ended	Academic, non-profit	Within 4 months of the end of an Experiment or within 3 months* of the receipt of all Experimental data sample processing
Submission for verification	Private	Within a timeframe so that the data are available by the start of the verification period
It has been 1 year since the start of the Experiment or the last data submission for this Experiment	Academic, non-profit	Annual submissions are required, though 6 month cycles are ideal

Column Header Standards for General and Chemical Oceanographic Data
Abbreviation [unit]	Full unit	DP	Description
station_id	N/A	N/A	Station identification. Numerical Station_IDs without letters are recommended to facilitate future QC efforts.
cast_number	N/A	N/A	Cast number, where a cast is the lowering of equipment over the side at one station, e.g., CTD, net tow, etc. Cast_number should be sequential and restart with 1 for each station.
rosette_position	N/A	N/A	Rosette position refers to the position number around the CTD-rosette (e.g., 1 of a 1-12, or 1-24, or 1-36 number).
niskin_id	N/A	N/A	Niskin_ID is a unique alphanumeric identifier assigned to only that Niskin bottle over the duration of the expedition.
niskin_flag	N/A	N/A	Quality control flag for tracking problems with Niskin closure and integrity.
sample_id	N/A	N/A	A sample identifier (Sample_ID), which uniquely identifies a row of data during the subsequent QC and interpretation process, is often generated by concatenating the Station_ID, Cast_number, and Rosette_position, according to: Sample_ID = Station_ID × 10000 + Cast_number × 100 + Rosette_position. For example, at station 15, the 2nd cast, a Rosette_position of 3 will have a Sample_ID of 150203.
year_utc	YYYY	0	Calendar year in UTC when Niskin bottles at a specific depth are triggered
month_utc	MM	0	Calendar month in UTC when Niskin bottles at a specific depth are triggered
day_utc	DD	0	Calendar day in UTC when Niskin bottles at a specific depth are triggered
time_utc	HHMMSS	N/A	Time in UTC (hh:mm:ss) when Niskin bottles at a specific depth are triggered
yearday_utc	N/A	2	Yearday refers to the day number in an annual cycle. (e.g., 06:00 on Jan 1 means yearday = 1.25, 18:00 on Dec 31 means yearday = 366.75 in a leap year). Note, Yearday_UTC starts with 1, instead of 0. It can be calculated according to this equation: Yearday_UTC = datefunction(Year_UTC, Month_UTC, Day_UTC) – datefunction(Year_UTC, 1, 1) + Time_UTC + 1, where, “datefunction” is the date function of a program (e.g., in Excel, the data function would be “DATE”).
latitude	decimal degrees	4	Latitude in decimal degrees North (negative for southern hemisphere) when Niskin bottles at a specific depth are triggered
longitude	decimal degrees	4	Longitude in decimal degrees East (negative for western hemisphere) when Niskin bottles at a specific depth are triggered
depth_bottom	m	0	Bottom water depth of the sampling station
ctdpres	dbar	1	Hydrostatic pressure recorded from CTD at the depth where the sample is taken
depth	m	1	Depth at which a sample is taken. It can be approximated from CTDPRES and Latitude using the TEOS-10 equation.
salinity_pss78	N/A	3	Salinity calculated from conductivity measured from discrete bottles using the equation of the Practical Salinity Scale of 1978. Salinity_PSS78 is unitless.
oxygen	umol/kg	1	Dissolved oxygen (O2) content measured from discrete-bottle-based Winkler titration
dic	umol/kg	1	Total dissolved inorganic carbon content
ta	umol/kg	1	Total alkalinity content
ph_t_measured	N/A	4	pH measured on Total Scale (T) at measurement temperature and 1 atmosphere pressure (0 dbar applied pressure) using spectrophotometric methods. If the pH is measured on the seawater, free, or NBS scale, replace “T” with SWS, F, or NBS, respectively. For pH measurements made using electrodes, “pH_T_measured_electrode” should be used instead.
temp_ph	deg_C	2	Temperature at which the pH_t_measured value is measured
ph_t_insitu	N/A	4	pH on total scale at in situ temperature
carbonated_measured	umol/kg	1	Dissolved carbonate ion content ([CO32-]) at measurement temperature and 1 atmosphere pressure (0 dbar applied pressure).
temp_carbonate	deg_C	2	Temperature at which the Carbonate_measured value is measured
fco2_measured	uatm	1	Fugacity of carbon dioxide (fCO₂) in air that is in equilibrium with seawater measured from discrete bottles at measurement temperature and 1 atmosphere pressure (0 dbar applied pressure).
temp_fco2	deg_C	2	Temperature at which the fCO₂_measured value is measured
pco2_insitu			Partial pressure of carbon dioxide (pCO₂) at in situ temperature
omega_aragonite	none		Aragonite saturation state
omega_calcite	none		Calcite saturation state
pic	mol/m^3		Particulate inorganic carbon
poc	mg/m^3		Particulate organic carbon
toc	umol/kg		Total organic carbon (per mass)
toc_l	umol/L		Total organic carbon (per volume)
tic			Total inorganic carbon
pim	mg/L		Suspended particulate inorganic matter
pom	mg/L		Suspended particulate organic matter
spm	mg/L		Total suspended particulate matter
turbidity	NTU		turbidity
silicate	umol/kg	2	Silicate (total dissolved inorganic silicate: Si(OH)4, H4SiO4, SiO2, Sil) content
phosphate	umol/kg	2	Phosphate (total dissolved inorganic phosphate: H2PO4−, HPO42−, PO43−) content
nitrate	umol/kg	2	Nitrate (NO3-1) content. This term should not be used to indicate nitrate plus nitrite content, although the distinction is generally small because nitrate >> nitrite.
nitrite	umol/kg	2	Nitrite (NO2-1) content
nitrate_and_nitrite	umol/kg	2	Nitrate plus nitrite content
ammonium	umol/kg	2	Ammonium (NH4+ and NH3) content
wave_height	meters		The wave height at the station where the data were collected.
wind_speed	m/s		The wind speed at the station where the data were collected.

Column Header Standards for Sensor Data
Abbreviation	Unit	Description
year_utc	YYYY	Calendar year in Coordinated Universal Time (UTC)
month_utc	MM	Calendar month in Coordinated Universal Time (UTC)
day_utc	DD	Calendar day in Coordinated Universal Time (UTC)
time_utc	HHMMSS	Time in the format of hh:mm:ss
yearday_utc	N/A	Yearday refers to the day number in an annual cycle. (e.g., 12 pm on Jan 1 means Yearday = 1.50, 6 am on Dec 31 means Yearday = 366.25 in a leap year). Two digits after the decimal point are recommended.
latitude	decimal degree	Latitude in decimal degrees North (negative for Southern Hemisphere)
longitude	decimal degree	Longitude in decimal degrees East (negative for Western Hemisphere)
depth	meter	Depth (in meters) at which the sensor is located
temp_its90	degrees Celsius	In situ temperature recorded on the ITS-90 scale. If the temperature scale is IPTS-68, this term should be replaced with “temp_ipts68”.
sal_pss78	N/A	Salinity calculated from conductivity using the equation of the Practical Salinity Scale of 1978.
pressure_atm	hPa	Sea level atmospheric Pressure
pressure_atm_licor	hPa	Atmospheric pressure as recorded by LICOR
temperature_licor_its90	degrees Celsius	Temperature as recorded by LICOR
xco2_sw_wet	μmol/mol	Mole fraction of carbon dioxide in seawater in wet gas
xco2_atm_wet	μmol/mol	Mole fraction of carbon dioxide in air in wet gas
xh2o_sw	μmol/mol	Mole fraction of H2O in the headspace of the equilibrator
xh2o_atm	μmol/mol	Mole fraction of H2O in air
xco2_sw_dry	μmol/mol	Mole fraction of CO₂ in seawater in dry gas
xco2_atm_dry	μmol/mol	Mole fraction of CO₂ in air in dry gas
fco2_sw_sat	μatm	Fugacity of CO₂ in seawater at saturated water vapor pressure
fco2_atm_sat	μatm	Fugacity of CO₂ in air at saturated water vapor pressure
dfco2	μatm	Difference of fCO₂ in water and air (fCO₂_SW – fCO₂_Air)
doxy	μmol/kg	Dissolved oxygen measured from sensor
percent_o2	N/A	Percent O2 measurement made in equilibrated air
chl_stimf	mg/m^3	Chlorophyll-a derived from a calibrated in situ fluorometer
rhodamine_fl		Rhodamine concentration estimate from fluorescence

Column Header Standards for Underway pCO₂ Data
Abbreviation	Unit	Description
temperature_equ_its90	degree Celsius	Water temperature recorded in the equilibrator
pressure_equ	hPa	Pressure inside the headspace of the equilibrator. 1 hPa = 1 mbar.
xco2_equ	μmol/mol	Mole fraction of carbon dioxide (dry) inside the headspace of the equilibrator
xco2_atm	μmol/mol	Mole fraction of carbon dioxide (dry) in the atmosphere
xco2_atm_interpolated	μmol/mol	Interpolated atmospheric xCO₂ to match with water analyses time
fco2_sw_sst	μatm	Fugacity of seawater carbon dioxide at SST
fco2_atm_interpolated	μatm	Interpolated atmospheric fCO₂

Controlled Vocabularies for mCDR Pathway
mCDR Pathway	Definition
Ocean alkalinity enhancement	Ocean Alkalinity Enhancement (OAE) is a method to help mitigate climate change by increasing the alkalinity of seawater to enhance its capacity to absorb and store atmospheric carbon dioxide (CO₂).
Biomass Sinking	Biomass Sinking is a method that involves taking terrestrial or ocean biomass and sinking it into the deep ocean surface, subsurface, or anoxic basins, where it is sequestered. This can be accomplished by large-scale seaweed farming or macroalgae cultivation, which incorporates atmospheric CO2 as it grows, and then is sunk to the ocean floor. Alternatively, terrestrial plant biomass can be sunk to the ocean floor.
Direct ocean capture	Direct Ocean Capture (DOC) is a method that uses electrochemical processes to remove dissolved carbon dioxide (CO₂) directly from seawater for carbon storage or reuse.
Ocean nutrient fertilization	Ocean Fertilization is a method that involves adding nutrients, such as iron, nitrogen, or phosphorus, to the ocean to stimulate the growth of phytoplankton or other microscopic plants that absorb carbon dioxide (CO₂) through photosynthesis.
Artificial upwelling and downwelling	Artificial Upwelling and Downwelling are mCDR methods that involve manipulating ocean water movement to enhance natural carbon sequestration processes.
Marine ecosystem recovery	Marine Ecosystem Recovery refers to the restoration and protection of marine ecosystems to enhance their natural ability to capture and store carbon dioxide (CO₂). This method leverages the natural carbon-sequestering processes of marine habitats such as salt marshes, mangrove forests, coral reefs, kelp forests, seagrass meadows, oyster beds, and deep-sea ecosystems, aiming to rebuild biodiversity, ecosystem functions, and carbon storage capacity.

Controlled Vocabularies for Alkalinity Feedstock Processing
Alkalinity Feedstock Processing	Definition
Electrochemistry	Alkalinity generated via electrochemical processes (e.g., seawater electrolysis).
Synthetically derived	Intentionally industrially manufactured chemical compounds (e.g., Ca(OH)2 via lime kilns).
Mineral mining	Mined geological material, including purified mineral or natural rock.
Blended	A mix of multiple sources.
Other	Unclassified or novel; include a description in Experiment Description.

Controlled Vocabularies for mCDR Experiment Type
Experiment type	Definition
Baseline	Baseline refers to the initial set of data or conditions that are representative of the marine environment without interventions or modifications made. This baseline field data serve as a reference point for comparing intervention measurements, allowing for the assessment of the effectiveness and impacts of the interventions over time, such as changes in ocean alkalinity, CO₂ absorption, or ecosystem health.
Control	A control site refers to a designated area in proximity to an intervention site, with shared characteristic waters, but that remains unaffected by the intervention, serving as a ‘control’ for comparison during and following intervention. The purpose of a control site is to isolate and account for natural variability in oceanographic conditions, biogeochemical processes, and carbon fluxes, enabling the evaluation of changes directly attributable to intervention activities.
Intervention	An intervention refers to the intentional action or process applied to the ocean to alter its chemical or physical properties in order to enhance its capacity for carbon dioxide removal. This could include adding alkaline substances to the water or implementing other methods aimed at increasing ocean alkalinity and improving the ocean’s ability to absorb and store atmospheric CO₂.
Model	Model refers to the results or data generated by numerical or computational models.
Other	Novel or undefined experiments (such as specific socioeconomic experiments) should use ‘other’.

Controlled Vocabularies for Alkalinity Feedstock Form
Feedstock form	Definition
Solid	Involves adding alkaline minerals or particulate slurry (such as MgOH2, MgO, or CaO) to seawater or river systems either directly, through coastal outfalls (such as wastewater), or at breaking shorelines to increase its alkalinity.
Aqueous	Aqueous alkalinity addition may use electrochemistry or fully dissolved mineral feedstock to increase seawater alkalinity.
Slurry	Slurry alkalinity additions include a mix of solid and aqueous alkalinity forms, where the solid alkaline particulates are suspended in a solution.

Controlled Vocabularies for Dosing Delivery Type
Dosing delivery type	Definition
Static point source	A single dosing location such as an outflow from a static platform with a pipe
Variable point source	A mobile dosing regimen described by a single location at each time step, such as an outflow from a mobile platform such as a ship or surface vessel.
Static distributed	A set location or locations of dosing that is not a point source, such as a distributed area over the seafloor or a diffusor.
Variable distributed	A distributed dosing area that varies in time, such as manually placed alkaline material over different areas at different times.

Controlled Vocabularies for mCDR Data Types
Data Types	Definition
dosing	Variables such as dosing_onoff, dosing_rate, and flow_rate should be included here.
cast	Vertical profiles (e.g., optical packages, CTD)
bottle	Any other types of measurements from water samples collected at discrete depths (e.g., nutrients)
flow_thru	Continuous data (e.g., shipboard, underway flow through system)
pigment	For laboratory measured pigment data (e.g. fluorometry, spectrophotometry, HPLC)
marine_snow_catcher	For various types of marine snow catcher data
mooring	Moored and buoy data
drifter	Drifter and drogue data
airborne	Measurements made via an aircraft
diver	For measurements made by a diver
auv	Measurements made by an autonomous underwater vehicle
asv	Measurements made by an autonomous surface vehicle
experimental	Measurements that have a non-geospatial aspect (e.g., incubations or other laboratory measurements, etc.)
sediment_trap	Measurements from a sediment trap platform
taxonomy	Data whose purpose is the classification or annotation of phytoplankton, zooplankton, or other marine groups.
sediment	Measurements from sediment samples (e.g., core samples, grab samples)
model_output	Data output from model experiments
socioeconomic	Information (quantitative or qualitative) from socioeconomic studies
net_tow	For measurements captured via net (e.g., zooplankton via MOCNESS)

Controlled Vocabularies for Platform Type
Research Vessel	A research vessel is a specialized type of ship or boat that is designed and equipped for oceanographic research. It often has autonomous sensors onboard and laboratories with scientific equipment for analyzing samples, and various other facilities to support research operations at sea.
Ship of Opportunity (SOOP)	Ships of opportunity (SOOP) are not specifically designed for oceanographic research but are used to collect scientific data from autonomous sensors opportunistically. They can be cargo vessels, container ships, or other types of vessels that travel predetermined routes across the ocean.
Vessel	Generic term to describe a ship that is not a SOOP or research vessel.
Mooring	A mooring is a collection of instruments used to measure oceanographic variables over an extended period of time at a fixed station. These mooring systems typically comprise a surface or subsurface buoy, to which the instruments are affixed, and a weighted anchor connected by a line.
Drifting buoy	Drifting buoys are devices that float on the ocean surface, allowing them to follow the current. Typically, these buoys are equipped with a “drogue” – a device like a parachute or sheet – which enables them to be dragged along by the current.
Profiling float	E.g., Argo floats are a type of profiling float, consisting of a cylindrical body that contains sensors for measuring ocean properties and inflatable bladders that allow the float to change its buoyancy and move up and down through the water column. Profiling floats drift with ocean currents and surface periodically to transmit data via satellite.
Surface glider	A surface glider is an autonomous, uncrewed surface vehicle (USV) operating at a single depth near the surface using wave or solar energy for propulsion. Example: wave gliders.
Sub-surface glider	Sub-surface gliders are a type of autonomous underwater vehicle (AUV) that moves through the water using changes in buoyancy and wings to control its movement.
Autonomous Surface Vehicle	A self-propelled surface vehicle operating on the sea surface with no human occupants. Example: Saildrone.
Benthic chamber	A sealed platform placed on the seafloor to measure chemical and biological exchanges between sediments and overlying water.
Sediment trap	A platform used to collect sinking particles in the ocean to measure vertical fluxes of material like organic carbon.

Controlled vocabularies for Instrument type
CTD rosette	A CTD rosette consists of a metal frame that houses a collection of sensors and water sampling bottles (e.g., Niskin).
CTD sensor	The acronym CTD stands for Conductivity, Temperature, and Depth, which are the three primary variables measured by a CTD sensor.
Niskin bottle	A Niskin bottle is a type of sampling device used in oceanography to collect water samples at different depths. It is named after the inventor, Shale Niskin, who developed the device in the 1960s.
Flow-through system	A flow-through system on a research vessel or ship of opportunity is a system designed to continuously pump seawater from the ocean into the laboratory for scientific research.
Thermosalinograph	A Thermosalinograph (TSG) is an instrument used to measure seawater temperature and salinity.
Salinometer for discrete salinity measurement	Salinometers work based on the principle of conductivity. They measure the electrical conductivity of the water, which is directly related to its salinity.
DIC analyzers based on Coulometers	DIC coulometers are widely used in oceanographic research to measure the concentration of dissolved inorganic carbon in seawater samples. They are often coupled with computer-controlled automated dynamic headspace analyzers that extracts total carbon dioxide from seawater using Single-Operator Multiparameter Metabolic Analyzers (SOMMAs)
DIC analyzers based on CO₂ gas detectors	N.A. DIC analyzers based on a CO₂ gas detector including Non-dispersive infrared absorption (NDIR) (e.g., Licor LI-850), Cavity Enhanced Absorption Spectroscopy (e.g., Licor’s LI-7815), and Cavity Ring-Down Spectroscopy (CRDS) (e.g., Picarro G2131i) detectors.
Autonomous DIC sensor	Autonomous dissolved inorganic carbon (DIC) sensors are devices that can measure the concentration of DIC in seawater or other natural waters in situ, without the need for manual sampling and laboratory analysis.
Alkalinity titrator	An alkalinity titrator is a device used to measure the total alkalinity of a seawater by titration.
Autonomous TA sensor	Autonomous total alkalinity (TA) sensors are devices that can measure the concentration of TA in seawater or other natural waters in situ, without the need for manual sampling and laboratory analysis.
Showerhead equilibrator	This type of equilibrator works by spraying seawater into a gas chamber, allowing the CO₂ in the water to equilibrate with a gas mixture in the chamber.
Floating air-water equilibrator	An “h”-shaped bubble equilibrator assembly commonly used in MAPCO2 systems on moorings. For more information, refer to Friederich et al. (1995).
Membrane equilibrator	While seawater is passed through a membrane, CO₂ in the water diffuses across the membrane and equilibrates with the gas mixture, which is then analyzed to determine the CO₂ concentration.
Flask for discrete carbon dioxide measurement	Such flasks are typically made of glass and have a capacity of around one liter. Seawater samples are collected from a specific depth using a Niskin bottle or other sampling device and transferred to the flask without exposing them to the air. The flask is then sealed with a stopper and transported to the laboratory for analysis.
Spectrophotometer	A spectrophotometer is a scientific instrument used to measure the amount of light absorbed or transmitted by a sample. It is commonly used for high quality pH measurements.
Handheld pH spectrophotometer	One example of a handheld pH spectrophotometer is the “pHyter”. Refer to Pardis et al. (2022) for more details.
pH electrode	A pH electrode, sometimes referred to as a pH probe or pH sensor, is a glass device used to measure the pH of a solution.
pH non-electrode	An oceanographic instrument that is used to measure the pH of seawater in real-time based on FET or other novel technology.
Oxygen titrator	An oxygen titrator is a device used to measure the concentration of dissolved oxygen in a water sample, as required for the Winkler method.
Oxygen sensor	An oxygen sensor or probe or sonde, is an electronic device that measures the concentration of dissolved oxygen in the ocean.
YSI	YSI (Yellow Springs Instruments) is a company that produces a variety of water quality monitoring instruments. The YSI sensors are designed to measure a wide range of parameters, including temperature, salinity, and dissolved oxygen.
Nutrient analyzer	A nutrient analyzer is a device used to measure the concentration of nutrients, such as nitrate, nitrite, ammonium, phosphate, and silicate, in water samples.
Fluorometers	Fluorometers can detect chlorophyll by transmitting an excitation beam of light and detecting the light fluoresced chlorophyll molecules in a sample.
High performance liquid chromatography	High performance liquid chromatography (HPLC) is a powerful analytical technique used in chemistry, biochemistry, and pharmaceutical industries to separate, identify, and quantify individual components in a mixture.
Acoustic Doppler Current Profiler	Acoustic Doppler Current Profiler (ADCP), is a type of instrument used to measure water currents in oceans, rivers, and other bodies of water.
Mass spectrometers	A mass spectrometer is an analytical instrument used to measure and identify the mass and abundance of atoms and molecules in a sample.
Isotope ratio mass spectrometers	An isotope ratio mass spectrometer (IRMS) is a scientific instrument used to measure the isotopic composition of a sample.
Barometric pressure sensor	A barometric pressure sensor is a device that measures atmospheric pressure, which is the pressure exerted by the weight of the Earth’s atmosphere.
Microscopes	A microscope is an instrument used to observe and magnify objects that are too small to be seen by the naked eye.
Scanning Electron Microscopes	A scanning electron microscope (SEM) is a type of microscope that uses a focused beam of electrons to create high-resolution images of the surface of a specimen.
Biological trawl	A biological trawl is a type of fishing net that is towed behind a boat to collect marine organisms from the water column.
Phytoplankton net	Phytoplankton net is used to collect and identify phytoplankton, which are microscopic plants that form the base of the marine food web.
Zooplankton net	Zooplankton net is used to collect and identify zooplankton, which are microscopic animals that feed on phytoplankton and are important prey for many marine organisms.
Flow cytometers	A flow cytometer is a scientific instrument used to analyze and sort cells or particles in a liquid suspension based on their physical and chemical properties.
eDNA sampler	Environmental DNA (eDNA) samplers: used to collect and analyze genetic material shed by marine organisms, which can provide information about their distribution, abundance, and diversity.

Date	Version	Revision Description	Notes
03/17/2025	1.0.0	First published document following revisions from open public review during January 23, 2025 – March 07, 2025	Summary of feedback and responses
01/23/2025	0.1.0	Original draft document published for public review

OAE Data Management Protocol

The OAE Data Management Protocol outlines recommendations for producing consistent data and metadata for Ocean Alkalinity Enhancement (OAE) research projects.

Versions

Table of Contents

Background

Introduction

Objectives & Guiding Principles

Methodology

Intended Users

Acronyms and Abbreviations

Key Acronyms and Abbreviations

Definitions of Selected Terms

Metadata and Templates

How to fill out your metadata

Project Metadata

mCDR Project Metadata

Experiment Metadata

Experiment Metadata

Dataset Metadata

Model Metadata

Model Metadata

Guidelines for Data Management

When, Where, and How to Submit Data

When to submit data:

Where to archive data:

Where to archive code:

How to submit data files:

General Guidelines for your Data

Requirements:

Recommendations:

In Situ Sensor Data

Requirements:

Recommendations:

Model Data

Requirements:

Recommendations:

Intervention Data

Requirements:

Recommendations:

Sediment Processes Data

Requirements:

Recommendations:

Derived Variables

Carbonate Parameters

TEOS-10 Calculations

Cross-linking Data Sets with Common Identifiers

Project ID

Experiment ID

Emerging Standards

Column Header Names

General and Chemical Oceanographic Variables

Column Header Standards for General and Chemical Oceanographic Data

Sensor-observed Variables

Column Header Standards for Sensor Data

Underway pCO₂ Variables

Column Header Standards for Underway pCO₂ Data

Model Output Variables

Controlled Vocabularies

mCDR Pathways

Controlled Vocabularies for mCDR Pathway

mCDR Experiment Type

Controlled Vocabularies for mCDR Experiment Type

Alkalinity Feedstock Processing

Controlled Vocabularies for Alkalinity Feedstock Processing

Alkalinity Feedstock Form

Controlled Vocabularies for Alkalinity Feedstock Form

Dosing Delivery Type

Controlled Vocabularies for Dosing Delivery Type

mCDR Data Type

Controlled Vocabularies for mCDR Data Types

Platform Type

Controlled Vocabularies for mCDR Data Types

Instrument Type

Controlled vocabularies for Instrument type

Deprecated Standards and Practices

Update definitions:

Triggers for updates:

Acknowledgements

Coordination Team

Data Initiative Steering Committee