ESS-DIVE Data Package Structure To Do #31

Closed
7 of 8 tasks
stephpenn1 opened this issue Sep 26, 2022 · 7 comments

@stephpenn1
Member

stephpenn1 commented Sep 26, 2022

Note: The deadline for ESS-DIVE submission is October 15.

End Product: ESS-DIVE data package with corrected Level 2 datasets, split into multiple zip files (one zip per sample type: soil, sediment, water). Each zip contains multiple files (e.g. pH, TCTN) or folders (e.g. CDOM, FTICR).

What the data package would look like, with separate zip files per Analyte:

[Screenshot of the proposed data package layout]

Metadata

The Rest

@stephpenn1
Member Author

stephpenn1 commented Sep 26, 2022

Each data file should meet the following criteria:

  • File name: [Campaign]_[Sample Type]_[Analyte]_L2.csv
  • Missing values are NA
  • Kit IDs are sorted in ascending order (least to greatest)
  • Flag columns state reason for each NA value
  • File is in flat CSV format
  • No duplicate Kit_IDs
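
The criteria above could be checked automatically before files go into the zips. The following is a minimal sketch, not the project's actual QA script. It assumes the ID column is named `Kit_ID`, that each analyte column has a matching flag column with a `_flag` suffix, that kit IDs are zero-padded (e.g. K001) so plain string sorting matches numeric order, and that the file-name pattern allows only letters, digits, and hyphens. All of these are assumptions for illustration, as is the campaign name `EC1` in the usage note.

```python
import csv
import io
import re

# Assumed pattern for [Campaign]_[Sample Type]_[Analyte]_L2.csv. The allowed
# characters in each part are a guess, not a documented project rule.
NAME_RE = re.compile(r"^[A-Za-z0-9]+_[A-Za-z]+_[A-Za-z0-9-]+_L2\.csv$")


def check_l2_file(filename, text, id_col="Kit_ID", flag_suffix="_flag"):
    """Return a list of problems found in one L2 CSV (empty list = passes)."""
    problems = []
    if not NAME_RE.match(filename):
        problems.append("file name does not match [Campaign]_[Sample Type]_[Analyte]_L2.csv")

    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    if not rows:
        return problems + ["file has no data rows"]
    columns = reader.fieldnames
    if id_col not in columns:
        return problems + [f"missing {id_col} column"]

    ids = [r[id_col] or "" for r in rows]
    if len(ids) != len(set(ids)):
        problems.append("duplicate Kit_IDs")
    # String sort is only valid because IDs are assumed to be zero-padded.
    if ids != sorted(ids):
        problems.append("Kit_IDs are not sorted least to greatest")

    flag_cols = [c for c in columns if c.endswith(flag_suffix)]
    data_cols = [c for c in columns if c != id_col and c not in flag_cols]
    for r in rows:
        for c in data_cols:
            value = r[c]
            if value is None or value.strip() == "":
                problems.append(f"{r[id_col]}: {c} is blank, expected NA")
            elif value == "NA":
                # Every NA needs a reason in the (assumed) matching flag column.
                flag = r.get(c + flag_suffix)
                if not flag or flag == "NA":
                    problems.append(f"{r[id_col]}: {c} is NA with no flag reason")
    return problems
```

For example, `check_l2_file("EC1_Water_pH_L2.csv", open(path).read())` would return an empty list for a file that meets every criterion, and a list of human-readable problems otherwise.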

stephpenn1 pinned this issue Oct 4, 2022
@stephpenn1
Member Author

stephpenn1 commented Oct 21, 2022

Checklist for SP putting into folders

METADATA

  • kit-level metadata
  • collection-level metadata
  • collection-level data
  • igsn metadata

WATER

  • Basic Water Quality (conductivity, pH, ORP, Alkalinity)
  • Total Suspended Solids
  • Major Anions (fluoride, chloride, nitrate, nitrite, sulfate, bromide, phosphate)
  • Major Cations (lithium, sodium, ammonium, potassium, magnesium, calcium)
  • Dissolved organic carbon (NPOC/TDN datastream)
  • Total dissolved nitrogen (NPOC/TDN datastream)
  • High-resolution mass spectrometry (FTICR-MS)
  • Colored Dissolved Organic Matter (CDOM)

SEDIMENT

  • Oxygen concentration time-series
  • GHGs
  • Gravimetric water content
  • Loss on ignition

SOIL

@amyerspigg
Collaborator

amyerspigg commented Nov 1, 2022

ESS-DIVE submission checklist
https://ess-dive.lbl.gov/archive/

  • Clean and organize datasets (see original issue post and steps for this)
  • Compile submission metadata
  • Dataset metadata (required to publish on ESS-DIVE); reporting format: https://github.com/ess-dive-community/reporting-format-template-repo
  • Dataset Title
  • DOIs
  • Abstract
  • Keywords
  • Data Variables
  • Publication Date
  • Data Usage Rights
  • Project Affiliation
  • Funding Organizations
  • DOE contracts
  • Related References
  • Dataset Contact
  • Dataset Creators
  • Dataset Contributors
  • Dates (start and end date)
  • Location Description
  • Bounding Box Coordinates
  • Methods ("You may provide a citation for any related methods used that have been previously published, but we strongly recommend still including methods text in your dataset metadata that fully describe data collection and processing steps")

@amyerspigg
Collaborator

@kaizadp / @peterregier who was doing the LOI check? I see several kits with no data for LOI that I THINK we should have samples for (based on the filtering of collected/not collected script PR wrote). All sediment:
K001
K007
K029
K051
K052
K056
K058

@amyerspigg
Collaborator

Metadata @stephpenn1
K046 transition
K014 wetland

@peterregier

This comment was marked as resolved.

@amyerspigg
Collaborator

Ok cool. I can't find the checklist assignments since we didn't do them in GitHub.
