This repository has been archived by the owner on Jun 2, 2023. It is now read-only.

Check on preferred column names for temperature observations #38

Open
lekoenig opened this issue Sep 16, 2022 · 2 comments

@lekoenig
Collaborator

The 2022 forecasting data release contains both aggregated (by NHM segment-date) and unaggregated temperature observations, both of which have columns for min_temp_c, mean_temp_c, and max_temp_c. As a result, these are the columns we carry through when we create p2_drb_temp_obs_by_comid, which is a data frame containing one row of temperature data per COMID-date.

Janet pointed out that river-dl expects a column called temp_c, so to run river-dl she renames mean_temp_c to temp_c. This issue is a reminder to check with others who run river-dl whether switching to mean_temp_c in that workflow would cause any issues. If so, we should adjust the column names used in our pipeline to match what river-dl expects.
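
For reference, the rename step could look roughly like the sketch below. It assumes the observations are held as a list of row dictionaries; the helper `rename_column` and the sample row (including its COMID, date, and temperature values) are made up for illustration and are not part of our pipeline or river-dl.

```python
def rename_column(rows, old, new):
    """Return copies of `rows` with the key `old` renamed to `new`."""
    renamed = []
    for row in rows:
        if old not in row:
            raise KeyError(f"column {old!r} not found in row")
        if new in row:
            raise ValueError(f"column {new!r} already exists in row")
        # Rebuild the row so the column order is preserved.
        renamed.append({(new if key == old else key): value
                        for key, value in row.items()})
    return renamed


# Illustrative row only; the values are invented.
temp_obs = [
    {"COMID": "1001", "date": "2020-07-01",
     "min_temp_c": 18.2, "mean_temp_c": 20.1, "max_temp_c": 22.4},
]

obs_for_river_dl = rename_column(temp_obs, "mean_temp_c", "temp_c")
```

The input rows are left unchanged, so the pipeline output can keep mean_temp_c while the copy handed to river-dl uses temp_c.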

@janetrbarclay
Collaborator

Just to clarify, the expected temp column name for river-dl is specified in the config file, so we could change what we use without causing any errors or changes to the behavior for any other project. It's just a question of whether there would be confusion / perceived inconsistencies if what's really the same input variable has different names in different models.
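
To illustrate the idea of a config-driven column name, here is a minimal sketch. The key `temp_col`, the function `get_temperature`, and the sample row are hypothetical and only stand in for however river-dl's config actually names and reads this setting.

```python
# Hypothetical config entry; the real river-dl key name may differ.
example_config = {"temp_col": "mean_temp_c"}


def get_temperature(rows, config):
    """Pull the temperature values from whichever column the config names."""
    col = config["temp_col"]
    missing = [i for i, row in enumerate(rows) if col not in row]
    if missing:
        raise KeyError(f"column {col!r} missing from rows {missing}")
    return [row[col] for row in rows]


# Illustrative row only; the values are invented.
example_rows = [
    {"COMID": "1001", "date": "2020-07-01", "mean_temp_c": 20.1},
]

temps = get_temperature(example_rows, example_config)
```

With this pattern, switching between temp_c and mean_temp_c is a one-line config change and no data needs to be renamed.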

@msleckman
Collaborator

@lekoenig I believe we can close this now?
