Add tests for integrity of merged contexts #26

cthoyt · 2022-10-26T12:58:47Z

#15 introduced data integrity tests, but did not apply them to the merged contexts, because there seem to be some issues with the logic that generates them.

The merged and merged.oak strings should be removed from the following (effectively leaving skip = set()), along with whatever updates are needed to generate merged contexts with bijectivity guarantees.

prefixmaps/tests/test_core/test_integrity.py

Line 20 in 433da1f

skip = {"merged", "merged.oak"}
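For illustration, a minimal sketch of how a skip set like this can gate a per-context test loop. The `CONTEXTS` data, the `example` context, and the class name are hypothetical stand-ins, not the contents of the actual test module.

```python
import unittest

# Hypothetical stand-in data: context name -> canonical (prefix, namespace) pairs.
CONTEXTS = {
    "example": [("ex", "http://example.org/")],
    "merged": [("schema", "http://schema.org/")],
}

# Currently {"merged", "merged.oak"} in the real test; this issue proposes emptying it.
SKIP = set()


class TestIntegritySketch(unittest.TestCase):
    def test_canonical_bijective(self):
        for name, pairs in CONTEXTS.items():
            if name in SKIP:
                continue  # contexts listed in SKIP are never checked
            with self.subTest(context=name):
                prefixes = [p for p, _ in pairs]
                namespaces = [n for _, n in pairs]
                # Bijective means no prefix and no namespace appears twice.
                self.assertEqual(len(prefixes), len(set(prefixes)))
                self.assertEqual(len(namespaces), len(set(namespaces)))
```

With `SKIP` empty, every context (including `merged`) goes through the same check.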


cmungall · 2022-12-19T16:40:16Z

If we don't skip these, then

    def test_namespace_aliases(self):
        """Test that prefix aliases have a valid namespace."""

will fail, but this is expected.

this is because we have:

merged,sdo,https://schema.org/,namespace_alias

from prefix.cc, which is junk: the correct semantic URL uses http, not https, so there is no direct deterministic join point with the correct canonical prefix or IRI:

merged,schema,http://schema.org/,canonical

so this entry is effectively orphaned

now, we could extend the test to do a transitive walk over aliases and use the following as a join point:

merged,schema,https://schema.org/,prefix_alias

but this would be overkill and unnecessary, because the fundamental assumption here doesn't hold: the merged context does have integrity. It provides the correct bijective mapping for schema.org, according to the precedence rules by which the merged context is built (with the junky prefix.cc having the lowest priority).
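To make the join-point idea concrete, here is a rough sketch of what such a transitive walk might look like over the three rows quoted above. The function names and the linking rule (rows are joined when they share a prefix or a namespace) are assumptions for illustration, not how the existing test works.

```python
# The three rows quoted in this comment: (context, prefix, namespace, status).
ROWS = [
    ("merged", "schema", "http://schema.org/", "canonical"),
    ("merged", "schema", "https://schema.org/", "prefix_alias"),
    ("merged", "sdo", "https://schema.org/", "namespace_alias"),
]


def direct_join(prefix, rows):
    """Return a canonical prefix sharing a namespace with `prefix`, or None."""
    canonical = {ns: p for _, p, ns, status in rows if status == "canonical"}
    for _, p, ns, _ in rows:
        if p == prefix and ns in canonical:
            return canonical[ns]
    return None


def transitive_join(prefix, rows):
    """Walk prefix -> namespace -> prefix links until a canonical prefix is found."""
    canonical_prefixes = {p for _, p, _, status in rows if status == "canonical"}
    seen_prefixes, seen_namespaces = {prefix}, set()
    frontier = [prefix]
    while frontier:
        current = frontier.pop()
        if current in canonical_prefixes:
            return current
        for _, p, ns, _ in rows:
            if p != current or ns in seen_namespaces:
                continue
            seen_namespaces.add(ns)
            # Any other prefix that uses this namespace becomes reachable.
            for _, other, other_ns, _ in rows:
                if other_ns == ns and other not in seen_prefixes:
                    seen_prefixes.add(other)
                    frontier.append(other)
    return None
```

Here `direct_join("sdo", ROWS)` finds nothing, since https://schema.org/ is not a canonical namespace, while `transitive_join("sdo", ROWS)` reaches `schema` through the prefix_alias row.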

The thing to remember here is that the API exposes only the canonical mappings; the rest is there for debugging purposes. It could be argued that a cleaner design would be for the CSVs to include only the canonical mappings for that context, and to put the ancillary metadata that arises from the ETL elsewhere. A separate issue could be made for this if it's a priority (it would only matter for making the library easier to understand, as it wouldn't affect users of the library).
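As a sketch of that point, filtering the rows above down to the canonical entries yields a mapping that round-trips in both directions. The `canonical_maps` helper and the four-column layout (context, prefix, namespace, status) are assumptions based on the rows shown in this comment, not the library's actual loader.

```python
import csv
import io

CSV_TEXT = """\
merged,schema,http://schema.org/,canonical
merged,schema,https://schema.org/,prefix_alias
merged,sdo,https://schema.org/,namespace_alias
"""


def canonical_maps(text):
    """Build prefix->namespace and namespace->prefix dicts from canonical rows only."""
    forward, reverse = {}, {}
    for _context, prefix, namespace, status in csv.reader(io.StringIO(text)):
        if status != "canonical":
            continue  # alias rows are debugging metadata in this sketch
        if prefix in forward or namespace in reverse:
            raise ValueError(f"not bijective: {prefix} / {namespace}")
        forward[prefix] = namespace
        reverse[namespace] = prefix
    return forward, reverse


forward, reverse = canonical_maps(CSV_TEXT)
```

Under these assumptions the alias rows never reach the resulting dictionaries, so the orphaned `sdo` entry has no effect on the mapping itself.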

Fixes #26

cthoyt mentioned this issue Oct 26, 2022

Contexts don't guarantee bi-directional mapping #11

Open

3 tasks

nlharris added the testing label Oct 27, 2022

cmungall added a commit that referenced this issue Dec 19, 2022

Added internal documentation on why some contexts skip some tests.

8be83b1

Fixes #26

cmungall mentioned this issue Dec 19, 2022

Added internal documentation on why some contexts skip some tests. #35

Merged

sierra-moxon closed this as completed in #35 Mar 15, 2023

cthoyt mentioned this issue Dec 7, 2023

Updating to Pydantic v2 monarch-initiative/monarch-app#495

Merged

vorburger mentioned this issue Jun 28, 2024

Schema.org recommends https, but validation fails because this requires http #73

Open
