Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhancement: Provide unmerged dump ordered by directory name #831

Open
amaltaro opened this issue Jul 29, 2024 · 0 comments
Open

Enhancement: Provide unmerged dump ordered by directory name #831

amaltaro opened this issue Jul 29, 2024 · 0 comments

Comments

@amaltaro
Copy link

Enhancement Description

The WM MSUnmerged service consumes RSE Unmerged dump from the Rucio Consistency Monitoring system, an API like:
https://cmsweb.cern.ch/rucioconmon/unmerged/files?rse=T1_ES_PIC_Disk&format=json

It would be helpful if the list of files would be grouped/ordered by directory name, such that it's easier to iterate through the list of files to be considered for deletion, in addition to consuming less resources.

Use Case

To better deal with memory footprint in the WM MSUnmerged, where it can be many GBs of memory for large json dumps (common for FNAL_Disk and T2_CH_CERN).

Possible Solution

Perhaps consuming a format different than the JSON object, such that we can have a better control of what we load into memory. Still, if data is not ordered, it can be extremely inefficient both for the service and for the storage.

Related Issues

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant