Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add cleanupData command to nrt_utils #733

Merged
merged 1 commit into from
Sep 30, 2024

Conversation

aprudhomme
Copy link
Contributor

Porting of the legacy incrementalDataCleanup to use the new structure.

Added cleanupData command to remove all the unneeded index data files in S3. The command remove all nrt point state older than the specified threshold. It then removes all the index data files older than the threshold and not referenced by the retained point states. Only the first and last point states need to be loaded to find the active index files of note. The additional files for the points in between will not be older than the threshold (a grace period is added for safety).

@aprudhomme aprudhomme merged commit 1e75d95 into Yelp:main Sep 30, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants