Deletion Overview
Deletion in Rucio can be performed in broadly two ways.
-
Rule based deletion:
Each rule has a corresponding lifetime associated with it (default is None). The attribute
expires_at
of the rule is the time when the lifetime is set plus the lifetime associated with the rule. Additionally each rule can belocked
for additional protection. If a rule is locked the rule is protected from expiration. TheJudge-cleaner
daemon is responsible for handling expired rules. If an expired rule is not locked it setstombstones
on all replicas not covered by the rule anymore (or any other rule). These replicas then become eligible for deletion. -
DID based deletion:
Additionally, each DID also has a lifetime associated with it (default is None). The metadata
expires_at
of the DID is calculated similarly to a rule. TheUndertaker
daemon is responsible for handling expired DIDs. The daemon checks if none of the associated rules to the DID are locked, if not it removes all rules of the DID (see section above). The DID itself is then removed from the catalog.
The DID expiration overrules the rule expiration. But the locked rules are protected.
After the tombstone is set for replicas, the actual data deletion is done by the Reaper
daemon. The reaper physically deletes the tombstoned replicas from storage.
The deletion service supports two different modes: greedy and non-greedy.
-
Greedy
The reaper daemon gets all the replicas with tombstone in the RSE and immediately deletes all replicas.
-
Non-greedy
The reaper daemon first checks if the free space is needed in the RSE. The needed free space is the difference of minimum free space (attribute set for RSE) and actual free space in RSE. Deletion only occurs once free space is needed. Deletions are processed by Least Recently Used (LRU) algorithm, thus oldest accessed (tombstoned) replicas are deleted first.