Catalog-Level Configuration
Catalog-level maintenance sets the default behavior for all tables in a catalog. Resources must be configured before you can enable any operations.
Step 1: Opening the Catalog Maintenance Tab
- Go to the Spark Catalogs page:
- Platform Admins: Admin Portal > Spark Catalogs
- Domain Admins: Domain > Settings > Spark Catalogs
- Open a qualifying catalog (see Prerequisites).
- Click the Maintenance tab.


Maintenance controls are disabled until an owner domain is assigned. See Catalog Owner Domain to assign one.


Step 2: Configuring Resources
The maintenance service requires a compute cluster and a service account to run operations.
- In the Resources section, select a Compute cluster from the dropdown. The list shows clusters that belong to the catalog's owner domain.
- Select a Service Account from the dropdown. The list shows all service accounts in the domain.
- Click Save to apply the resource configuration.


Once resources are saved, the Maintenance Operations section becomes active.
- The compute cluster must be active when a maintenance job runs. If it's stopped or disabled, the operation fails.
- The service account must have
CONSUMEpermission on the chosen compute cluster. Otherwise, the save is rejected with a permission error. See Granting Access to Users and Groups to assign the required permission. - The service account must have write access on the tables included in maintenance. Without it, maintenance operations on those tables will fail.
- Reassigning the owner domain for a catalog disables maintenance and clears all configured resources. Re-enable maintenance and reconfigure resources after the change.
Recommended Compute Resources
Rewrite Data Files and Rewrite Manifest Files run as Spark SQL jobs on the configured compute cluster. Under-resourced clusters cause operations to run slowly or fail entirely. Minimum recommended settings:
| Component | CPU | Memory |
|---|---|---|
| Driver | ≥ 0.5 vCPU | ≥ 0.5 GiB |
| Executor | ≥ 0.5 vCPU | ≥ 0.5 GiB |
- Executor count: ≥ 1
- Autoscaling: Enabled (with a scale-down delay of at least 5 minutes to avoid premature shutdown mid-job)
Step 3: Configuring Operations
- Toggle Enable maintenance to ON. This is the master switch for the entire catalog.
- For each of the four operations, set the toggle to Enabled or Disabled:
- Expire Snapshots: removes old snapshots to free storage and improve metadata performance.
- Rewrite Data Files: compacts small files and optimizes data layout for better query performance.
- Rewrite Manifest Files: optimizes manifest files for faster query planning.
- Cleanup Orphan Files: removes files no longer referenced by table metadata.
- To configure operation-specific thresholds, expand Advanced Settings on any enabled operation card and add the properties you want to override. See Advanced Configuration for all available options.
- Click Save Operations to save. Click Reset to discard unsaved changes.


Catalog Owner Domain
Every catalog that uses maintenance must have an owner domain assigned. The owner domain determines which compute clusters and service accounts are available for maintenance jobs. Resources are always scoped to a domain, so the catalog must belong to one before any maintenance configuration is possible.
To assign an owner domain:
- Open the catalog in Admin Portal > Spark Catalogs.
- Select the catalog.
- Go to the Permissions tab.
- Click the
⋮(three-dot menu) next to the domain and select Set as catalog owner.

