zcollection.merging.perform

zcollection.merging.perform(ds_inserted, dirname, axis, fs, partitioning_dim, *, delayed=True, merge_callable, synchronizer=None, **kwargs)

Merges a new dataset with an existing partition.

Parameters:
  • ds_inserted (Dataset) – The dataset to merge.

  • dirname (str) – The name of the partition.

  • axis (str) – The axis to merge on.

  • fs (AbstractFileSystem) – The file system on which the partition is stored.

  • partitioning_dim (str) – The partitioning dimension.

  • delayed (bool) – If True, the existing dataset is loaded lazily. Defaults to True.

  • merge_callable (MergeCallable | None) – The merge callable. If None, the inserted dataset overwrites the existing dataset stored in the partition. This keyword-only argument has no default in the signature, so it must be passed explicitly.

  • synchronizer (Sync | None) – The instance handling access to critical resources. Defaults to None.

  • **kwargs – Additional keyword arguments are passed through to the merge callable.

Return type:

None
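
Example:

The sketch below is a toy model of the behaviour described by the parameters above. It is not zcollection's implementation and does not call the library. Partitions are plain dictionaries held in memory, and the names toy_perform, concat_sorted, store and descending are hypothetical. The argument order given to the merge callable (existing data, inserted data, axis, partitioning dimension, then keyword arguments) is an assumption made for illustration. The sketch also assumes the partition already exists.

```python
from typing import Any, Callable, Optional

# Hypothetical stand-in for a partition: variable name -> list of values.
Partition = dict[str, list[float]]
# Assumed callable shape: (existing, inserted, axis, partitioning_dim, **kwargs).
ToyMerge = Callable[..., Partition]


def toy_perform(
    store: dict[str, Partition],
    inserted: Partition,
    dirname: str,
    axis: str,
    partitioning_dim: str,
    *,
    merge_callable: Optional[ToyMerge],
    **kwargs: Any,
) -> None:
    """Toy model: update the partition `dirname` of an in-memory store."""
    if merge_callable is None:
        # No merge strategy: the inserted data overwrites what is stored.
        store[dirname] = inserted
        return
    # Extra keyword arguments are forwarded unchanged to the merge callable.
    store[dirname] = merge_callable(
        store[dirname], inserted, axis, partitioning_dim, **kwargs
    )


def concat_sorted(existing, inserted, axis, partitioning_dim, *, descending=False):
    """Hypothetical strategy: concatenate, then sort every variable along `axis`.

    `partitioning_dim` is accepted to match the assumed shape but is unused.
    """
    combined = {name: existing[name] + inserted[name] for name in existing}
    order = sorted(
        range(len(combined[axis])),
        key=combined[axis].__getitem__,
        reverse=descending,
    )
    return {name: [values[i] for i in order] for name, values in combined.items()}


store = {"year=2024": {"time": [1, 3], "value": [10.0, 30.0]}}
toy_perform(
    store,
    {"time": [2], "value": [20.0]},
    "year=2024",
    "time",
    "num_lines",
    merge_callable=concat_sorted,
)
print(store["year=2024"])
```

Passing merge_callable=None in the same call would replace the stored partition with the inserted data. Any extra keyword, such as the hypothetical descending=True, reaches the merge callable untouched.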