gs_rsync
Synchronize content of two buckets/directories
Description
The gs_rsync
function makes the contents under destination
the same as the contents under source
, by copying any missing files/objects (or those whose data has changed), and (if the delete
option is specified) deleting any extra files/objects. source
must specify a directory, bucket, or bucket subdirectory.
Usage
gs_rsync(source, destination, delete = FALSE, recursive = FALSE,
parallel = TRUE, dry_run = FALSE, options = NULL, echo = TRUE)
Arguments
Arguments | Description |
---|---|
source | The file to be copied. This can be either a path on the local filesystem, or a Google Storage URI (e.g. gs://[BUCKET_NAME]/[FILENAME.CSV] ). |
destination | The location where the source file should be copied to. This can be either a path on the local filesystem, or a Google Storage URI (e.g. gs://[BUCKET_NAME]/[FILENAME.CSV] ). |
delete | Delete extra files under destination not found under source By default extra files are not deleted. |
recursive | Causes directories, buckets, and bucket subdirectories to be synchronized recursively. If you neglect to use this option gs_rsync() will make only the top-level directory in the source and destination URLs match, skipping any sub-directories. |
parallel | Causes synchronization to run in parallel. This can significantly improve performance if you are performing operations on a large number of files over a reasonably fast network connection. |
dry_run | Causes rsync to run in “dry run” mode, i.e., just outputting what would be copied or deleted without actually doing any copying/deleting. |
options | Character vector of additional command line options to the gsutil rsync command (as specified at https://cloud.google.com/storage/docs/gsutil/commands/rsync). |
echo | Echo command output to console. |