dataset_map_and_batch

Fused implementation of dataset_map() and dataset_batch()

Description

Maps `map_func`` across batch_size consecutive elements of this dataset and then combines them into a batch. Functionally, it is equivalent to map followed by batch. However, by fusing the two transformations together, the implementation can be more efficient.

Usage

 
dataset_map_and_batch( 
  dataset, 
  map_func, 
  batch_size, 
  num_parallel_batches = NULL, 
  drop_remainder = FALSE, 
  num_parallel_calls = NULL 
)

Arguments

Arguments	Description
dataset	A dataset
map_func	A function mapping a nested structure of tensors (having shapes and types defined by `output_shapes()` and `output_types()` to another nested structure of tensors. It also supports `purrr` style lambda functions powered by `rlang::as_function()`.
batch_size	An integer, representing the number of consecutive elements of this dataset to combine in a single batch.
num_parallel_batches	(Optional) An integer, representing the number of batches to create in parallel. On one hand, higher values can help mitigate the effect of stragglers. On the other hand, higher values can increase contention if CPU is scarce.
drop_remainder	(Optional.) A boolean, representing whether the last batch should be dropped in the case it has fewer than `batch_size` elements; the default behavior is not to drop the smaller batch.
num_parallel_calls	(Optional) An integer, representing the number of elements to process in parallel If not specified, elements will be processed sequentially.

dataset_map_and_batch

Fused implementation of dataset_map() and dataset_batch()

Description

Usage

Arguments

See Also