DataFrameSteps

This step object builds DataFrameReader and DataFrameWriter objects and provides steps for loading, saving, persisting, repartitioning, and sorting DataFrames. The following step functions are provided:

Get DataFrameReader

Given a DataFrameReaderOptions object, this step will build a DataFrameReader. Full parameter descriptions are listed below:

dataFrameReaderOptions - DataFrameReaderOptions object used to configure the DataFrameReader.
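As a rough illustration of what building a reader from an options object can look like in plain Spark, here is a minimal sketch. `ReaderOptionsSketch` and `ReaderSketch.buildReader` are hypothetical names used only for this example; the real DataFrameReaderOptions class may carry different fields.

```scala
import org.apache.spark.sql.{DataFrameReader, SparkSession}

// Hypothetical stand-in for DataFrameReaderOptions; the real class may differ.
final case class ReaderOptionsSketch(
  format: String = "parquet",
  options: Map[String, String] = Map.empty
)

object ReaderSketch {
  // Apply the format and every key/value option to a fresh DataFrameReader.
  def buildReader(spark: SparkSession, opts: ReaderOptionsSketch): DataFrameReader =
    spark.read.format(opts.format).options(opts.options)
}
```

The reader is only configured here; no data is read until a load is requested.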

Given a DataFrameReaderOptions object, this step will load a DataFrame. Full parameter descriptions are listed below:

Given a DataFrameReader object, this step will load a DataFrame. Full parameter descriptions are listed below:
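Loading from an already configured reader comes down to a single Spark call. The sketch below assumes an optional `path` argument, which is an illustration and not a documented parameter of this step; `LoadSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.{DataFrame, DataFrameReader}

object LoadSketch {
  // Load from an explicit path when one is given (assumed parameter);
  // otherwise rely on the options already set on the reader.
  def load(reader: DataFrameReader, path: Option[String] = None): DataFrame =
    path match {
      case Some(p) => reader.load(p)
      case None    => reader.load()
    }
}
```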

Given a DataFrameWriterOptions object, this step will build a DataFrameWriter[_]. Full parameter descriptions are listed below:

dataFrameWriterOptions - DataFrameWriterOptions object used to configure the DataFrameWriter.
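A writer can be configured in much the same way as a reader. The following sketch assumes the writer is derived from an existing DataFrame; `WriterOptionsSketch` and its fields are hypothetical stand-ins for DataFrameWriterOptions, whose actual shape may differ.

```scala
import org.apache.spark.sql.{DataFrame, DataFrameWriter, Row}

// Hypothetical stand-in for DataFrameWriterOptions; the real class may differ.
final case class WriterOptionsSketch(
  format: String = "parquet",
  saveMode: String = "overwrite",
  options: Map[String, String] = Map.empty,
  partitionBy: Seq[String] = Seq.empty
)

object WriterSketch {
  // Configure format, save mode, and options; add partition columns only when present.
  def buildWriter(df: DataFrame, opts: WriterOptionsSketch): DataFrameWriter[Row] = {
    val base = df.write.format(opts.format).mode(opts.saveMode).options(opts.options)
    if (opts.partitionBy.nonEmpty) base.partitionBy(opts.partitionBy: _*) else base
  }
}
```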

Given a DataFrame and DataFrameWriterOptions object, this step will save a DataFrame. Full parameter descriptions are listed below:

Given a DataFrameWriter[_] object, this step will save a DataFrame. Full parameter descriptions are listed below:
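Saving through a configured writer can be sketched as follows. The optional `path` argument is an assumption made for this example, and `SaveSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.DataFrameWriter

object SaveSketch {
  // Save to an explicit path when one is given (assumed parameter);
  // otherwise rely on the options already set on the writer.
  def save(writer: DataFrameWriter[_], path: Option[String] = None): Unit =
    path match {
      case Some(p) => writer.save(p)
      case None    => writer.save()
    }
}
```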

Given a DataFrame object and an optional storage level, this step will persist the data. Full parameter descriptions are listed below:
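In plain Spark terms, persisting with an optional storage level might look like the sketch below. Passing the level as a string name and falling back to Spark's default when it is absent are both assumptions of this example; `PersistSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.storage.StorageLevel

object PersistSketch {
  // Use the named storage level when provided; otherwise use Spark's default.
  def persist(df: DataFrame, storageLevel: Option[String] = None): DataFrame =
    storageLevel match {
      case Some(name) => df.persist(StorageLevel.fromString(name))
      case None       => df.persist()
    }
}
```

A call such as `PersistSketch.persist(df, Some("MEMORY_ONLY"))` marks the DataFrame for caching; the data is materialized the first time an action runs.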

Mark the DataFrame as non-persistent and remove all blocks for it from memory and disk. Full parameter descriptions are listed below:
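The corresponding Spark call is `unpersist`. In this sketch the `blocking` flag is an assumed parameter that controls whether the call waits until all blocks are removed; `UnpersistSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.DataFrame

object UnpersistSketch {
  // Drop the cached blocks; blocking = true waits until removal completes.
  def unpersist(df: DataFrame, blocking: Boolean = false): DataFrame =
    df.unpersist(blocking)
}
```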

Repartition the DataFrame to have the provided number of partitions. Full parameter descriptions are listed below:

dataFrame - The DataFrame to repartition.
partitions - The desired number of partitions.
rangePartition - Optional flag to indicate whether partitionByRange should be used. Defaults to false.
shuffle - Optional flag to indicate whether a shuffle needs to be performed. Defaults to true.
partitionExpressions - Optional list of expressions used to sort data into partitions.
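One way the parameters above could map onto Spark calls is sketched here. The mapping is an assumption, in particular the use of `coalesce` when shuffle is false and the fallback when rangePartition is set without any expressions; `RepartitionSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.expr

object RepartitionSketch {
  def repartition(
      df: DataFrame,
      partitions: Int,
      rangePartition: Boolean = false,
      shuffle: Boolean = true,
      partitionExpressions: List[String] = Nil
  ): DataFrame = {
    // Parse each SQL expression string into a Column.
    val columns = partitionExpressions.map(expr)
    if (rangePartition && columns.nonEmpty) {
      // Range partitioning needs at least one expression to sort on.
      df.repartitionByRange(partitions, columns: _*)
    } else if (shuffle && columns.nonEmpty) {
      df.repartition(partitions, columns: _*)
    } else if (shuffle) {
      df.repartition(partitions)
    } else {
      // Assumed mapping: without a shuffle, only merge existing partitions.
      df.coalesce(partitions)
    }
  }
}
```

Note that `coalesce` can only reduce the partition count, so in this sketch a request for more partitions with shuffle disabled would leave the count unchanged.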

Sort the DataFrame based on the provided list of expressions. Full parameter descriptions are listed below:

dataFrame - The DataFrame to sort.
expressions - The list of sort expressions.
descending - Optional flag to indicate whether sort order should be descending. When true, will apply the desc function to each provided expression. Defaults to false.
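The sort behavior described above can be sketched with plain Spark calls. This example assumes the expressions are SQL expression strings; `SortSketch` is a hypothetical name.

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.expr

object SortSketch {
  def sortDataFrame(
      df: DataFrame,
      expressions: List[String],
      descending: Boolean = false
  ): DataFrame = {
    // Parse each expression and apply desc to every one when requested.
    val columns = expressions.map(e => if (descending) expr(e).desc else expr(e))
    df.sort(columns: _*)
  }
}
```

Because the flag applies to every expression, mixed ascending and descending orders are not expressible through it in this sketch.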