nxcals.api.extraction.data.builders.DataFrame.localCheckpoint

DataFrame.localCheckpoint(eager: bool = True) DataFrame

Returns a locally checkpointed version of this DataFrame. Checkpointing can be used to truncate the logical plan of this DataFrame, which is especially useful in iterative algorithms where the plan may grow exponentially. Local checkpoints are stored in the executors using the caching subsystem and therefore they are not reliable.

Added in version 2.3.0.

Parameters:

eager (bool, optional, default True) – Whether to checkpoint this DataFrame immediately.

Returns:

Checkpointed DataFrame.

Return type:

DataFrame

Notes

This API is experimental.

Examples

>>> df = spark.createDataFrame([
...     (14, "Tom"), (23, "Alice"), (16, "Bob")], ["age", "name"])
>>> df.localCheckpoint(False)
DataFrame[age: bigint, name: string]