nxcals.api.extraction.data.builders.DataFrame.crossJoin

DataFrame.crossJoin(other: DataFrame) → DataFrame

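Returns the Cartesian product with another DataFrame: each row of this DataFrame is paired with every row of other, so the result has as many rows as the product of the two row counts.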

New in version 2.1.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters:

other (DataFrame) – Right side of the Cartesian product.

Returns:

Joined DataFrame.

Return type:

DataFrame

Examples

>>> from pyspark.sql import Row
>>> df = spark.createDataFrame(
...     [(14, "Tom"), (23, "Alice"), (16, "Bob")], ["age", "name"])
>>> df2 = spark.createDataFrame(
...     [Row(height=80, name="Tom"), Row(height=85, name="Bob")])
>>> df.crossJoin(df2.select("height")).select("age", "name", "height").show()
+---+-----+------+
|age| name|height|
+---+-----+------+
| 14|  Tom|    80|
| 14|  Tom|    85|
| 23|Alice|    80|
| 23|Alice|    85|
| 16|  Bob|    80|
| 16|  Bob|    85|
+---+-----+------+
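
The pairing behind the table above can be modelled without Spark. The following is a minimal sketch in plain Python, not the Spark implementation: the names left, right, and pairs are hypothetical, and itertools.product stands in for the join. It only illustrates why 3 rows joined with 2 rows yield 6 rows; Spark itself does not guarantee any particular row order in the result.

```python
from itertools import product

# Hypothetical stand-ins for the two DataFrames in the example above.
left = [(14, "Tom"), (23, "Alice"), (16, "Bob")]  # columns: age, name
right = [(80,), (85,)]                            # column: height

# Pair every left row with every right row and concatenate the columns,
# which is the row-level meaning of a Cartesian product.
pairs = [l + r for l, r in product(left, right)]

# The row count is the product of the input row counts.
assert len(pairs) == len(left) * len(right)

for age, name, height in pairs:
    print(age, name, height)
```

Because the output grows multiplicatively, a cross join of two large DataFrames can become very expensive; selecting only the needed columns first, as the example does with df2.select("height"), keeps the result narrower but does not reduce the number of rows.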