Class

org.apache.spark.rdd.mergejoin.MergeJoin

SpillableJoiner

Related Doc: package MergeJoin

Permalink

abstract class SpillableJoiner[K, V, W, Out] extends Joiner[K, V, W, Out] with Logging

Base implementation for spillable merge-join Joiner that contains a default implementation for inner that will accumulate all right values into a spillable collection.

Exposes an emit method to allow each implementation to format the output tuple accordingly

Linear Supertypes
Logging, Joiner[K, V, W, Out], Serializable, Serializable, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. SpillableJoiner
  2. Logging
  3. Joiner
  4. Serializable
  5. Serializable
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new SpillableJoiner(context: TaskContext)

    Permalink

Abstract Value Members

  1. abstract def emit(key: K, left: V, right: W): Out

    Permalink

    key

    key value to emit

    left

    left value to emit

    right

    right value to emit

    returns

    output value for a single row, eg: (K, (V, W)) or (K, (V, Option[W]))

  2. abstract def leftOuter(itr: Iterator[(K, V)]): Iterator[Out]

    Permalink

    Emit values for left-side only key

    Emit values for left-side only key

    Definition Classes
    Joiner
  3. abstract def rightOuter(itr: Iterator[(K, W)]): Iterator[Out]

    Permalink

    Emit values for right-side only key

    Emit values for right-side only key

    Definition Classes
    Joiner

Concrete Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. def closeKey(): Unit

    Permalink

    Called after all values for a key has been emitted.

    Called after all values for a key has been emitted.

    This will be called after each 'inner' key join, and once more on task completion to ensure we cleanup any intermediate spill files and release memory back to the executor when we are done with this key.

    Attributes
    protected
  7. var currentKey: K

    Permalink
    Attributes
    protected
  8. var currentSpillable: ExternalSorter[Int, W, W]

    Permalink
    Attributes
    protected
  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  11. def finalize(): Unit

    Permalink
    Definition Classes
    SpillableJoiner → AnyRef
  12. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  14. val includeSpillMetrics: Boolean

    Permalink
    Attributes
    protected
  15. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  16. def inner(key: K, left: Iterator[V], right: Iterator[W]): Iterator[Out]

    Permalink

    Emit both left and right values for the given key

    Emit both left and right values for the given key

    Definition Classes
    SpillableJoinerJoiner
  17. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  18. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  19. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  20. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  21. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  22. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  23. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  24. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  25. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  26. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  27. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  28. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  29. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  30. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  31. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  32. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  33. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  34. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  35. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  36. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  37. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  38. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Logging

Inherited from Joiner[K, V, W, Out]

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped