com.thoughtworks.datacommons.prepbuddy.rdds

TransformableRDD

class TransformableRDD extends AbstractRDD

Linear Supertypes
AbstractRDD, RDD[String], Logging, Serializable, Serializable, AnyRef, Any

Instance Constructors

  1. new TransformableRDD(parent: RDD[String], fileType: FileType = ...)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. def ++(other: RDD[String]): RDD[String]

    Definition Classes
    RDD
  5. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  6. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  7. def addColumnsFrom(otherRDD: TransformableRDD): TransformableRDD

    Zips the other TransformableRDD with this TransformableRDD and returns a new TransformableRDD in the current file format. Both TransformableRDDs must have the same number of records.

    otherRDD

    The TransformableRDD whose columns will be added to this TransformableRDD

    returns

    TransformableRDD
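
    The column-wise zip described above can be modelled on local collections. This is a minimal sketch rather than the library's implementation: the object name, the helper name and the comma delimiter are assumptions made for illustration.

```scala
// Illustrative model on local collections; not the prep-buddy implementation.
object AddColumnsSketch {
  // Hypothetical helper: appends the i-th record of `right` to the i-th record of `left`.
  // The "," default delimiter is an assumption (CSV-like records).
  def addColumns(left: Seq[String], right: Seq[String], delimiter: String = ","): Seq[String] = {
    // Mirrors the documented precondition: both sides need the same number of records.
    require(left.size == right.size, "both datasets must have the same number of records")
    left.zip(right).map { case (l, r) => l + delimiter + r }
  }
}
```

    For example, `AddColumnsSketch.addColumns(Seq("a,1", "b,2"), Seq("x", "y"))` yields `Seq("a,1,x", "b,2,y")`.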

  8. def aggregate[U](zeroValue: U)(seqOp: (U, String) ⇒ U, combOp: (U, U) ⇒ U)(implicit arg0: ClassTag[U]): U

    Definition Classes
    RDD
  9. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  10. def cache(): TransformableRDD.this.type

    Definition Classes
    RDD
  11. def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(String, U)]

    Definition Classes
    RDD
  12. def checkpoint(): Unit

    Definition Classes
    RDD
  13. def clearDependencies(): Unit

    Attributes
    protected
    Definition Classes
    RDD
  14. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  15. def clusters(columnIndex: Int, clusteringAlgorithm: ClusteringAlgorithm): Clusters

    Returns the Clusters of similar text values found in @columnIndex, formed according to @clusteringAlgorithm

    columnIndex

    Column Index

    clusteringAlgorithm

    Algorithm to be used to form clusters

    returns

    Clusters

  16. def coalesce(numPartitions: Int, shuffle: Boolean)(implicit ord: Ordering[String]): RDD[String]

    Definition Classes
    RDD
  17. def collect[U](f: PartialFunction[String, U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  18. def collect(): Array[String]

    Definition Classes
    RDD
  19. val columnLength: Int

    Attributes
    protected
    Definition Classes
    AbstractRDD
  20. def compute(split: Partition, context: TaskContext): Iterator[String]

    Definition Classes
    AbstractRDD → RDD
    Annotations
    @DeveloperApi()
  21. def context: SparkContext

    Definition Classes
    RDD
  22. def count(): Long

    Definition Classes
    RDD
  23. def countApprox(timeout: Long, confidence: Double): PartialResult[BoundedDouble]

    Definition Classes
    RDD
  24. def countApproxDistinct(relativeSD: Double): Long

    Definition Classes
    RDD
  25. def countApproxDistinct(p: Int, sp: Int): Long

    Definition Classes
    RDD
  26. def countByValue()(implicit ord: Ordering[String]): Map[String, Long]

    Definition Classes
    RDD
  27. def countByValueApprox(timeout: Long, confidence: Double)(implicit ord: Ordering[String]): PartialResult[Map[String, BoundedDouble]]

    Definition Classes
    RDD
  28. def deduplicate(primaryKeyColumns: List[Int]): TransformableRDD

    Returns a new TransformableRDD containing only the unique records of this TransformableRDD, treating the given columns as the primary key.

    primaryKeyColumns

    A list of integers specifying the columns that will be combined to create the primary key

    returns

    TransformableRDD A new TransformableRDD consisting of unique records.
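
    Deduplicating on a composite key can be sketched with plain Scala collections. The code below is an assumption-laden illustration, not the library's code: it assumes comma-delimited records and keeps the first record seen for each key, which the documentation does not specify.

```scala
import java.util.regex.Pattern
import scala.collection.mutable

// Illustrative model on local collections; not the prep-buddy implementation.
object DeduplicateSketch {
  // Hypothetical helper: keeps one record per distinct primary key.
  def deduplicate(records: Seq[String], primaryKeyColumns: List[Int], delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    // LinkedHashMap preserves insertion order, so output order follows the input.
    val firstByKey = mutable.LinkedHashMap.empty[List[String], String]
    records.foreach { record =>
      val fields = record.split(splitter, -1)
      val key = primaryKeyColumns.map(i => fields(i))
      // Assumption: the first record seen for a key is the one that survives.
      if (!firstByKey.contains(key)) firstByKey(key) = record
    }
    firstByKey.values.toList
  }
}
```

    With column 0 as the key, `DeduplicateSketch.deduplicate(Seq("1,a", "2,b", "1,c"), List(0))` yields `Seq("1,a", "2,b")`.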

  29. def deduplicate(): TransformableRDD

    Returns a new TransformableRDD containing only the unique records of this TransformableRDD, treating all the columns as the primary key.

    returns

    TransformableRDD A new TransformableRDD consisting of unique records.

  30. final def dependencies: Seq[Dependency[_]]

    Definition Classes
    RDD
  31. def distinct(): RDD[String]

    Definition Classes
    RDD
  32. def distinct(numPartitions: Int)(implicit ord: Ordering[String]): RDD[String]

    Definition Classes
    RDD
  33. def drop(columnIndex: Int, columnIndexes: Int*): TransformableRDD

    Returns a new TransformableRDD by dropping @columnIndex and any additional @columnIndexes

    columnIndex

    The column that will be dropped.

    columnIndexes

    Any additional columns to drop.

    returns

    TransformableRDD

  34. def duplicates(primaryKeyColumns: List[Int]): TransformableRDD

    Returns a new TransformableRDD containing unique duplicate records of this TransformableRDD by considering the given columns as primary key.

    primaryKeyColumns

    A list of integers specifying the columns that will be combined to create the primary key

    returns

    TransformableRDD A new TransformableRDD consisting of unique duplicate records.
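
    "Unique duplicate records" can be read as one representative for every key that occurs more than once. The sketch below models that reading on local collections. It is an illustration under assumptions (comma delimiter, first occurrence chosen as the representative), not the library's implementation.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object DuplicatesSketch {
  // Hypothetical helper: returns one record for each primary key that appears more than once.
  def duplicates(records: Seq[String], primaryKeyColumns: List[Int], delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    def keyOf(record: String): List[String] = {
      val fields = record.split(splitter, -1)
      primaryKeyColumns.map(i => fields(i))
    }
    // groupBy keeps input order inside each group; the order of groups is unspecified.
    records.groupBy(keyOf).values.filter(_.size > 1).map(_.head).toList
  }
}
```

    With column 0 as the key, `DuplicatesSketch.duplicates(Seq("1,a", "2,b", "1,c", "3,d"), List(0))` yields `Seq("1,a")`.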

  35. def duplicates(): TransformableRDD

    Returns a new TransformableRDD containing duplicate records of this TransformableRDD by considering all the columns as primary key.

    returns

    TransformableRDD A new TransformableRDD consisting of unique duplicate records.

  36. def duplicatesAt(columnIndex: Int): RDD[String]

    Returns a new RDD containing the duplicate values at the specified column

    columnIndex

    Column in which to look for duplicates

    returns

    RDD

  37. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  38. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  39. def filter(f: (String) ⇒ Boolean): RDD[String]

    Definition Classes
    RDD
  40. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  41. def first(): String

    Definition Classes
    RDD
  42. def firstParent[U](implicit arg0: ClassTag[U]): RDD[U]

    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  43. def flag(symbol: String, markerPredicate: (RowRecord) ⇒ Boolean): TransformableRDD

    Returns a new TransformableRDD that contains records flagged by @symbol based on the evaluation of @markerPredicate

    symbol

    Symbol that will be used to flag

    markerPredicate

    A predicate that determines whether to flag a row or not

    returns

    TransformableRDD
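
    Flagging can be pictured as adding a marker column. The sketch below works on local collections and makes several assumptions that the documentation does not confirm: the flag is appended as a new last column, unflagged rows get an empty value there, records are comma-delimited, and an `IndexedSeq[String]` stands in for `RowRecord`.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object FlagSketch {
  // Hypothetical helper: appends `symbol` to rows matching the predicate, "" otherwise.
  def flag(records: Seq[String],
           symbol: String,
           markerPredicate: IndexedSeq[String] => Boolean,
           delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    records.map { record =>
      // IndexedSeq[String] is a stand-in for the library's RowRecord.
      val fields = record.split(splitter, -1).toIndexedSeq
      val marker = if (markerPredicate(fields)) symbol else ""
      record + delimiter + marker
    }
  }
}
```

    For example, `FlagSketch.flag(Seq("a,1", "b,2"), "*", fields => fields(1) == "2")` yields `Seq("a,1,", "b,2,*")`.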

  44. def flatMap[U](f: (String) ⇒ TraversableOnce[U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  45. def fold(zeroValue: String)(op: (String, String) ⇒ String): String

    Definition Classes
    RDD
  46. def foreach(f: (String) ⇒ Unit): Unit

    Definition Classes
    RDD
  47. def foreachPartition(f: (Iterator[String]) ⇒ Unit): Unit

    Definition Classes
    RDD
  48. def getCheckpointFile: Option[String]

    Definition Classes
    RDD
  49. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  50. def getDependencies: Seq[Dependency[_]]

    Attributes
    protected
    Definition Classes
    RDD
  51. final def getNumPartitions: Int

    Definition Classes
    RDD
    Annotations
    @Since( "1.6.0" )
  52. def getPartitions: Array[Partition]

    Attributes
    protected
    Definition Classes
    AbstractRDD → RDD
  53. def getPreferredLocations(split: Partition): Seq[String]

    Attributes
    protected
    Definition Classes
    RDD
  54. def getStorageLevel: StorageLevel

    Definition Classes
    RDD
  55. def glom(): RDD[Array[String]]

    Definition Classes
    RDD
  56. def groupBy[K](f: (String) ⇒ K, p: Partitioner)(implicit kt: ClassTag[K], ord: Ordering[K]): RDD[(K, Iterable[String])]

    Definition Classes
    RDD
  57. def groupBy[K](f: (String) ⇒ K, numPartitions: Int)(implicit kt: ClassTag[K]): RDD[(K, Iterable[String])]

    Definition Classes
    RDD
  58. def groupBy[K](f: (String) ⇒ K)(implicit kt: ClassTag[K]): RDD[(K, Iterable[String])]

    Definition Classes
    RDD
  59. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  60. val id: Int

    Definition Classes
    RDD
  61. def impute(columnIndex: Int, strategy: ImputationStrategy, missingHints: List[String]): TransformableRDD

    Returns a new TransformableRDD by imputing missing values, and values matching @missingHints, in @columnIndex using @strategy

    columnIndex

    Column Index

    strategy

    Imputation Strategy

    missingHints

    List of strings that should be treated as missing values

    returns

    TransformableRDD
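
    To make the role of @missingHints concrete, here is a sketch of one possible strategy, mean substitution, on local collections. The strategy, the helper name and the comma delimiter are assumptions for illustration. The available ImputationStrategy implementations are not listed on this page.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object ImputeSketch {
  // Hypothetical helper: replaces empty values and values listed in `missingHints`
  // with the mean of the remaining values in the column.
  def imputeWithMean(records: Seq[String],
                     columnIndex: Int,
                     missingHints: List[String],
                     delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    val rows = records.map(_.split(splitter, -1))
    def isMissing(value: String): Boolean = value.isEmpty || missingHints.contains(value)
    val present = rows.map(_(columnIndex)).filterNot(isMissing).map(_.toDouble)
    require(present.nonEmpty, "at least one non-missing value is needed to compute a mean")
    val mean = present.sum / present.size
    rows.map { fields =>
      val value = if (isMissing(fields(columnIndex))) mean.toString else fields(columnIndex)
      fields.updated(columnIndex, value).mkString(delimiter)
    }
  }
}
```

    For example, `ImputeSketch.imputeWithMean(Seq("a,10", "b,N/A", "c,20", "d,"), 1, List("N/A"))` yields `Seq("a,10", "b,15.0", "c,20", "d,15.0")`.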

  62. def impute(column: Int, strategy: ImputationStrategy): TransformableRDD

    Returns a new TransformableRDD by imputing missing values of @column using the @strategy

    column

    Column index

    strategy

    Imputation strategy

    returns

    TransformableRDD

  63. def inferType(columnIndex: Int): DataType

    Returns the inferred DataType of @columnIndex

    columnIndex

    Column index whose type will be inferred

    returns

    DataType

    Definition Classes
    AbstractRDD
  64. def intersection(other: RDD[String], numPartitions: Int): RDD[String]

    Definition Classes
    RDD
  65. def intersection(other: RDD[String], partitioner: Partitioner)(implicit ord: Ordering[String]): RDD[String]

    Definition Classes
    RDD
  66. def intersection(other: RDD[String]): RDD[String]

    Definition Classes
    RDD
  67. def isCheckpointed: Boolean

    Definition Classes
    RDD
  68. def isEmpty(): Boolean

    Definition Classes
    RDD
  69. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  70. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  71. final def iterator(split: Partition, context: TaskContext): Iterator[String]

    Definition Classes
    RDD
  72. def keyBy[K](f: (String) ⇒ K): RDD[(K, String)]

    Definition Classes
    RDD
  73. def listFacets(columnIndexes: List[Int]): TextFacets

    Returns a new TextFacets containing the facets of @columnIndexes

    columnIndexes

    List of column indexes

    returns

    TextFacets

  74. def listFacets(columnIndex: Int): TextFacets

    Returns a new TextFacets containing the cardinal values of @columnIndex

    columnIndex

    Index of the column

    returns

    TextFacets
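
    A text facet is commonly understood as each distinct value paired with the number of times it occurs. The sketch below assumes that reading and comma-delimited records. It returns a plain `Map` and does not reproduce the TextFacets class.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object FacetSketch {
  // Hypothetical helper: counts the occurrences of each distinct value in one column.
  def facetCounts(records: Seq[String], columnIndex: Int, delimiter: String = ","): Map[String, Int] = {
    val splitter = Pattern.quote(delimiter)
    records
      .map(_.split(splitter, -1)(columnIndex))
      .groupBy(identity)
      .map { case (value, occurrences) => value -> occurrences.size }
  }
}
```

    For example, `FacetSketch.facetCounts(Seq("a,x", "b,y", "c,x"), 1)` yields `Map("x" -> 2, "y" -> 1)`.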

  75. def localCheckpoint(): TransformableRDD.this.type

    Definition Classes
    RDD
  76. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  77. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  78. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  79. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  80. def logError(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  81. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  82. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  83. def logName: String

    Attributes
    protected
    Definition Classes
    Logging
  84. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  85. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  86. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  87. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  88. def map[U](f: (String) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  89. def mapByFlag(symbol: String, symbolColumnIndex: Int, mapFunction: (String) ⇒ String): TransformableRDD

    Returns a new TransformableRDD by applying @mapFunction to all rows flagged with @symbol

    symbol

    Symbol that has been used for flagging.

    symbolColumnIndex

    Index of the column that holds the flag symbol

    mapFunction

    Function to apply to each flagged row

    returns

    TransformableRDD
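
    The interplay of @symbol, @symbolColumnIndex and @mapFunction can be sketched on local collections. This is an illustration under assumptions (comma delimiter, the function receives the whole row), not the library's code.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object MapByFlagSketch {
  // Hypothetical helper: transforms only rows whose flag column equals `symbol`.
  def mapByFlag(records: Seq[String],
                symbol: String,
                symbolColumnIndex: Int,
                mapFunction: String => String,
                delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    records.map { record =>
      val fields = record.split(splitter, -1)
      val flagged = fields.length > symbolColumnIndex && fields(symbolColumnIndex) == symbol
      // Unflagged rows pass through unchanged.
      if (flagged) mapFunction(record) else record
    }
  }
}
```

    For example, `MapByFlagSketch.mapByFlag(Seq("a,1,", "b,2,*"), "*", 2, _.toUpperCase)` yields `Seq("a,1,", "B,2,*")`.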

  90. def mapPartitions[U](f: (Iterator[String]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  91. def mapPartitionsWithIndex[U](f: (Int, Iterator[String]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  92. def max()(implicit ord: Ordering[String]): String

    Definition Classes
    RDD
  93. def mergeColumns(columns: List[Int], separator: String = " ", retainColumns: Boolean = false): TransformableRDD

    Returns a new TransformableRDD by merging two or more columns together

    columns

    List of columns to be merged

    separator

    Separator placed between the merged values

    retainColumns

    false when you want to remove the values at @columns from the resulting TransformableRDD

    returns

    TransformableRDD
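
    The effect of @separator and @retainColumns can be sketched on local collections. The placement of the merged value (appended as the last column) and the comma delimiter are assumptions, since the documentation does not state where the merged value goes.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object MergeColumnsSketch {
  // Hypothetical helper: joins the chosen columns with `separator` and appends the result.
  def mergeColumns(records: Seq[String],
                   columns: List[Int],
                   separator: String = " ",
                   retainColumns: Boolean = false,
                   delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    records.map { record =>
      val fields = record.split(splitter, -1).toVector
      val merged = columns.map(i => fields(i)).mkString(separator)
      // Drop the source columns unless the caller asked to keep them.
      val kept =
        if (retainColumns) fields
        else fields.zipWithIndex.collect { case (value, i) if !columns.contains(i) => value }
      (kept :+ merged).mkString(delimiter)
    }
  }
}
```

    For example, `MergeColumnsSketch.mergeColumns(Seq("John,Doe,30"), List(0, 1))` yields `Seq("30,John Doe")`, and passing `retainColumns = true` yields `Seq("John,Doe,30,John Doe")`.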

  94. def min()(implicit ord: Ordering[String]): String

    Definition Classes
    RDD
  95. def multiplyColumns(firstColumn: Int, secondColumn: Int): RDD[Double]

    Returns an RDD of doubles holding the product of the values in @firstColumn and @secondColumn

    firstColumn

    First Column Index

    secondColumn

    Second Column Index

    returns

    RDD[Double]
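
    The row-wise product can be sketched on local collections as follows. The helper name and the comma delimiter are assumptions, and the sketch fails with a NumberFormatException on non-numeric values, which may differ from the library's behaviour.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object MultiplyColumnsSketch {
  // Hypothetical helper: multiplies the numeric values of two columns for every record.
  def multiplyColumns(records: Seq[String],
                      firstColumn: Int,
                      secondColumn: Int,
                      delimiter: String = ","): Seq[Double] = {
    val splitter = Pattern.quote(delimiter)
    records.map { record =>
      val fields = record.split(splitter, -1)
      fields(firstColumn).trim.toDouble * fields(secondColumn).trim.toDouble
    }
  }
}
```

    For example, `MultiplyColumnsSketch.multiplyColumns(Seq("2,3.5", "4,0.5"), 0, 1)` yields `Seq(7.0, 2.0)`.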

  96. var name: String

    Definition Classes
    RDD
  97. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  98. def normalize(columnIndex: Int, normalizer: NormalizationStrategy): TransformableRDD

    Returns a new TransformableRDD by normalizing the values of the given column using the given NormalizationStrategy

    columnIndex

    Column Index

    normalizer

    Normalization Strategy

    returns

    TransformableRDD

  99. final def notify(): Unit

    Definition Classes
    AnyRef
  100. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  101. def numberOfColumns(): Int

    Returns the number of columns in this RDD

    returns

    Int

  102. def parent[U](j: Int)(implicit arg0: ClassTag[U]): RDD[U]

    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  103. val partitioner: Option[Partitioner]

    Definition Classes
    RDD
  104. final def partitions: Array[Partition]

    Definition Classes
    RDD
  105. def persist(): TransformableRDD.this.type

    Definition Classes
    RDD
  106. def persist(newLevel: StorageLevel): TransformableRDD.this.type

    Definition Classes
    RDD
  107. def pipe(command: Seq[String], env: Map[String, String], printPipeContext: ((String) ⇒ Unit) ⇒ Unit, printRDDElement: (String, (String) ⇒ Unit) ⇒ Unit, separateWorkingDir: Boolean): RDD[String]

    Definition Classes
    RDD
  108. def pipe(command: String, env: Map[String, String]): RDD[String]

    Definition Classes
    RDD
  109. def pipe(command: String): RDD[String]

    Definition Classes
    RDD
  110. def pivotByCount(pivotalColumn: Int, independentColumnIndexes: Seq[Int]): PivotTable[Integer]

    Generates a PivotTable by pivoting data in the pivotalColumn

    pivotalColumn

    Pivotal Column

    independentColumnIndexes

    Independent Column Indexes

    returns

    PivotTable

  111. final def preferredLocations(split: Partition): Seq[String]

    Definition Classes
    RDD
  112. def randomSplit(weights: Array[Double], seed: Long): Array[RDD[String]]

    Definition Classes
    RDD
  113. def reduce(f: (String, String) ⇒ String): String

    Definition Classes
    RDD
  114. def removeRows(predicate: (RowRecord) ⇒ Boolean): TransformableRDD

    Returns a new TransformableRDD containing only the elements that satisfy the predicate.

    predicate

    A predicate function that returns a Boolean for every row.

    returns

    TransformableRDD

  115. def repartition(numPartitions: Int)(implicit ord: Ordering[String]): RDD[String]

    Definition Classes
    RDD
  116. def replaceValues(cluster: Cluster, newValue: String, columnIndex: Int): TransformableRDD

    Returns a new TransformableRDD by replacing the text values in @cluster with the specified @newValue

    cluster

    Cluster of similar values to be replaced

    newValue

    Value that will replace every value in the cluster

    columnIndex

    Column index

    returns

    TransformableRDD

  117. def sample(withReplacement: Boolean, fraction: Double, seed: Long): RDD[String]

    Definition Classes
    RDD
  118. def sampleColumnValues(columnIndex: Int): List[String]

    Returns a List of sample values from @columnIndex

    columnIndex

    Column index to sample

    returns

    List[String]

    Definition Classes
    AbstractRDD
  119. val sampleRecords: List[String]

    Attributes
    protected
    Definition Classes
    AbstractRDD
  120. def saveAsObjectFile(path: String): Unit

    Definition Classes
    RDD
  121. def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit

    Definition Classes
    RDD
  122. def saveAsTextFile(path: String): Unit

    Definition Classes
    RDD
  123. def select(columnIndexes: List[Int]): TransformableRDD

    Returns a new TransformableRDD containing values of @columnIndexes

    columnIndexes

    A list of integers specifying the columns that will be used to create the new table

    returns

    TransformableRDD
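
    Column selection amounts to a projection of each record. The sketch below models it on local collections, assuming comma-delimited records and that the output columns follow the order given in @columnIndexes. Neither assumption is confirmed by the documentation.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object SelectSketch {
  // Hypothetical helper: keeps only the requested columns, in the requested order.
  def select(records: Seq[String], columnIndexes: List[Int], delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    records.map { record =>
      val fields = record.split(splitter, -1)
      columnIndexes.map(i => fields(i)).mkString(delimiter)
    }
  }
}
```

    For example, `SelectSketch.select(Seq("a,b,c", "d,e,f"), List(2, 0))` yields `Seq("c,a", "f,d")`.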

  124. def select(columnIndex: Int): RDD[String]

    Returns an RDD of the given column

    columnIndex

    Column index

    returns

    RDD[String]

    Definition Classes
    AbstractRDD
  125. def setName(_name: String): TransformableRDD.this.type

    Definition Classes
    RDD
  126. def smooth(columnIndex: Int, smoothingMethod: SmoothingMethod): RDD[Double]

    Returns a new RDD containing smoothed values of @columnIndex using @smoothingMethod

    columnIndex

    Column Index

    smoothingMethod

    Method that will be used for smoothing of the data

    returns

    RDD[Double]

  127. def sortBy[K](f: (String) ⇒ K, ascending: Boolean, numPartitions: Int)(implicit ord: Ordering[K], ctag: ClassTag[K]): RDD[String]

    Definition Classes
    RDD
  128. def sparkContext: SparkContext

    Definition Classes
    RDD
  129. def splitByDelimiter(column: Int, delimiter: String, retainColumn: Boolean = false, maxSplit: Int = 1): TransformableRDD

    Returns a new TransformableRDD by splitting the value at @column by the delimiter provided

    column

    Column index of the value to be split

    delimiter

    Delimiter or regular expression that will be used to split the value at @column

    retainColumn

    false when you want to remove the column value at @column in the result TransformableRDD

    maxSplit

    Maximum number of splits to be added to the result TransformableRDD

    returns

    TransformableRDD

  130. def splitByFieldLength(column: Int, fieldLengths: List[Int], retainColumn: Boolean = false): TransformableRDD

    Returns a TransformableRDD by splitting the value at @column according to the specified lengths

    column

    Column index of the value to be split

    fieldLengths

    List of integers specifying the number of characters each split value will contain

    retainColumn

    false when you want to remove the column value at @column in the result TransformableRDD

    returns

    TransformableRDD
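
    Fixed-width splitting can be sketched on local collections as follows. The placement of the split values (appended as the last columns), the handling of values shorter than the requested lengths (shorter or empty pieces) and the comma delimiter are all assumptions for illustration.

```scala
import java.util.regex.Pattern

// Illustrative model on local collections; not the prep-buddy implementation.
object SplitByFieldLengthSketch {
  // Hypothetical helper: cuts one column into consecutive pieces of the given lengths.
  def splitByFieldLength(records: Seq[String],
                         column: Int,
                         fieldLengths: List[Int],
                         retainColumn: Boolean = false,
                         delimiter: String = ","): Seq[String] = {
    val splitter = Pattern.quote(delimiter)
    // Start offset of each piece: 0, l1, l1 + l2, ...
    val offsets = fieldLengths.scanLeft(0)(_ + _)
    records.map { record =>
      val fields = record.split(splitter, -1).toVector
      val value = fields(column)
      // slice tolerates out-of-range bounds, so short values give short or empty pieces.
      val pieces = offsets.zip(fieldLengths).map { case (start, length) => value.slice(start, start + length) }
      val kept = if (retainColumn) fields else fields.patch(column, Nil, 1)
      (kept ++ pieces).mkString(delimiter)
    }
  }
}
```

    For example, `SplitByFieldLengthSketch.splitByFieldLength(Seq("x,20160315"), 1, List(4, 2, 2))` yields `Seq("x,2016,03,15")`.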

  131. def subtract(other: RDD[String], p: Partitioner)(implicit ord: Ordering[String]): RDD[String]

    Definition Classes
    RDD
  132. def subtract(other: RDD[String], numPartitions: Int): RDD[String]

    Definition Classes
    RDD
  133. def subtract(other: RDD[String]): RDD[String]

    Definition Classes
    RDD
  134. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  135. def take(num: Int): Array[String]

    Definition Classes
    RDD
  136. def takeOrdered(num: Int)(implicit ord: Ordering[String]): Array[String]

    Definition Classes
    RDD
  137. def takeSample(withReplacement: Boolean, num: Int, seed: Long): Array[String]

    Definition Classes
    RDD
  138. def toDebugString: String

    Definition Classes
    RDD
  139. def toDoubleRDD(columnIndex: Int): RDD[Double]

    Returns an RDD[Double] of the values at the given column index

    columnIndex

    Column index

    returns

    RDD[Double]

    Definition Classes
    AbstractRDD
  140. def toJavaRDD(): JavaRDD[String]

    Definition Classes
    RDD
  141. def toLocalIterator: Iterator[String]

    Definition Classes
    RDD
  142. def toRDD: RDD[String]

    Returns RDD

    returns

    RDD[String]

    Definition Classes
    AbstractRDD
  143. def toString(): String

    Definition Classes
    RDD → AnyRef → Any
  144. def top(num: Int)(implicit ord: Ordering[String]): Array[String]

    Definition Classes
    RDD
  145. def treeAggregate[U](zeroValue: U)(seqOp: (U, String) ⇒ U, combOp: (U, U) ⇒ U, depth: Int)(implicit arg0: ClassTag[U]): U

    Definition Classes
    RDD
  146. def treeReduce(f: (String, String) ⇒ String, depth: Int): String

    Definition Classes
    RDD
  147. def union(other: RDD[String]): RDD[String]

    Definition Classes
    RDD
  148. def unique(columnIndex: Int): RDD[String]

    Returns a new RDD containing the unique elements in the specified column

    columnIndex

    Column Index

    returns

    RDD[String]

  149. def unpersist(blocking: Boolean): TransformableRDD.this.type

    Definition Classes
    RDD
  150. def validateColumnIndex(columnIndexes: List[Int]): Unit

    Attributes
    protected
    Definition Classes
    AbstractRDD
  151. def validateColumnIndex(columnIndex: Int): Unit

    Attributes
    protected
    Definition Classes
    AbstractRDD
  152. def validateNumericColumn(columnIndex: Int): Unit

    Attributes
    protected
    Definition Classes
    AbstractRDD
  153. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  154. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  155. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  156. def zip[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(String, U)]

    Definition Classes
    RDD
  157. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D])(f: (Iterator[String], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  158. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D], preservesPartitioning: Boolean)(f: (Iterator[String], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  159. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C])(f: (Iterator[String], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  160. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C], preservesPartitioning: Boolean)(f: (Iterator[String], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  161. def zipPartitions[B, V](rdd2: RDD[B])(f: (Iterator[String], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  162. def zipPartitions[B, V](rdd2: RDD[B], preservesPartitioning: Boolean)(f: (Iterator[String], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  163. def zipWithIndex(): RDD[(String, Long)]

    Definition Classes
    RDD
  164. def zipWithUniqueId(): RDD[(String, Long)]

    Definition Classes
    RDD

Deprecated Value Members

  1. def filterWith[A](constructA: (Int) ⇒ A)(p: (String, A) ⇒ Boolean): RDD[String]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and filter

  2. def flatMapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (String, A) ⇒ Seq[U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and flatMap

  3. def foreachWith[A](constructA: (Int) ⇒ A)(f: (String, A) ⇒ Unit): Unit

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and foreach

  4. def mapPartitionsWithContext[U](f: (TaskContext, Iterator[String]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @DeveloperApi() @deprecated
    Deprecated

    (Since version 1.2.0) use TaskContext.get

  5. def mapPartitionsWithSplit[U](f: (Int, Iterator[String]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 0.7.0) use mapPartitionsWithIndex

  6. def mapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (String, A) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex

  7. def toArray(): Array[String]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use collect
