Packages

o

org.apache.spark.sql.catalyst.util

UnsafeRowUtils

object UnsafeRowUtils

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. UnsafeRowUtils
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def avoidSetNullAt(dt: DataType): Boolean

    Returns a Boolean indicating whether one should avoid calling UnsafeRow.setNullAt for a field of the given data type.

    Returns a Boolean indicating whether one should avoid calling UnsafeRow.setNullAt for a field of the given data type. Fields of type DecimalType (with precision greater than Decimal.MAX_LONG_DIGITS) and CalendarIntervalType use pointers into the variable length region, and those pointers should never get zeroed out (setNullAt will zero out those pointers) because UnsafeRow may do in-place update for these 2 types even though they are not primitive.

    When avoidSetNullAt returns true, callers should not use UnsafeRow#setNullAt for fields of that data type, but instead pass a null value to the appropriate set method, e.g.:

    row.setDecimal(ordinal, null, precision)

    Even though only UnsafeRow has this limitation, it's safe to extend this rule to all subclasses of InternalRow, since you don't always know the concrete type of the row you are dealing with, and all subclasses of InternalRow will handle a null value appropriately.

  6. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  7. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. def getOffsetAndSize(row: UnsafeRow, index: Int): (Int, Int)
  12. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  13. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  14. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  16. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  17. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  18. def toString(): String
    Definition Classes
    AnyRef → Any
  19. def validateStructuralIntegrity(row: UnsafeRow, expectedSchema: StructType): Boolean

    Use the following rules to check the integrity of the UnsafeRow: - schema.fields.length == row.numFields should always be true - UnsafeRow.calculateBitSetWidthInBytes(row.numFields) < row.getSizeInBytes should always be true if the expectedSchema contains at least one field.

    Use the following rules to check the integrity of the UnsafeRow: - schema.fields.length == row.numFields should always be true - UnsafeRow.calculateBitSetWidthInBytes(row.numFields) < row.getSizeInBytes should always be true if the expectedSchema contains at least one field. - For variable-length fields:

    • if null bit says it's null, then
      • in general the offset-and-size should be zero
      • special case: variable-length DecimalType is considered mutable in UnsafeRow, and to support that, the offset is set to point to the variable-length part like a non-null value, while the size is set to zero to signal that it's a null value. The offset may also be set to zero, in which case this variable-length Decimal no longer supports being mutable in the UnsafeRow.
    • otherwise the field is not null, then extract offset and size: 1) 0 <= size < row.getSizeInBytes should always be true. We can be even more precise than this, where the upper bound of size can only be as big as the variable length part of the row. 2) offset should be >= fixed sized part of the row. 3) offset + size should be within the row bounds. - For fixed-length fields that are narrower than 8 bytes (boolean/byte/short/int/float), if null bit says it's null then don't do anything, else: check if the unused bits in the field are all zeros. The UnsafeRowWriter's write() methods make this guarantee. - Check the total length of the row.
  20. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  21. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from AnyRef

Inherited from Any

Ungrouped