com.redhat.et.silex.text

ApproximateWhitelist

case class ApproximateWhitelist(filter: BitSet) extends Product with Serializable

An ApproximateWhitelist is a basic Bloom filter intended for holding natural-language vocabularies. It deals with String values natively and can be trained from a sequence or from an RDD of any element type T, as long as there is an implicit conversion in scope from T to String.
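To make the description above concrete, here is a minimal, self-contained sketch of a Bloom-filter-style whitelist with the same member shapes (`filter`, `add`, `maybeContains`). It is not the Silex implementation: the name `SketchWhitelist`, the `train` helper, the `Token` type, the filter width, and the two hash functions are all assumptions chosen for illustration.

```scala
import scala.collection.immutable.BitSet
import scala.language.implicitConversions

// Hypothetical stand-in for ApproximateWhitelist; NOT the Silex implementation.
case class SketchWhitelist(filter: BitSet) {
  import SketchWhitelist.indices

  // Returns a new whitelist with the bits for `s` set.
  def add[A](s: A)(implicit f: A => String): SketchWhitelist =
    SketchWhitelist(indices(f(s)).foldLeft(filter)(_ + _))

  // True if every bit for `s` is set (possibly present);
  // false means `s` was definitely never added.
  def maybeContains[A](s: A)(implicit f: A => String): Boolean =
    indices(f(s)).forall(filter.contains)
}

object SketchWhitelist {
  val Bits = 1024 // assumed filter width

  val empty: SketchWhitelist = SketchWhitelist(BitSet.empty)

  // Two illustrative hashes (assumptions), each reduced to a non-negative bit index.
  def indices(s: String): Seq[Int] = {
    val h1 = s.hashCode
    val h2 = scala.util.hashing.MurmurHash3.stringHash(s)
    Seq(h1, h2).map(h => ((h % Bits) + Bits) % Bits)
  }

  // Hypothetical "training" helper: folds a sequence of any T that converts to String.
  def train[T](xs: Seq[T])(implicit f: T => String): SketchWhitelist =
    xs.foldLeft(empty)((w, x) => w.add(x))
}

// A hypothetical element type with an implicit conversion to String
// placed in its companion so that it is found automatically.
case class Token(text: String)
object Token {
  implicit def tokenToString(t: Token): String = t.text
}
```

With these definitions, `SketchWhitelist.train(Seq(Token("pear")))` accepts both `Token("pear")` and the plain string `"pear"`, because both reduce to the same String before hashing.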

Known limitation: although this filter uses several hash functions, some of them exhibit unusually high collision rates on strings that are permutations of one another. If the filter performs poorly on a given vocabulary (for example, by accepting many words that were never added), this is worth investigating. The choice of hash functions is subject to change in a future release.
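The permutation issue can be illustrated with two toy hashes. This documentation does not say which hash functions the filter uses, so both functions below are hypothetical; the sketch only shows why an order-insensitive hash collides on anagrams while an order-sensitive one usually does not.

```scala
// Hypothetical hashes for illustration only; not the hashes used by Silex.
object PermutationCollisionSketch {
  // Order-insensitive: sums the character codes, so every permutation
  // of the same characters produces the same value.
  def additiveHash(s: String): Int = s.foldLeft(0)(_ + _)

  // Order-sensitive: polynomial rolling hash with base 31
  // (the same scheme as java.lang.String.hashCode).
  def polynomialHash(s: String): Int = s.foldLeft(0)((h, c) => 31 * h + c)
}
```

Under `additiveHash`, the anagrams "listen" and "silent" map to the same value, so in a filter that relied on such a hash, adding one word would set the bit that the other word checks. Under `polynomialHash` the two words hash differently.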

Linear Supertypes
Serializable, Serializable, Product, Equals, AnyRef, Any

Instance Constructors

  1. new ApproximateWhitelist(filter: BitSet)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def add[A](s: A)(implicit f: (A) ⇒ String): ApproximateWhitelist

    Adds an element to the whitelist.

  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. def combine(other: ⇒ ApproximateWhitelist): ApproximateWhitelist

    Creates a whitelist that accepts a superset of anything accepted by this and anything accepted by other.

  10. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  11. val filter: BitSet

  12. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  13. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  15. def maybeContains[A](s: A)(implicit f: (A) ⇒ String): Boolean

    Returns true if s is possibly contained in the whitelist and false if it definitely is not.

  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  20. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  21. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
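Of the members listed above, combine is the one whose description is least obvious. One plausible reading, assuming both whitelists use the same width and the same hashes, is a union of the underlying bit sets. The sketch below works directly on BitSet values; `CombineSketch`, its width, and its single hash are assumptions for illustration and are not taken from the Silex source.

```scala
import scala.collection.immutable.BitSet

// Hypothetical helpers for illustration; NOT the Silex implementation.
object CombineSketch {
  val Bits = 64 // assumed filter width

  // A single hash for brevity, reduced to a non-negative bit index.
  def index(s: String): Int = ((s.hashCode % Bits) + Bits) % Bits

  def add(filter: BitSet, s: String): BitSet = filter + index(s)

  def maybeContains(filter: BitSet, s: String): Boolean =
    filter.contains(index(s))

  // Union of the two bit sets: anything accepted by either input
  // is accepted by the result. `b` is by-name, mirroring the
  // `other: => ApproximateWhitelist` parameter in the signature above.
  def combine(a: BitSet, b: => BitSet): BitSet = a | b
}
```

With several hashes per element, a union can also accept a string whose bits are split between the two inputs, which is consistent with the description saying "superset" rather than "union".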
