    Explanations

    instances of phrases indicating uniqueness or superlatives

    phrases that indicate distinct categories or classifications

    Negative Logits
    Token            Logit
    "words"          -0.62
    "ļéĨĴ"           -0.55
    "yards"          -0.54
    " carriers"      -0.54
    "cdn"            -0.53
    " Minutes"       -0.52
    " meanwhile"     -0.52
    "ming"           -0.51
    " reading"       -0.50
    "tyard"          -0.50
    Positive Logits
    Token            Logit
    " kind"           1.45
    " type"           1.11
    " kinds"          1.10
    "Kind"            1.06
    " sort"           1.05
    "kind"            1.02
    " sorts"          1.01
    "size"            0.94
    " Kind"           0.93
    "type"            0.90
    Activation Density: 0.079%

    No Known Activations