INDEX
    Explanations
    Negative Logits
    Token         Logit
    "LING"        -0.16
    "ling"        -0.16
    "rak"         -0.15
    "ôt"          -0.15
    "lington"     -0.14
    " Kushner"    -0.14
    "_Ptr"        -0.14
    "rtle"        -0.14
    "uality"      -0.13
    " ark"        -0.13
    Positive Logits
    Token         Logit
    "berry"       0.20
    "berries"     0.16
    "iana"        0.16
    "arden"       0.15
    "bell"        0.15
    "icha"        0.15
    "anke"        0.15
    "tero"        0.15
    "valuator"    0.14
    "anje"        0.14
    Activation Density: 0.013%

    No Known Activations