INDEX
    Explanations

    unique character symbols and foreign language characters

    Negative Logits
    Token         Logit
    " equival"    -0.57
    "eele"        -0.57
    "pole"        -0.56
    " eleph"      -0.55
    " Penet"      -0.54
    " satur"      -0.54
    " locations"  -0.53
    "gdala"       -0.53
    " bonding"    -0.52
    "cano"        -0.52
    Positive Logits
    Token         Logit
    "ħ"           1.08
    "Ķ"           1.08
    "ī"           1.06
    "ĺ"           1.06
    "Į"           1.03
    "Ł"           1.02
    "ľ"           1.00
    "ı"           1.00
    "¿"           0.99
    "¼"           0.99
    Activation Density: 0.007%

    No Known Activations