INDEX
    Explanations

    constructions related to the state, description, or location of things

    New Auto-Interp
    Negative Logits
    atro
    -0.14
    uming
    -0.14
    Äįin
    -0.14
    ائÙĤ
    -0.13
    ville
    -0.13
    aterno
    -0.13
    atan
    -0.13
    ier
    -0.13
    ase
    -0.13
     Astr
    -0.12
    POSITIVE LOGITS
    kker
    0.16
    éĤ¦
    0.15
    ané
    0.15
    ám
    0.14
    VOICE
    0.14
    tuk
    0.14
    553
    0.14
     поÑħ
    0.13
    lÃŃ
    0.13
    934
    0.13
    Act Density 0.193%

    No Known Activations