INDEX
    Explanations

    prepositions

    New Auto-Interp
    Negative Logits
    -0.08
    timestamps
    -0.06
    dız
    -0.06
     Lance
    -0.06
    TimeZone
    -0.06
    -0.06
    Ik
    -0.06
    _REPEAT
    -0.06
     خویش
    -0.06
    -0.06
    POSITIVE LOGITS
     mortar
    0.07
    ELLOW
    0.07
     Sco
    0.07
     Trit
    0.07
    MO
    0.06
    kehr
    0.06
    RESH
    0.06
    otten
    0.06
    0.06
     günü
    0.06
    Act Density 0.022%

    No Known Activations