    Explanations

    symbols and special characters in the text

    Negative Logits
    "برد"  -0.19
    "_elt"  -0.16
    "borg"  -0.15
    "acos"  -0.15
    " patri"  -0.15
    ".scalablytyped"  -0.15
    "amil"  -0.14
    "ÙĪÛĮÙĦ"  -0.14
    "ìłĿ"  -0.14
    "iker"  -0.14
    Positive Logits
    " Sup"  0.27
    " Mon"  0.26
    "Sup"  0.23
    " SUP"  0.23
    "Mon"  0.22
    "_sup"  0.22
    ".sup"  0.22
    " par"  0.21
    " sup"  0.21
    "sup"  0.21
    Activation Density: 0.007%

    No Known Activations