INDEX
    Explanations

    punctuation

    New Auto-Interp
    Negative Logits
    оль
    -0.06
     فأ
    -0.06
    και
    -0.06
     proprio
    -0.06
    ух
    -0.06
     mic
    -0.06
     t�
    -0.06
     yyyy
    -0.06
     Ying
    -0.06
    чин
    -0.06
    POSITIVE LOGITS
     Î
    0.07
     (.
    0.06
     EXTRA
    0.06
     이동
    0.06
    iqu
    0.06
     ضربه
    0.06
    ][$
    0.06
     theological
    0.06
    intl
    0.06
    AGES
    0.06
    Act Density 0.029%

    No Known Activations