INDEX
    Explanations

    specific terms and phrases in a non-Latin script, indicating a focus on certain foreign language expressions or concepts

    Negative Logits
    Token      Logit
    "reek"     -0.17
    "oir"      -0.15
    "rary"     -0.15
    "agger"    -0.15
    "ikk"      -0.14
    " Kub"     -0.14
    "_sdk"     -0.14
    "arem"     -0.14
    "кут"      -0.14
    "duit"     -0.13
    Positive Logits
    Token           Logit
    " эксплуата"    0.17
    "ーネ"          0.16
    "snippet"       0.15
    " kern"         0.15
    "èά"            0.14
    "одейств"       0.14
    " การ"          0.14
    " Coin"         0.14
    " Sn"           0.14
    " Orient"       0.14
    Act Density 0.066%

    No Known Activations