INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     
    1.42
    a
    1.14
    𝗮
    1.13
    <0x80>
    1.10
     empate
    1.06
     a
    1.05
     puan
    1.05
     goalkeeper
    1.02
    ж
    1.00
    𝗿
    0.98
    POSITIVE LOGITS
     doctrines
    1.23
     theologians
    1.23
    들은
    1.19
    들에
    1.19
     thaliana
    1.19
    许多
    1.18
     Многие
    1.16
    科學
    1.13
     이런
    1.13
     세계
    1.13
    Act Density 2.278%

    No Known Activations