INDEX
    Explanations

    phrases that assert the reality or existence of a situation or concept

    New Auto-Interp
    Negative Logits
    etter
    -0.16
    das
    -0.15
     une
    -0.15
    stad
    -0.15
    ãģĭãģĹ
    -0.14
    ez
    -0.14
    esus
    -0.14
     ponto
    -0.13
    eyi
    -0.13
    ilogy
    -0.13
    POSITIVE LOGITS
    none
    0.18
     none
    0.16
    abelle
    0.15
    idar
    0.15
     bahwa
    0.14
    ABEL
    0.14
    omencl
    0.14
    NONE
    0.13
    ITTER
    0.13
    nesota
    0.13
    Act Density 0.052%

    No Known Activations