    Explanations

    phrases indicating dedication or commitment to a cause

    Negative Logits
    yer       -0.18
    /cms      -0.16
     Penal    -0.15
    ernel     -0.15
    alla      -0.15
    PAD       -0.15
     Dann     -0.14
    abin      -0.14
    ål        -0.14
    agate     -0.14
    Positive Logits
    vo        0.15
    ByUrl     0.15
    dden      0.14
    åľ°ä¸ĭ    0.14
    AYER      0.14
    _CHAN     0.13
    ille      0.13
    ÑĤиÑĢов   0.13
    çķ¥       0.13
    ekim      0.13
    Activation Density: 0.010%

    No Known Activations