INDEX
    Explanations

    phrases indicating impending trouble or consequences

    New Auto-Interp
    Negative Logits
    iyan
    -0.07
    Slf
    -0.07
    irá
    -0.06
     Dra
    -0.06
     characteristic
    -0.06
    LAG
    -0.06
    stellung
    -0.06
    nett
    -0.06
     text
    -0.06
     Craft
    -0.06
    POSITIVE LOGITS
    æĹıèĩªæ²»
    0.06
     Tears
    0.06
     Tec
    0.06
    ifo
    0.06
    cot
    0.06
    atk
    0.06
    imo
    0.06
     Verified
    0.06
    à¸ķะ
    0.06
    åŃ
    0.06
    Act Density 0.002%

    No Known Activations