INDEX
    Explanations

    negations and contractions

    New Auto-Interp
    Negative Logits
    ätt
    -0.16
    unal
    -0.15
    incr
    -0.15
    lew
    -0.15
    ë»
    -0.15
    nez
    -0.14
     Firmware
    -0.14
    unden
    -0.14
    forgettable
    -0.14
     realpath
    -0.14
    POSITIVE LOGITS
     sure
    0.30
    sure
    0.24
     alone
    0.24
     phased
    0.23
     anymore
    0.21
     Sure
    0.21
    Sure
    0.20
     allowed
    0.20
     finished
    0.19
     nearly
    0.19
    Act Density 0.131%

    No Known Activations