INDEX
    Explanations

    contractions of "do not" or "does not"

    Negative Logits
    " nearest"         -0.69
    " enthusi"         -0.67
    " populated"       -0.65
    " newcom"          -0.64
    " redes"           -0.63
    " cancellation"    -0.63
    " Cance"           -0.62
    " exha"            -0.61
    " Published"       -0.61
    " princ"           -0.61

    Positive Logits
    "'t"               1.84
    "ÃŃ"               1.09
    "uts"              1.00
    "eness"            0.96
    "n"                0.96
    "´"                0.94
    "etsk"             0.92
    "ned"              0.92
    "hips"             0.87
    "ALD"              0.84
    Activation Density: 0.153%
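
The density figure above is presumably the fraction of sampled token positions on which the feature fires. A minimal sketch of that calculation, assuming "fires" means an activation strictly above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and not taken from the dashboard:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its value is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 3 active positions out of 2000 sampled tokens.
sample = [0.0] * 1997 + [0.4, 1.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density near 0.15% would mean the feature is sparse, firing on roughly one or two tokens per thousand.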

    No Known Activations