    Explanations

    legal and administrative terms

    Negative Logits
    beard       -0.84
    ILY         -0.73
     buckle     -0.65
     Nieto      -0.61
     Belt       -0.61
     lightly    -0.61
    landish     -0.60
     err        -0.59
    theless     -0.59
     mist       -0.59
    Positive Logits
    ations      2.34
    ators       2.31
    atory       2.20
    ator        2.08
    ational     1.95
    ative       1.88
    atio        1.86
    atories     1.84
    ating       1.80
    ates        1.74
    Activation Density: 0.098%

    No Known Activations