INDEX
    Explanations

    phrases related to instructions or regulations

    Negative Logits
    Token           Logit
    " Aires"        -0.56
    " abundantly"   -0.55
    " banner"       -0.55
    " Corrections"  -0.54
    " Penal"        -0.53
    " marginal"     -0.52
    " nu"           -0.52
    " eh"           -0.51
    "SPONSORED"     -0.51
    " sterling"     -0.51
    Positive Logits
    Token           Logit
    "-"             0.95
    "-$"            0.93
    "usterity"      0.93
    "alog"          0.89
    "_"             0.89
    "lihood"        0.84
    "bsite"         0.83
    "mosp"          0.82
    "etheless"      0.80
    "tenance"       0.80
    Activation Density: 0.612%

    No Known Activations