INDEX
    Explanations

    statements emphasizing assurance and guarantee of compliance or reliability

    New Auto-Interp
    Negative Logits
     fritas
    -0.72
    talk
    -0.71
    Notae
    -0.67
     fVar
    -0.64
    7
    -0.62
     talk
    -0.62
    tauchen
    -0.61
    amb
    -0.60
     Pollard
    -0.60
    Brat
    -0.60
    POSITIVE LOGITS
     Ensure
    1.43
     ensures
    1.37
    Ensure
    1.36
     ensuring
    1.35
     ensured
    1.35
     ensure
    1.34
     Ensuring
    1.18
     Ens
    1.16
    ensure
    1.11
     ENS
    1.09
    Act Density 0.073%

    No Known Activations