INDEX
    Explanations

    terms related to assumptions being made

    references to assumptions

    New Auto-Interp
    Negative Logits
    Interstitial
    -0.81
     deed
    -0.69
    hern
    -0.68
    HCR
    -0.66
    amen
    -0.65
    sung
    -0.65
    iferation
    -0.64
    paying
    -0.64
    ching
    -0.64
    aska
    -0.63
    POSITIVE LOGITS
     assumptions
    1.32
     assumption
    1.16
     assum
    0.91
     assumes
    0.81
     princ
    0.81
     biases
    0.79
     underpin
    0.78
     assume
    0.76
     guesses
    0.74
     mistakes
    0.74
    Act Density 0.011%

    No Known Activations