INDEX
    Explanations

    phrases related to surpassing or meeting expectations

    references to prior expectations or beliefs about events or outcomes

    New Auto-Interp
    Negative Logits
    ichael
    -0.73
    istries
    -0.70
    loo
    -0.67
    advant
    -0.65
    athe
    -0.62
    onies
    -0.62
    aw
    -0.61
    ipop
    -0.60
    onite
    -0.60
    vl
    -0.59
    POSITIVE LOGITS
     estim
    0.69
     predec
    0.67
    DERR
    0.67
     assumptions
    0.66
     adversaries
    0.63
    existent
    0.61
     eras
    0.59
     selves
    0.59
    '';
    0.59
     Humans
    0.58
    Act Density 0.145%

    No Known Activations