INDEX
    Explanations

    instances where someone or something is anticipating a particular outcome

    expectation statements related to predictions or forecasts

    New Auto-Interp
    Negative Logits
    papers
    -0.68
    odor
    -0.67
    otin
    -0.66
    crim
    -0.65
    ãĤ±
    -0.63
     Tid
    -0.62
    ossier
    -0.62
    liam
    -0.62
     pseudonym
    -0.61
     Awakening
    -0.61
    POSITIVE LOGITS
     expects
    2.87
     anticip
    1.08
     expect
    1.03
     accepts
    1.01
     expecting
    0.98
    expected
    0.97
     env
    0.92
    Property
    0.91
     expected
    0.90
     assumes
    0.89
    Act Density 0.036%

    No Known Activations