    Explanations

    mentions of future actions

    instances of the word "will" indicating future events or actions

    Negative Logits
    Token             Logit
    "reality"         -0.65
    " abstraction"    -0.62
    "amped"           -0.62
    "Lear"            -0.62
    "Mom"             -0.61
    "ZI"              -0.60
    "processing"      -0.60
    "76561"           -0.59
    "establishment"   -0.59
    "engineering"     -0.59
    Positive Logits
    Token             Logit
    " be"             1.19
    " continue"       1.05
    " undoubtedly"    1.05
    " doubtless"      1.04
    " likely"         1.01
    " surely"         0.97
    " gladly"         0.95
    " probably"       0.93
    " remain"         0.93
    " definitely"     0.92
    Act Density 0.205%

    No Known Activations