INDEX
    Explanations

    phrases indicating high likelihood or possibility of something happening

    phrases indicating probability or likelihood

    Negative Logits
    Token           Logit
    "76561"         -0.72
    "feeding"       -0.66
    "iful"          -0.64
    "CLASSIFIED"    -0.62
    "eworthy"       -0.61
    " Columb"       -0.60
    "fighting"      -0.59
    " Mour"         -0.58
    "cussion"       -0.58
    "Trivia"        -0.58

    Positive Logits
    Token           Logit
    " be"           1.09
    " become"       1.03
    " explode"      0.98
    " prove"        0.95
    " have"         0.93
    " succeed"      0.92
    " lose"         0.90
    " revert"       0.90
    " regress"      0.90
    " reside"       0.89
    Act Density 0.071%

    No Known Activations