INDEX
    Explanations

    phrases related to the potential negative consequences of various scenarios

    phrases that indicate potential or possibility

    New Auto-Interp
    Negative Logits
     bidding
    -0.65
     Quarter
    -0.62
     Mant
    -0.60
     Fighter
    -0.60
     Federation
    -0.59
     Fighting
    -0.58
     Prayer
    -0.57
     worsh
    -0.57
     Hamp
    -0.56
     submitting
    -0.56
    POSITIVE LOGITS
    't
    1.52
    adian
    1.07
    berra
    1.05
     confuse
    1.02
     easily
    1.01
    isters
    1.00
     complicate
    0.99
     be
    0.99
    NOT
    0.98
    ister
    0.96
    Act Density 0.124%

    No Known Activations