INDEX
    Explanations

    phrases related to defending or saving off negative outcomes or threats

    terms related to actions that mitigate or prevent negative outcomes

    New Auto-Interp
    Negative Logits
    rian
    -0.61
    ynski
    -0.58
    igma
    -0.57
    女
    -0.57
    Scot
    -0.56
     Supplemental
    -0.55
    HUD
    -0.55
    bsite
    -0.55
    agents
    -0.55
     Brewer
    -0.54
    POSITIVE LOGITS
     off
    1.91
    off
    1.48
    offs
    1.44
    Off
    1.35
     Off
    1.31
    OFF
    1.27
     OFF
    1.16
     dividends
    0.77
     goodbye
    0.76
     shoulders
    0.71
    Act Density 0.301%

    No Known Activations