INDEX
    Explanations

    phrases indicating types of actions, events, or conditions

    Negative Logits
    Token       Logit
    "aceae"     -0.71
    " ├"        -0.68
    "agree"     -0.66
    " ├──"      -0.64
    "lords"     -0.63
    "ngth"      -0.63
    "livious"   -0.63
    " somet"    -0.62
    "grounds"   -0.61
    "nown"      -0.61
    Positive Logits
    Token              Logit
    " moratorium"      1.19
    " boycott"         1.13
    " halt"            1.06
    " truce"           0.91
    " referendum"      0.87
    " barrage"         0.85
    " roundup"         0.83
    " dismissal"       0.83
    " demonstration"   0.83
    " crackdown"       0.82
    Activation Density: 0.078%

    No Known Activations