INDEX
    Explanations

    phrases related to actions of individuals, including actions with moral implications

    instances of social or political manipulation and control

    New Auto-Interp
    Negative Logits
    actionDate
    -0.85
    were
    -0.78
    Were
    -0.77
    ERE
    -0.72
     Were
    -0.71
     Matter
    -0.69
     weren
    -0.69
    DragonMagazine
    -0.66
    oubted
    -0.65
     outnumbered
    -0.64
    POSITIVE LOGITS
     prepares
    1.73
     learns
    1.71
     destroys
    1.71
     shuts
    1.70
     recovers
    1.68
     loses
    1.67
     performs
    1.67
     develops
    1.67
     tries
    1.66
     delivers
    1.66
    Act Density 0.817%

    No Known Activations