INDEX
Explanations
phrases related to actions of individuals, including actions with moral implications
instances of social or political manipulation and control
New Auto-Interp
Negative Logits
actionDate
-0.85
were
-0.78
Were
-0.77
ERE
-0.72
Were
-0.71
Matter
-0.69
weren
-0.69
DragonMagazine
-0.66
oubted
-0.65
outnumbered
-0.64
POSITIVE LOGITS
prepares
1.73
learns
1.71
destroys
1.71
shuts
1.70
recovers
1.68
loses
1.67
performs
1.67
develops
1.67
tries
1.66
delivers
1.66
Activations Density 0.817%