INDEX
Explanations
keywords related to actions or behaviors, especially in the context of criticisms or observations about others' actions
occurrences of the verb "act" and its variations
New Auto-Interp
Negative Logits
Teg
-0.66
Flavoring
-0.65
nic
-0.64
TON
-0.63
burn
-0.62
ickets
-0.61
yip
-0.61
Graphics
-0.60
Cotton
-0.60
fer
-0.60
POSITIVE LOGITS
uate
1.14
accordingly
1.05
uated
1.04
differently
1.04
decisively
1.02
responsibly
1.00
independently
0.98
aggressively
0.98
impuls
0.95
unilaterally
0.94
Activations Density 0.045%