INDEX
Explanations
phrases related to actions performed by individuals
phrases related to actions and accountability
New Auto-Interp
Negative Logits
Entered
-0.71
suspects
-0.69
Board
-0.66
ussen
-0.62
assis
-0.60
Flood
-0.59
inently
-0.58
Dram
-0.57
Rampage
-0.57
Gork
-0.57
POSITIVE LOGITS
pez
1.06
differently
0.94
wrong
0.84
unconsciously
0.80
actic
0.74
Äĩ
0.73
wrong
0.72
administr
0.72
athlet
0.70
女
0.69
Activations Density 0.050%