INDEX
Explanations
references to voluntary actions or agreements
references to voluntary actions and decisions
Negative Logits
Token      Logit
marks      -0.79
Tycoon     -0.78
mers       -0.78
sworth     -0.78
alon       -0.73
abal       -0.73
osite      -0.73
efully     -0.72
raph       -0.72
geist      -0.71
Positive Logits
Token          Logit
untarily       1.08
untary         1.04
manslaughter   1.01
voluntary      0.95
surrender      0.95
unte           0.93
relinqu        0.88
involuntary    0.85
enlist         0.82
euth           0.80
Activations Density: 0.021%