INDEX
Explanations
the word "voluntary"
references to voluntary actions or decisions
New Auto-Interp
Negative Logits
marks
-0.82
Tycoon
-0.80
elf
-0.76
itect
-0.75
mers
-0.74
sworth
-0.73
rox
-0.73
GPU
-0.72
geist
-0.72
bugs
-0.71
POSITIVE LOGITS
untary
1.15
voluntary
1.14
untarily
1.05
manslaughter
1.05
unte
0.93
involuntary
0.86
compliance
0.84
cessation
0.83
captcha
0.80
voluntarily
0.80
Activations Density 0.006%