INDEX
Explanations
phrases related to making decisions or choices
New Auto-Interp
Negative Logits
Introduced
-0.68
mone
-0.62
via
-0.59
###
-0.58
shall
-0.57
April
-0.56
vividly
-0.56
wordpress
-0.56
§
-0.56
angelo
-0.55
POSITIVE LOGITS
hire
0.74
bidden
0.71
gotten
0.69
precaution
0.66
geries
0.66
example
0.65
raviolet
0.65
WARD
0.63
captcha
0.62
starters
0.61
Activations Density 12.975%