INDEX
Explanations
phrases indicating alternative options or choices
occurrences of the word "either" in relation to choices or alternatives
Negative Logits
Token      Logit
achus      -0.76
vez        -0.75
ussions    -0.71
biz        -0.71
elong      -0.70
roxy       -0.70
emen       -0.70
inav       -0.69
lights     -0.68
riad       -0.68
Positive Logits
Token         Logit
halves        0.73
overtly       0.71
lift          0.70
ante          0.70
side          0.69
individually  0.66
either        0.64
verbally      0.64
sexes         0.62
implicitly    0.61
Activations Density: 0.017%