INDEX
Explanations
the word "Either" to indicate a choice or alternative
the word "either" and its variations, signaling conditional situations or contrasts
New Auto-Interp
Negative Logits
thal
-0.83
acter
-0.78
achus
-0.74
riad
-0.71
appings
-0.69
roxy
-0.68
plates
-0.68
vironments
-0.68
vir
-0.68
ocamp
-0.67
POSITIVE LOGITS
side
0.82
willfully
0.74
consciously
0.74
omit
0.69
manually
0.68
directly
0.68
Either
0.67
way
0.65
intentionally
0.64
ignore
0.64
Activations Density 0.023%