INDEX
Explanations
phrases indicating intention or possibility
occurrences of the word "anything."
Negative Logits
Zone      -0.65
stro      -0.62
atic      -0.58
hs        -0.57
onomy     -0.56
osa       -0.55
roid      -0.54
Nation    -0.54
pez       -0.53
abre      -0.52
Positive Logits
anything  3.34
anything  2.81
Anything  2.09
Anything  2.00
anybody   1.93
ANY       1.78
anyone    1.77
any       1.74
anywhere  1.67
whatever  1.64
Activations Density 0.020%