INDEX
Explanations
references to the concept of "anything."
repetitive mentions of the word "anything" in various contexts
New Auto-Interp
Negative Logits
irth
-0.77
ros
-0.76
grad
-0.74
nec
-0.70
pa
-0.69
inst
-0.66
hap
-0.64
anonymity
-0.63
ritz
-0.63
lav
-0.63
POSITIVE LOGITS
else
1.53
Else
1.39
THING
1.23
imaginable
1.08
Else
1.04
remotely
1.02
resembling
0.93
conceivable
0.88
whatsoever
0.85
soever
0.81
Activations Density 0.019%