INDEX
Explanations
the word "any" followed by a number
the word "any" in various contexts
New Auto-Interp
Negative Logits
rex
-0.71
rez
-0.67
appings
-0.65
bows
-0.64
ros
-0.61
iano
-0.61
ades
-0.61
seless
-0.60
endas
-0.59
expensive
-0.58
POSITIVE LOGITS
THING
1.22
conceivable
0.94
thin
0.91
body
0.91
WHERE
0.91
where
0.89
ONE
0.83
else
0.79
imaginable
0.77
remotely
0.73
Activations Density 0.056%