INDEX
Explanations
the word "any"
phrases that generalize situations or events
New Auto-Interp
Negative Logits
rex
-0.76
endas
-0.74
combe
-0.69
icz
-0.68
rez
-0.68
ÃŁ
-0.67
appings
-0.64
seless
-0.63
staking
-0.62
lights
-0.62
POSITIVE LOGITS
THING
1.12
conceivable
1.05
body
0.88
WHERE
0.87
imaginable
0.87
particular
0.83
sort
0.78
ONE
0.77
given
0.77
foreseeable
0.77
Activations Density 0.069%