INDEX
Explanations
mentions of ambiguity or uncertainty
the word "something" in various contexts
Negative Logits
de        -0.69
assis     -0.67
interest  -0.65
follow    -0.64
DOM       -0.64
spread    -0.63
lead      -0.63
use       -0.63
ignore    -0.62
bourg     -0.61
Positive Logits
Else              1.30
else              1.05
innocuous         0.81
intangible        0.79
resembling        0.79
ĪĴ                0.78
nutritious        0.77
incomprehensible  0.77
nutrit            0.76
tangible          0.75
Activations Density 0.036%