INDEX
Explanations
options or choices in a list
the word "or" in various contexts
New Auto-Interp
Negative Logits
ocracy
-0.84
ocratic
-0.71
Pony
-0.68
Material
-0.66
ocrat
-0.64
vertising
-0.64
ENE
-0.62
Words
-0.61
ocrats
-0.61
ETS
-0.59
POSITIVE LOGITS
chard
1.32
acle
1.28
alternatively
1.26
chid
1.26
acles
1.19
Else
1.11
ifice
1.09
nam
1.09
acular
1.06
otherwise
1.02
Activations Density 0.168%