INDEX
Explanations
words related to choices or alternatives
phrases that include the word "or."
New Auto-Interp
Negative Logits
Pony
-0.88
ocrats
-0.69
onday
-0.69
ulhu
-0.69
ocrat
-0.67
ocracy
-0.66
efer
-0.64
Frog
-0.63
RPG
-0.63
auga
-0.63
POSITIVE LOGITS
ifice
1.30
acles
1.27
acle
1.24
chard
1.21
nam
1.18
Else
1.18
nery
1.15
leans
1.08
chid
1.07
otherwise
1.06
Activations Density 0.144%