INDEX
Explanations
phrases indicating a connection, comparison, or alternation
conditional phrases expressing possibility or likelihood
New Auto-Interp
Negative Logits
Pony
-0.89
Cth
-0.88
mummy
-0.65
pandemonium
-0.61
Steel
-0.60
Centauri
-0.60
Rocket
-0.60
Columb
-0.59
Chow
-0.58
Rainbow
-0.58
POSITIVE LOGITS
acles
1.38
acular
1.26
chard
1.25
acle
1.24
leans
1.15
GAN
1.15
chid
1.12
Else
1.10
nery
1.08
lando
1.02
Activations Density 0.074%