INDEX
Explanations
words ending in "-om"
instances of the word "om."
New Auto-Interp
Negative Logits
++++++++++++++++
-0.68
Actual
-0.64
LIST
-0.62
nexus
-0.60
VO
-0.60
overpower
-0.58
pork
-0.58
compuls
-0.58
Veil
-0.56
actions
-0.56
POSITIVE LOGITS
orrow
1.14
obile
1.11
puter
1.11
otive
1.07
atoes
1.04
edia
1.04
useum
0.98
essage
0.97
merce
0.96
psons
0.96
Activations Density 0.022%