Explanations
repeated suffixes, particularly those ending with "on" or "ons."
Negative Logits
Token      Logit
owl        -0.15
T          -0.14
on         -0.14
pat        -0.14
ouch       -0.14
Jun        -0.14
anship     -0.13
Sibling    -0.13
atum       -0.13
rom        -0.13
Positive Logits
Token      Logit
екÑģи      0.17
aires      0.17
forge      0.16
naire      0.16
voir       0.15
exion      0.15
UPLE       0.15
.infinity  0.15
AMESPACE   0.15
Äįer       0.15
Activations Density 0.075%