INDEX
Explanations
words with the suffix “-ter” or related phonetic patterns
New Auto-Interp
Negative Logits
sworth
-0.25
sphere
-0.22
speech
-0.20
sw
-0.19
slide
-0.19
scribe
-0.19
sword
-0.18
sn
-0.18
s
-0.18
ships
-0.18
POSITIVE LOGITS
a
0.28
o
0.26
oom
0.25
aft
0.24
ter
0.23
g
0.23
gent
0.23
getic
0.23
gency
0.23
al
0.23
Activations Density 0.103%