INDEX
Explanations
words related to reduction or removal
the prefix "er" in various contexts and forms
New Auto-Interp
Negative Logits
ership
-0.77
Spears
-0.67
OUNT
-0.66
Premium
-0.64
kefeller
-0.63
orld
-0.63
ELS
-0.63
Painter
-0.63
INESS
-0.62
IGHTS
-0.62
POSITIVE LOGITS
asure
1.24
asures
1.02
rant
0.98
icit
0.93
aser
0.92
ogenous
0.92
mination
0.91
asing
0.90
got
0.89
rors
0.82
Activations Density 0.016%