INDEX
Explanations
phrases including the word "leader."
occurrences of the suffix "er" in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.96
acebook
-0.71
chwitz
-0.69
eenth
-0.69
raltar
-0.68
atform
-0.65
luaj
-0.62
urities
-0.62
ertodd
-0.61
uncture
-0.60
POSITIVE LOGITS
jee
1.01
lein
0.95
idge
0.88
ger
0.86
adish
0.84
lich
0.82
rors
0.82
aton
0.81
baum
0.79
asures
0.79
Activations Density 0.051%