INDEX
Explanations
words related to political figures and administrative posts
names of individuals and locations
New Auto-Interp
Negative Logits
Seym
-0.79
matic
-0.78
Pend
-0.75
Ambro
-0.74
ruciating
-0.72
matically
-0.71
Able
-0.70
matical
-0.70
TPPStreamerBot
-0.68
oga
-0.68
POSITIVE LOGITS
enos
1.08
eny
1.02
eno
0.84
arios
0.79
chant
0.76
loe
0.74
cyclopedia
0.74
zyme
0.71
semble
0.69
ymes
0.68
Activations Density 0.014%