INDEX
Explanations
references to kings and significant royal figures
New Auto-Interp
Negative Logits
sed
-0.17
Kraj
-0.16
alous
-0.16
sar
-0.16
sik
-0.16
scape
-0.15
sse
-0.15
leck
-0.15
okit
-0.15
sert
-0.14
POSITIVE LOGITS
pin
0.34
fish
0.32
dom
0.31
pins
0.31
ston
0.28
ergarten
0.26
ht
0.24
lear
0.24
maker
0.23
DOM
0.23
Activations Density 0.028%