INDEX
Explanations
words related to kingdoms and royalty
references to various kingdoms and their contexts within the text
Negative Logits
enegger   -0.77
CAST      -0.75
lder      -0.73
esters    -0.72
inez      -0.71
matter    -0.70
lus       -0.66
bats      -0.65
pos       -0.63
eman      -0.63
Positive Logits
DOM       1.03
Hearts    1.00
Arabian   0.93
wide      0.92
Halls     0.83
doms      0.82
loo       0.81
Arabia    0.78
kingdom   0.77
Kingdom   0.76
Activations Density 0.013%
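
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes that activation density means the percentage of tokens on which the feature's activation exceeds a threshold. That definition, the function name `activation_density`, the threshold of 0.0, and the sample data are all assumptions made for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumed definition: density = 100 * (active tokens) / (all tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 13 of 100,000 tokens.
sample = [0.0] * 99_987 + [1.5] * 13
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.013% would correspond to the feature firing on roughly 13 tokens in every 100,000, which would fit a feature tied to a narrow topic such as kingdoms.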