INDEX
Explanations
references to royalty or monarchy, particularly focusing on Queen-related terms
references to the Queen, specifically in relation to her name and titles
Negative Logits
Token     Logit
sych      -0.73
kson      -0.70
letcher   -0.69
herer     -0.69
odcast    -0.67
ramid     -0.67
razil     -0.64
aneous    -0.63
unaff     -0.63
Extend    -0.63
Positive Logits
Token      Logit
Anne       1.09
pin        0.92
Elizabeth  0.88
Mother     0.87
stown      0.86
Mary       0.83
Queen      0.83
Victoria   0.81
Bee        0.79
pins       0.79
Activations Density: 0.019%