INDEX
Explanations
References to notable individuals, particularly those with titles such as "Sir."
Tokens starting with "Sir", "Cir", or "Mir"
Sir + names, Cir + rious
Negative Logits
Token            Logit
ItemBackground   -0.88
enance           -0.71
paksa            -0.70
kamen            -0.70
esternos         -0.70
Mako             -0.69
комму            -0.68
amation          -0.67
Beal             -0.66
ogeneity         -0.66
Positive Logits
Token            Logit
ir               0.87
CIR              0.87
Cir              0.84
Kir              0.83
Sira             0.83
hir              0.82
KIR              0.81
SIR              0.80
Kir              0.79
IR               0.79
Activations Density: 0.258%