INDEX
Explanations
names or titles, especially those with the word "Baron"
proper nouns, particularly names and titles associated with individuals
Negative Logits
Token         Logit
mable         -0.79
DN            -0.72
extrem        -0.67
FORMATION     -0.65
agents        -0.63
mitting       -0.63
************  -0.63
release       -0.63
skelet        -0.62
EMS           -0.62
Positive Logits
Token       Logit
Baron       1.04
ess         1.00
Mord        0.94
esses       0.93
fman        0.85
stein       0.82
ham         0.82
Rothschild  0.78
esis        0.78
alist       0.77
Activations Density: 0.010%