INDEX
Explanations
names of people or titles in military or political contexts
proper nouns and titles related to individuals and their roles
New Auto-Interp
Negative Logits
CONCLUS
-0.77
RESULTS
-0.70
cells
-0.70
irs
-0.68
MU
-0.64
1100
-0.64
3000
-0.63
Desktop
-0.62
grave
-0.62
groups
-0.61
POSITIVE LOGITS
Rodham
0.80
Richard
0.72
dinand
0.70
Salvador
0.69
Bryant
0.69
imir
0.69
Christopher
0.68
William
0.68
Fernandez
0.68
Edward
0.68
Activations Density 0.200%