INDEX
Explanations
names of specific individuals with high activation values, particularly "Ernest"
names and terms associated with individuals and locations
Negative Logits
Token         Logit
cius          -0.78
Psychiatry    -0.75
isky          -0.75
=~=~          -0.74
lette         -0.71
verage        -0.71
NB            -0.66
FF            -0.65
vin           -0.64
utions        -0.63
Positive Logits
Token         Logit
prises        0.88
ally          0.87
sburg         0.87
ript          0.85
eday          0.79
prise         0.78
eers          0.77
eer           0.73
ypes          0.72
eded          0.72
Activations Density 0.019%
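
For a sense of scale on the density figure, here is a minimal sketch. It assumes the density is the fraction of token positions on which the feature fires, which this page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The page itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 19 of them active.
sample = [0.0] * 99_981 + [1.5] * 19
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.019% corresponds to roughly 19 active positions per 100,000 tokens.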