INDEX
Explanations
proper nouns related to different entities such as people, places, and organizations
references to the name "Mills" and related entities
New Auto-Interp
Negative Logits
liest
-0.84
fierce
-0.75
stern
-0.72
itably
-0.69
wealth
-0.65
Sovereign
-0.64
Saddam
-0.64
ctory
-0.64
handsome
-0.63
SPONSORED
-0.62
POSITIVE LOGITS
Mills
1.35
pora
0.97
mosqu
0.94
boro
0.93
hirt
0.87
mallow
0.87
teasp
0.86
wich
0.84
Osw
0.84
psey
0.83
Activations Density 0.007%