INDEX
Explanations
proper nouns related to political figures or events
New Auto-Interp
Negative Logits
Marketable
-0.79
incial
-0.76
_-
-0.76
INAL
-0.72
actionDate
-0.67
²¾
-0.65
ĪĴ
-0.64
«ĺ
-0.64
ormal
-0.64
ħĭ
-0.64
POSITIVE LOGITS
nesday
0.79
reens
0.74
ipeg
0.73
theless
0.68
eret
0.63
Rost
0.62
lich
0.60
hof
0.60
bucks
0.59
wright
0.59
Activations Density 3.194%