INDEX
Explanations
proper nouns related to media, law, and investigative reporting
Head Attr Weights
0:0.07
1:0.09
2:0.05
3:0.05
4:0.06
5:0.29
6:0.04
7:0.03
8:0.05
9:0.07
10:0.09
11:0.06
Negative Logits
bear: -1.93
Osiris: -1.85
Legend: -1.73
number: -1.72
renheit: -1.71
only: -1.70
-+: -1.66
mine: -1.65
fred: -1.65
leader: -1.63
Positive Logits
assetsadobe: 1.86
ewitness: 1.81
compr: 1.79
piv: 1.74
illy: 1.73
extrap: 1.72
guiActiveUn: 1.72
inflamm: 1.72
complied: 1.65
)=(: 1.64
Activations Density 0.001%