INDEX
Explanations
proper nouns, especially related to political figures and specific organizations
references to pointing or directing attention towards specific subjects or individuals
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.11
3:0.06
4:0.06
5:0.06
6:0.04
7:0.03
8:0.33
9:0.13
10:0.05
11:0.02
Negative Logits
conservancy
-1.22
soever
-1.21
roy
-1.20
interstitial
-1.18
repaired
-1.15
emouth
-1.08
sembly
-1.08
ashington
-1.08
grapp
-1.06
lymp
-1.06
POSITIVE LOGITS
dial
1.31
Genocide
1.22
zinski
1.20
causation
1.14
TextColor
1.11
veto
1.11
flashing
1.10
enance
1.09
azar
1.09
urai
1.07
Activations Density 0.046%