INDEX
Explanations
references to political affiliations and party identifiers, particularly in the context of social issues
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.09
3:0.06
4:0.17
5:0.03
6:0.05
7:0.28
8:0.02
9:0.04
10:0.07
11:0.09
Negative Logits
DEBUG
-1.79
FY
-1.70
Reviewer
-1.59
erity
-1.59
leave
-1.57
partName
-1.56
LEASE
-1.53
Scale
-1.52
IAS
-1.48
UTC
-1.44
POSITIVE LOGITS
sov
1.48
Barnes
1.47
portfolio
1.45
auga
1.35
Lank
1.32
Chinatown
1.32
Lanka
1.30
oshenko
1.29
assemb
1.28
downt
1.28
Activations Density 0.000%