INDEX
Explanations
mentions of politicians and their actions or statements regarding specific policies or issues
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.18
3:0.23
4:0.10
5:0.03
6:0.02
7:0.05
8:0.05
9:0.05
10:0.10
11:0.12
Negative Logits
DragonMagazine
-1.59
ermanent
-1.40
Dragonbound
-1.40
rect
-1.33
rored
-1.30
folder
-1.26
isd
-1.24
ratulations
-1.24
ezvous
-1.21
ickets
-1.20
POSITIVE LOGITS
Corinth
1.60
unfairly
1.59
ruining
1.57
seeming
1.46
causing
1.44
wcs
1.43
cause
1.42
misinterpret
1.42
nor
1.40
etz
1.37
Activations Density 0.484%