INDEX
Explanations
mentions of political figures and entities
Head Attr Weights
0:0.07
1:0.06
2:0.02
3:0.07
4:0.09
5:0.36
6:0.05
7:0.04
8:0.06
9:0.06
10:0.03
11:0.02
Negative Logits
stocks        -2.27
%%%%          -2.07
bart          -1.96
eternity      -1.87
Collider      -1.84
relegation    -1.83
cro           -1.83
erald         -1.82
ixtures       -1.80
promotions    -1.79
Positive Logits
versus        2.65
vs            2.26
secondly      2.25
renheit       2.11
also          1.95
preceded      1.94
differs       1.94
punishable    1.92
Pg            1.92
Secondly      1.91
Activations Density 0.015%