INDEX
Explanations
references to media and public figures in articles
New Auto-Interp
Head Attr Weights
0:0.08
1:0.02
2:0.03
3:0.14
4:0.09
5:0.06
6:0.05
7:0.16
8:0.10
9:0.03
10:0.16
11:0.03
Negative Logits
differe
-2.37
orem
-2.30
regul
-2.29
afterward
-2.27
soph
-2.25
princ
-2.23
STL
-2.21
prin
-2.18
":-
-2.17
reforms
-2.15
POSITIVE LOGITS
Liverpool
3.08
Liverpool
2.97
Belfast
2.87
Anfield
2.86
£
2.85
NRL
2.85
Trafford
2.79
£
2.60
whisky
2.52
Cardiff
2.47
Activations Density 0.004%