INDEX
Explanations
proper nouns related to specific individuals
references to specific individuals and locations
New Auto-Interp
Negative Logits
MV
-0.82
om
-0.78
amn
-0.77
dat
-0.76
EVENTS
-0.72
aii
-0.71
nom
-0.70
uclear
-0.68
omn
-0.67
orig
-0.67
POSITIVE LOGITS
Stoke
3.86
Sunderland
1.49
Notting
1.48
Swansea
1.32
Chong
1.28
Norwich
1.24
Leicester
1.23
Brist
1.18
Preston
1.17
Newcastle
1.15
Activations Density 0.033%