INDEX
Explanations
mentions of locations, specifically those related to the UK
references to the United Kingdom and related entities
New Auto-Interp
Negative Logits
tons
-0.72
fingers
-0.70
bee
-0.69
chin
-0.66
¯¯¯¯
-0.66
clud
-0.65
blocks
-0.64
nea
-0.63
residues
-0.63
block
-0.61
POSITIVE LOGITS
JV
0.95
AIN
0.93
UGE
0.88
yip
0.86
MAP
0.86
UKIP
0.86
DEF
0.84
ERAL
0.83
UK
0.83
DEP
0.83
Activations Density 0.016%