INDEX
Explanations
different organizations and affiliates
phrases indicating relationships or components of entities
New Auto-Interp
Negative Logits
fw
-0.80
warranted
-0.73
ctive
-0.70
tics
-0.65
Reply
-0.64
oult
-0.64
apy
-0.63
fters
-0.62
metry
-0.62
fter
-0.62
POSITIVE LOGITS
famed
0.83
Britain
0.81
sorts
0.80
etime
0.75
the
0.71
whom
0.70
British
0.68
disgr
0.67
Belgium
0.67
Confederation
0.67
Activations Density 0.293%