INDEX
Explanations
references to institutions and organizations in a political context
New Auto-Interp
Negative Logits
sqor
-0.68
foundland
-0.65
MpServer
-0.62
ranging
-0.60
iannopoulos
-0.59
range
-0.59
eworthy
-0.56
erville
-0.56
ãĤ¨ãĥ«
-0.55
rising
-0.55
POSITIVE LOGITS
intervened
1.08
deems
1.03
decides
0.96
hadn
0.92
interfered
0.92
couldn
0.88
refuses
0.86
approves
0.85
considers
0.85
interven
0.84
Activations Density 0.337%