INDEX
Explanations
instances of "the" and phrases emphasizing the concept of politics
New Auto-Interp
Negative Logits
riel
-0.07
%C
-0.07
MÃľ
-0.07
коÑĤ
-0.07
cé
-0.07
FAILED
-0.07
Mezi
-0.07
rint
-0.07
lech
-0.07
raya
-0.06
POSITIVE LOGITS
White
0.07
nation
0.07
Win
0.06
Shank
0.06
359
0.06
Clare
0.06
White
0.06
Carpenter
0.05
country
0.05
House
0.05
Activations Density 0.030%