INDEX
Explanations
phrases related to political entities, specifically references to republics
New Auto-Interp
Negative Logits
AsUp
-0.90
-0.82
########.
-0.80
tonode
-0.79
cetamol
-0.79
expandindo
-0.79
Kidman
-0.76
gdx
-0.76
ग्राहक
-0.75
haired
-0.75
POSITIVE LOGITS
Republic
1.42
Republic
1.29
republic
1.09
REPUBLIC
0.93
republic
0.83
République
0.82
republics
0.77
Republik
0.75
Republica
0.75
Republics
0.74
Activations Density 0.008%