INDEX
Explanations
mentions of geographical locations or political jurisdictions
terms related to territory, nationality, and legal authorization in relation to different countries
New Auto-Interp
Negative Logits
rieg
-0.70
urger
-0.69
develop
-0.68
usters
-0.64
onge
-0.64
KING
-0.64
Said
-0.63
Ut
-0.62
yss
-0.62
struct
-0.61
POSITIVE LOGITS
azeera
0.89
isine
0.70
tops
0.69
ombat
0.66
grounds
0.66
consulate
0.66
abroad
0.66
tones
0.65
renheit
0.63
icals
0.63
Activations Density 0.350%