INDEX
Explanations
references to the United States in various contexts
Follows "United" and refers to countries
United States, United Kingdom, United Nations
New Auto-Interp
Negative Logits
متعلقه
-0.70
myſelf
-0.66
itſelf
-0.65
NSCoder
-0.64
Efq
-0.64
poffe
-0.63
گاب
-0.63
Демографія
-0.62
raiſ
-0.60
whoſe
-0.59
POSITIVE LOGITS
States
1.20
Kingdom
0.99
States
0.90
Nations
0.88
states
0.87
STATES
0.82
Kingdom
0.81
kingdom
0.75
United
0.68
KINGDOM
0.68
Activations Density 0.093%