Explanations
country names, particularly the United States and United Kingdom
names of countries and cities, particularly emphasizing the term "United."
Negative Logits
————                -0.61
âĻ                  -0.60
notation            -0.60
────                -0.56
these               -0.55
"<                  -0.55
overt               -0.54
►                   -0.54
unde                -0.54
****************    -0.54
Positive Logits
foundland           0.76
etheless            0.70
luaj                0.62
Hudson              0.60
龍喚士              0.58
esses               0.58
amins               0.57
Strait              0.56
taboola             0.55
odium               0.55
Activations Density 0.415%