INDEX
Explanations
mentions of the United States
"U" followed by a period or letter
U.S. mentions
New Auto-Interp
Negative Logits
^(@)
-0.79
ivelany
-0.75
\%$\\
-0.75
Flask
-0.71
nakalista
-0.71
Mandate
-0.71
thiệu
-0.70
Mascot
-0.70
riff
-0.70
المعيارى
-0.69
POSITIVE LOGITS
U
0.97
U
0.84
o
0.67
u
0.65
u
0.63
У
0.58
У
0.57
pro
0.55
ў
0.54
uitz
0.53
Activations Density 0.110%