INDEX
Explanations
references to the United States and its states, indicating focus on geographic and demographic information
New Auto-Interp
Negative Logits
labour
-0.81
Labour
-0.81
cillors
-0.79
KURZBESCHREIBUNG
-0.79
Labour
-0.77
-£
-0.76
cillor
-0.71
ExtendWith
-0.69
labour
-0.69
EUROPEAN
-0.69
POSITIVE LOGITS
America
1.08
America
1.05
🇺🇸
1.03
美国
1.03
在美国
0.97
American
0.94
امريكا
0.93
American
0.93
statunit
0.93
statunitense
0.92
Activations Density 1.889%