INDEX
Explanations
references to various Arab ethnicities, national identities, and related geopolitical contexts
New Auto-Interp
Negative Logits
Lithuanian
-0.57
Hungarian
-0.56
Latvian
-0.54
Rè
-0.53
Kla
-0.51
питан
-0.49
Estonian
-0.48
Hungary
-0.48
Laus
-0.48
Hungarian
-0.48
POSITIVE LOGITS
Arab
1.57
Saudi
1.52
Arabic
1.48
Arabs
1.46
Arabian
1.39
Arab
1.38
ARAB
1.31
Arabia
1.30
arabe
1.30
Saudi
1.29
Activations Density 0.358%