INDEX
Explanations
references to specific national or ethnic identities
New Auto-Interp
Negative Logits
Arabia
-0.35
France
-0.33
Finland
-0.33
Romania
-0.33
Denmark
-0.33
Client
-0.33
France
-0.33
AUSTRALIA
-0.33
Switzerland
-0.32
Sverige
-0.32
POSITIVE LOGITS
inian
0.80
vician
0.78
sonian
0.76
Chwiliwch
0.76
Италијани
0.75
casian
0.75
onian
0.75
conian
0.72
wegian
0.72
Infórmanos
0.72
Activations Density 0.408%