INDEX
Explanations
references to Arab identity and its various contexts
New Auto-Interp
Negative Logits
TING
-0.16
iro
-0.16
chers
-0.15
yon
-0.14
ignKey
-0.14
flip
-0.14
wit
-0.14
urb
-0.14
macros
-0.14
aran
-0.13
POSITIVE LOGITS
ipel
0.18
-Israel
0.17
isation
0.16
-American
0.16
Gulf
0.15
-major
0.15
net
0.15
-speaking
0.15
/black
0.15
ized
0.14
Activations Density 0.004%