INDEX
Explanations
references to historical events and memorials
New Auto-Interp
Negative Logits
ifar
-0.17
Aviv
-0.16
Dah
-0.16
Sir
-0.15
iland
-0.15
oco
-0.15
lag
-0.15
ìķĶ
-0.14
Sir
-0.14
lehem
-0.14
POSITIVE LOGITS
Asian
0.30
Asian
0.26
Oriental
0.26
Asians
0.25
orient
0.23
asian
0.22
Asi
0.22
Orient
0.22
orient
0.21
Chin
0.21
Activations Density 0.129%