INDEX
Explanations
references to Chinese and other Asian nationalities or ethnicities
nationalities or origins
nationalities and their associated terms
New Auto-Interp
Negative Logits
گردد
-0.43
噜
-0.41
tunik
-0.41
Gelände
-0.41
ddots
-0.41
کیلو
-0.41
höhe
-0.40
škola
-0.40
škole
-0.40
joy
-0.39
POSITIVE LOGITS
Chinese
1.21
Japanese
1.20
Chinese
1.20
Russian
1.20
Russian
1.20
Japanese
1.18
Mexican
1.15
Italian
1.13
Indian
1.13
German
1.12
Activations Density 0.290%