Explanations
references to Chinese and Asian ethnicities or cultural elements
Negative Logits
AntiForgeryToken    -0.38
utum                -0.35
Bof                 -0.34
garantizar          -0.34
Лит                 -0.33
Ife                 -0.33
numerusform         -0.32
erhalten            -0.32
encar               -0.31
ENI                 -0.31
Positive Logits
Chinese             1.09
Chinese             1.03
chinese             1.02
chinois             0.95
CHINESE             0.90
Chine               0.87
China               0.84
chines              0.83
китай               0.81
China               0.81
Activations Density: 1.568%