INDEX
Explanations
references to China and its related geographical, political, and cultural entities
New Auto-Interp
Negative Logits
PMailer
-0.91
WAUKEE
-0.77
IComponent
-0.73
Datuak
-0.68
bVar
-0.68
swick
-0.66
tayl
-0.66
Walkover
-0.66
שוליים
-0.65
SuccessListener
-0.65
POSITIVE LOGITS
China
1.29
China
1.14
CHINA
1.10
CHINA
1.06
Chinese
0.99
china
0.97
Chinese
0.83
中国
0.81
Cina
0.80
chinois
0.80
Activations Density 0.082%