Explanations
references to China and its related terms
Negative Logits
WAUKEE     -0.91
өз         -0.75
</h6>      -0.71
swick      -0.69
.\\        -0.69
ostavi     -0.69
.$$        -0.67
Moist      -0.66
שוליים     -0.65
ssohn      -0.64
Positive Logits
China      1.82
China      1.61
CHINA      1.48
china      1.30
CHINA      1.28
Chinese    1.17
Chine      1.09
china      1.04
จีน        1.00
中国       0.99
Activations Density: 0.055%