INDEX
Explanations
mentions of the country "China"
mentions of the country China
New Auto-Interp
Negative Logits
Bl
-0.83
Sel
-0.81
D
-0.80
St
-0.79
Al
-0.78
New
-0.77
First
-0.77
the
-0.77
B
-0.77
Red
-0.76
POSITIVE LOGITS
India
2.37
Canada
2.34
China
2.22
Australia
2.09
Pakistan
2.01
Japan
2.00
Italy
1.99
Germany
1.96
Ireland
1.95
France
1.92
Activations Density 0.051%