INDEX
Explanations
references to the country 'China'
mentions of "China"
New Auto-Interp
Negative Logits
nor
-0.77
merce
-0.72
Bram
-0.70
detail
-0.69
profit
-0.69
MAT
-0.68
numbered
-0.68
phis
-0.67
ces
-0.67
esters
-0.67
POSITIVE LOGITS
Jinping
1.09
China
1.03
China
1.02
yuan
0.92
PLA
0.86
Yuan
0.86
Hua
0.86
jing
0.86
Xin
0.85
ijing
0.85
Activations Density 0.022%