INDEX
Explanations
nationalist rhetoric policy changes
New Auto-Interp
Negative Logits
DidEnter
0.50
确定
0.47
东
0.46
饲
0.46
焦虑
0.45
anxieties
0.44
cít
0.43
तिथि
0.43
quotidien
0.42
ාවිත
0.42
POSITIVE LOGITS
patriots
0.50
U
0.47
хороший
0.46
ಹ್
0.46
엄청
0.45
totally
0.45
patriotic
0.45
хорошие
0.44
terrific
0.44
Rusia
0.43
Activations Density 0.007%