INDEX
Explanations
countries or locations
mentions of countries
New Auto-Interp
Negative Logits
erb
-0.70
apple
-0.69
essen
-0.69
upper
-0.68
lihood
-0.67
Sym
-0.67
////
-0.65
ãģŁ
-0.64
Ö¼
-0.63
hooting
-0.63
POSITIVE LOGITS
invaded
0.86
Airlines
0.79
's
0.79
descended
0.76
annexed
0.70
governed
0.70
Arabia
0.69
legisl
0.69
anasia
0.69
anism
0.68
Activations Density 0.179%