INDEX
Explanations
mentions of specific countries
mentions of specific countries or geographical locations
New Auto-Interp
Negative Logits
ombo
-0.79
opt
-0.77
erc
-0.70
izza
-0.70
orno
-0.68
iott
-0.67
orsche
-0.64
ers
-0.63
obsessive
-0.63
ishi
-0.63
POSITIVE LOGITS
IVERS
0.87
Acknowled
0.80
ãĥĨ
0.78
åĤ
0.75
åij
0.74
ford
0.73
å¦
0.72
ãĥģ
0.71
д
0.69
Awakens
0.69
Activations Density 0.028%