INDEX
Explanations
mentions of political leaders and officials, particularly in the context of global events
New Auto-Interp
Negative Logits
vale
-0.17
miss
-0.17
Boss
-0.14
akit
-0.14
Opera
-0.14
utom
-0.14
ubat
-0.13
cab
-0.13
OMEM
-0.13
Situation
-0.13
POSITIVE LOGITS
Joi
0.16
ween
0.15
Nets
0.14
zas
0.14
Merk
0.14
srand
0.13
Ñģоз
0.13
iker
0.13
FOREIGN
0.13
chner
0.13
Activations Density 0.041%