INDEX
Explanations
mentions of arrests and legal actions
New Auto-Interp
Negative Logits
uxxxx
-0.70
oportunidades
-0.55
löytyy
-0.54
dinosaurio
-0.54
Italij
-0.53
perspectiva
-0.53
Gästen
-0.52
blessés
-0.51
Socialista
-0.50
zahrani
-0.50
POSITIVE LOGITS
plat
0.88
Plat
0.76
propOrder
0.67
Plat
0.66
Proto
0.66
Proto
0.65
mand
0.61
PLAT
0.59
Pt
0.59
proto
0.58
Activations Density 0.171%