INDEX
Explanations
instances or mentions of danger, near misses, and mortality
near death or destruction
New Auto-Interp
Negative Logits
gekomen
-0.34
myö
-0.31
definitely
-0.31
sicuramente
-0.30
torebka
-0.29
niemals
-0.28
giras
-0.28
firmado
-0.27
definitively
-0.27
pewno
-0.27
POSITIVE LOGITS
awtextra
0.77
تضيفلها
0.67
autorytatywna
0.66
setVerticalGroup
0.64
يميديا
0.60
ModelAdmin
0.59
beginnetje
0.58
Almost
0.57
Personensuche
0.57
almost
0.57
Activations Density 0.155%