INDEX
Explanations
references to destruction or harm
New Auto-Interp
Negative Logits
solid
-0.33
])]
-0.30
restrictive
-0.29
Süß
-0.29
ilados
-0.29
Ramb
-0.28
ijn
-0.28
itemType
-0.28
metab
-0.28
fris
-0.28
POSITIVE LOGITS
astéro
0.71
Injured
0.66
Injury
0.64
injured
0.64
betweenstory
0.63
Injury
0.63
kaarangay
0.62
ModelExpression
0.61
earthquake
0.60
Survivors
0.60
Activations Density 0.643%