Explanations
references to accidents and disasters, specifically involving injuries or fatalities
Negative Logits
Token           Logit
diego           -0.45
érrez           -0.42
новништво       -0.40
항              -0.40
isielt          -0.40
teraz           -0.40
зна             -0.40
stånd           -0.39
ilimit          -0.39
understanding   -0.38
Positive Logits
Token             Logit
accidents         0.92
acidente          0.91
accident          0.90
MemoryWarning     0.89
RectangleBorder   0.86
injuries          0.86
incident          0.84
accident          0.83
incidents         0.82
Injuries          0.82
Activations Density: 0.395%