INDEX
Explanations
phrases related to accidents involving derailments or actions meant to hinder or harm something
instances of the word "derail" and its variations, as well as any occurrences of the word "sabotage."
New Auto-Interp
Negative Logits
arden
-0.70
administered
-0.66
rix
-0.65
atos
-0.63
addy
-0.63
recogn
-0.62
Hispanic
-0.62
Ps
-0.61
Palm
-0.60
elman
-0.60
POSITIVE LOGITS
derail
4.08
derailed
2.53
sabotage
1.34
sabot
1.21
torpedo
1.12
rupture
1.08
Amtrak
1.05
topple
1.01
wreck
0.98
brake
0.97
Activations Density 0.014%