INDEX
Explanations
terms related to train derailments
occurrences of the word "derail" and its variations, indicating discussions around disruption or accidents
New Auto-Interp
Negative Logits
ternity
-0.72
Kinnikuman
-0.72
anamo
-0.69
Reviewer
-0.67
Reach
-0.67
estate
-0.66
iping
-0.66
umption
-0.64
guyen
-0.63
archs
-0.63
POSITIVE LOGITS
derail
1.61
derailed
1.33
ments
0.85
freight
0.80
Amtrak
0.76
wreck
0.74
wagon
0.74
crew
0.73
ment
0.71
bridge
0.70
Activations Density 0.008%