INDEX
Explanations
mentions of accidents, disasters, or emergencies
New Auto-Interp
Negative Logits
icrobial
-0.76
ROR
-0.73
itia
-0.71
rolog
-0.69
addon
-0.69
Ľ
-0.67
eele
-0.65
arov
-0.65
achu
-0.65
ipedia
-0.64
POSITIVE LOGITS
worthiness
1.13
Dive
0.93
crashes
0.90
crash
0.87
course
0.81
wreck
0.80
dumps
0.78
Course
0.76
Crash
0.74
Crash
0.73
Activations Density 0.035%