INDEX
Explanations
references to specific aviation incidents and casualties
Negative Logits
dil        -0.16
ayo        -0.16
iculo      -0.15
Ùĥرة       -0.14
NON        -0.14
cars       -0.14
icap       -0.14
anio       -0.14
rippling   -0.14
Cooler     -0.14
Positive Logits
crash      0.24
crashed    0.20
Crash      0.20
Flight     0.20
crashes    0.18
olik       0.17
Flight     0.16
autop      0.16
flight     0.15
crashing   0.15
Activations Density 0.028%