INDEX
Explanations
mentions of airplane crashes
mentions of "plane" in various contexts
New Auto-Interp
Negative Logits
laus
-0.92
FINE
-0.78
PUT
-0.76
essee
-0.72
Cumber
-0.71
shire
-0.70
UGC
-0.68
uces
-0.67
lishes
-0.67
optional
-0.66
POSITIVE LOGITS
walk
1.09
airliner
1.03
walker
0.98
prope
0.98
plane
0.97
airplanes
0.97
hangar
0.96
wreckage
0.94
plane
0.93
planes
0.93
Activations Density 0.020%