INDEX
Explanations
specific mentions of airlines in text
references to airlines
New Auto-Interp
Negative Logits
Hill
-0.75
imens
-0.71
uces
-0.71
itia
-0.68
laus
-0.68
ector
-0.66
VK
-0.65
ests
-0.64
Patriarch
-0.64
utenberg
-0.63
POSITIVE LOGITS
airline
1.06
Airlines
1.05
airlines
0.97
boarding
0.88
pigeon
0.87
hangar
0.85
carrier
0.85
carriers
0.83
passenger
0.82
flights
0.81
Activations Density 0.009%