INDEX
Explanations
references to airplanes or aircraft
New Auto-Interp
Negative Logits
ãģį
-0.70
女
-0.68
UGC
-0.68
FINE
-0.66
å¦
-0.66
Interstitial
-0.65
ãĤ±
-0.65
Bened
-0.65
Tablet
-0.64
CoC
-0.64
POSITIVE LOGITS
liner
1.50
liners
1.29
ting
1.03
airliner
1.02
fare
0.99
jets
0.95
planes
0.95
flown
0.94
ted
0.92
pack
0.92
Activations Density 0.017%