INDEX
    Explanations

    references to airplanes or aircraft

    New Auto-Interp
    Negative Logits
    ãģį
    -0.70
    女
    -0.68
    UGC
    -0.68
    FINE
    -0.66
    å¦
    -0.66
    Interstitial
    -0.65
    ãĤ±
    -0.65
     Bened
    -0.65
     Tablet
    -0.64
     CoC
    -0.64
    POSITIVE LOGITS
    liner
    1.50
    liners
    1.29
    ting
    1.03
     airliner
    1.02
    fare
    0.99
     jets
    0.95
    planes
    0.95
     flown
    0.94
    ted
    0.92
    pack
    0.92
    Act Density 0.017%

    No Known Activations