    Negative Logits
    Token                   Logit
    "lasses"                -0.88
    "ectar"                 -0.70
    " Tune"                 -0.65
    "çīĪ"                   -0.63
    "orf"                   -0.61
    "quickShipAvailable"    -0.61
    " Haram"                -0.61
    "antle"                 -0.60
    "bane"                  -0.60
    "obyl"                  -0.59
    Positive Logits
    Token                   Logit
    " petitions"            0.99
    " petition"             0.98
    "ers"                   0.91
    "naires"                0.88
    " filed"                0.85
    "ing"                   0.84
    " Petition"             0.82
    "aires"                 0.82
    "ingham"                0.81
    " signatures"           0.79
    Activation density: 0.010%

    No Known Activations