INDEX
    Explanations

    the presence of the word "in"

    Negative Logits
    " plurality"    -0.78
    " stumble"      -0.74
    "ropolitan"     -0.74
    "Ħ¢"            -0.72
    " flaw"         -0.69
    "astery"        -0.69
    " aber"         -0.69
    "SPONSORED"     -0.68
    "eele"          -0.68
    " laps"         -0.68
    Positive Logits
    "pet"           0.81
    "beard"         0.80
    "vous"          0.80
    "ayers"         0.77
    "tips"          0.71
    "anger"         0.69
    "iki"           0.69
    "giving"        0.69
    "mast"          0.68
    "venge"         0.67
    Activation Density 0.000%

    No Known Activations