INDEX
    Explanations

    email-related phrases, specifically mentions about updates and offers

    New Auto-Interp
    Negative Logits
     Galile
    -0.68
     Rim
    -0.58
    eur
    -0.56
     idol
    -0.54
    Arg
    -0.52
    pedia
    -0.52
     Goose
    -0.52
    ries
    -0.51
     ammonia
    -0.51
    agate
    -0.51
    POSITIVE LOGITS
    ilver
    0.70
    weet
    0.68
    hooting
    0.66
    iberal
    0.65
    ettings
    0.65
    âĺħ
    0.64
    ooters
    0.64
    ilty
    0.63
    daily
    0.63
    scl
    0.63
    Act Density 0.021%

    No Known Activations