INDEX
    Explanations

    references to newsletters

    references to newsletters

    New Auto-Interp
    Negative Logits
    eur
    -0.83
    zh
    -0.74
    urally
    -0.73
     Gou
    -0.69
    gro
    -0.68
     Seah
    -0.66
    weak
    -0.65
    venge
    -0.65
    naire
    -0.64
    roo
    -0.64
    POSITIVE LOGITS
    insula
    0.88
    advertising
    0.84
    agascar
    0.79
     Shutterstock
    0.78
     newsletters
    0.72
    vine
    0.70
    isode
    0.69
     Flavoring
    0.68
    ervative
    0.68
     Consent
    0.66
    Act Density 0.015%

    No Known Activations