INDEX
    Explanations

    text related to the messaging platform "WhatsApp"

    variations of the word "Whats" or "what's"

    New Auto-Interp
    Negative Logits
    ³³³
    -0.77
     differential
    -0.75
    CVE
    -0.66
    eers
    -0.65
    eering
    -0.63
     Roh
    -0.63
     Franch
    -0.62
     compens
    -0.61
     demolition
    -0.60
     Crus
    -0.59
    POSITIVE LOGITS
    app
    1.11
    ocial
    1.05
    bour
    1.00
    App
    0.99
    omething
    0.96
    iques
    0.92
    creen
    0.89
    ername
    0.86
    alon
    0.84
    peed
    0.82
    Act Density 0.024%

    No Known Activations