INDEX
    Explanations

    social media platforms and locations

    references to social media or online platforms

    Negative Logits
     ACTIONS    -0.66
    fully       -0.64
    VALUE       -0.63
    ively       -0.63
    league      -0.62
    FE          -0.62
     shelters   -0.61
     Colleges   -0.61
     colleges   -0.60
    RO          -0.59
    Positive Logits
    edin        1.12
    agram       0.89
    ucer        0.83
    osaurs      0.81
    emed        0.80
    til         0.78
     "$:/       0.77
    ogl         0.76
    ssl         0.74
    amus        0.73
    Act Density 0.014%

    No Known Activations