INDEX
    Explanations

    expressions of kindness and community involvement

    New Auto-Interp
    Negative Logits
    anga
    -0.16
    aris
    -0.15
    abic
    -0.14
    LEAN
    -0.14
    semb
    -0.14
    pec
    -0.14
    UBY
    -0.14
     Bates
    -0.14
    ès
    -0.14
    rst
    -0.14
    POSITIVE LOGITS
     patron
    0.16
     pac
    0.15
     opt
    0.15
    cheon
    0.15
     troop
    0.15
    jug
    0.15
     badly
    0.15
    Rendering
    0.15
     allot
    0.15
    kiye
    0.14
    Act Density 0.056%

    No Known Activations