INDEX
    Explanations

    references to charitable assistance and support for individuals in need

    New Auto-Interp
    Negative Logits
    ãĥIJãĥ¼
    -0.17
    spacer
    -0.16
    sov
    -0.15
    aterno
    -0.14
    олж
    -0.14
    AGO
    -0.14
    ê¶Į
    -0.14
    ALES
    -0.14
    виÑĤ
    -0.13
    adesh
    -0.13
    POSITIVE LOGITS
    tabs
    0.15
     tabs
    0.15
    -gnu
    0.15
     tab
    0.14
     rap
    0.14
    lesi
    0.13
    osex
    0.13
    zc
    0.13
     ign
    0.13
    º«
    0.13
    Act Density 0.018%

    No Known Activations