    Explanations

    concepts related to societal issues and challenges

    Negative Logits
    Token        Logit
    "ildo"       -0.15
    "uhan"       -0.14
    "ifax"       -0.14
    "ícia"       -0.13
    "erge"       -0.13
    "agal"       -0.13
    "666"        -0.13
    "ندق"        -0.13
    "uler"       -0.13
    " Leafs"     -0.13
    Positive Logits
    Token        Logit
    " lately"    0.37
    " since"     0.30
    " recently"  0.25
    " recent"    0.24
    "since"      0.24
    "以来"       0.22
    "recent"     0.22
    "ince"       0.21
    " Since"     0.20
    "Recently"   0.19
    Activation Density: 0.867%

    No Known Activations