INDEX
    Explanations

    expressions of emotional states or judgments

    New Auto-Interp
    Negative Logits
     Alman
    -0.15
    елÑİ
    -0.14
     Tab
    -0.14
    Uploader
    -0.14
    istr
    -0.13
    uez
    -0.13
    ÑĽ
    -0.13
    ,'#
    -0.13
    mund
    -0.13
    498
    -0.13
    POSITIVE LOGITS
    evin
    0.17
    ulo
    0.17
    adu
    0.15
    elon
    0.15
    éĮ²
    0.15
    ospace
    0.14
     Hart
    0.14
    antine
    0.14
    elter
    0.14
    aton
    0.14
    Act Density 0.002%

    No Known Activations