INDEX
    Explanations

    language related to dissatisfaction and complaints

    New Auto-Interp
    Negative Logits
     Král
    -0.15
    ëĭ´
    -0.14
    ewis
    -0.14
    ä½µ
    -0.14
     Alman
    -0.13
    wik
    -0.13
     Herm
    -0.13
    anos
    -0.13
     vide
    -0.13
    unden
    -0.13
    POSITIVE LOGITS
     o
    0.18
     na
    0.16
     dem
    0.15
     nau
    0.15
     chai
    0.15
     vex
    0.15
     ga
    0.15
     dia
    0.15
    å·¥ä¸ļ
    0.14
    park
    0.14
    Act Density 0.016%

    No Known Activations