INDEX
    Explanations

    words related to resistance and opposition

    Negative Logits
     Sing         -0.17
    rana          -0.17
    EMA           -0.15
    abh           -0.15
    stdexcept     -0.15
    ç¾            -0.14
     Civil        -0.14
    γεν           -0.14
    tsy           -0.14
    annotations   -0.14

    Positive Logits
    ousand        0.17
    Ïģιά          0.16
    ä¸Ŀ           0.15
    udies         0.14
    .reg          0.14
    alu           0.14
    daughter      0.14
    chte          0.13
    ipp           0.13
    cc            0.13

    Activation Density: 0.274%

    No Known Activations