INDEX
    Explanations

    specific references to technology, communication devices, and security concerns

    Negative Logits
    "Diweddarwch"    -0.70
    " zarówno"       -0.69
    " nakalista"     -0.63
    " zowel"         -0.63
    " titolata"      -0.60
    " především"     -0.59
    " appunto"       -0.59
    "fjspx"          -0.59
    "期刊论文"        -0.58
    " gekomen"       -0.58
    Positive Logits
    " while"         0.90
    " whilst"        0.78
    " mientras"      0.75
    " WHILE"         0.72
    " pretending"    0.71
    " sambil"        0.66
    " lmao"          0.65
    " enquanto"      0.65
    " every"         0.64
    "mientras"       0.63
    Act Density 0.987%

    No Known Activations