INDEX
    Explanations

    statements regarding authority and agency in communication

    New Auto-Interp
    Negative Logits
    plib
    -0.15
    ubl
    -0.15
    anzi
    -0.15
    åĴ²
    -0.15
    inand
    -0.15
     sund
    -0.14
    alla
    -0.14
    Kernel
    -0.14
    гаÑĢ
    -0.14
    (kernel
    -0.13
    POSITIVE LOGITS
    plex
    0.15
    ann
    0.14
    387
    0.14
    aliz
    0.14
    ÃŃc
    0.14
    Reuse
    0.14
     Keys
    0.14
    ques
    0.13
    722
    0.13
     gest
    0.13
    Act Density 0.220%

    No Known Activations