INDEX
    Explanations

    negative expressions of inability or prohibition

    Negative Logits
    izon         -0.15
    ¼åIJĪ        -0.15
    inding       -0.14
    oque         -0.14
    IMS          -0.14
    gree         -0.13
    δÏħ          -0.13
     clearing    -0.13
    sm           -0.13
    Matches      -0.13
    Positive Logits
     unm         0.16
     freopen     0.15
    distributed  0.15
    orio         0.15
    Cls          0.14
     embroid     0.14
    icha         0.14
    ROTO         0.14
    VERRIDE      0.14
    yonel        0.14
    Act Density 0.036%

    No Known Activations