INDEX
    Explanations

    Textual references to, and descriptions of, mechanical or industrial processes

    Negative Logits
    om
    -0.17
    aque
    -0.15
    اÛĮز
    -0.15
    acs
    -0.15
     Sharing
    -0.15
    Sharing
    -0.15
    mans
    -0.15
    mani
    -0.15
    cae
    -0.14
    kate
    -0.14
    Positive Logits
    ivec
    0.19
    íĭ±
    0.15
    aley
    0.15
    ãģ°ãģĭãĤĬ
    0.14
     ÑĤÑı
    0.14
    ênh
    0.13
       
    0.13
    uild
    0.13
    åħ¨éĥ¨
    0.13
    رÙħ
    0.13
    Activation Density 0.152%

    No Known Activations