INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     nassi
    -0.08
     nkar
    -0.08
     nz
    -0.08
     preferably
    -0.08
     issuing
    -0.08
     qualche
    -0.08
    /oder
    -0.08
     ocult
    -0.08
     kacha
    -0.07
     hatta
    -0.07
    POSITIVE LOGITS
    描述
    0.09
     phr
    0.09
    یه
    0.08
    에서는
    0.08
    0.08
    0.08
    体系
    0.08
     wording
    0.07
     Leon
    0.07
     veio
    0.07
    Act Density 0.024%

    No Known Activations