INDEX
    Explanations

    names and titles

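    Negative Logits
    -0.09  " direção"
    -0.08  " פעולה"
    -0.08  " applications"
    -0.08  " sollte"
    -0.08  (token text missing)
    -0.08  (token text missing)
    -0.08  " instituted"
    -0.07  (token text missing)
    -0.07  " समारोह"
    -0.07  (token text missing)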
    Positive Logits
    0.09  " Cali"
    0.08  "ILD"
    0.08  " jade"
    0.08  "oboye"
    0.08  " Jum"
    0.07  (token text missing)
    0.07  (token text missing)
    0.07  "jh"
    0.07  " android"
    0.07  " ถูก"
    Activation Density: 0.016%

    No Known Activations