INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Método
    -0.07
    handle
    -0.06
    ілі
    -0.06
     Metodo
    -0.06
     Doctrine
    -0.06
     olmadı
    -0.06
    -0.06
    -0.06
    fix
    -0.06
     TIFF
    -0.06
    POSITIVE LOGITS
     --------------------------------
    0.07
     Γ
    0.07
    、「
    0.07
     bow
    0.06
     doz
    0.06
     ohled
    0.06
     inté
    0.06
     Tanz
    0.06
     disag
    0.06
    0.06
    Act Density 0.011%

    No Known Activations