INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     consistency
    -0.09
     została
    -0.08
    -0.08
     incons
    -0.07
     pitanje
    -0.07
    Consistency
    -0.07
    ascimento
    -0.07
     dyd
    -0.07
     Atelier
    -0.07
     fw
    -0.07
    POSITIVE LOGITS
     כדאי
    0.09
    —not
    0.08
     Situationen
    0.08
     vraiment
    0.08
     жағдай
    0.08
     שבה
    0.08
     circ
    0.08
     فيه
    0.08
     spécialisés
    0.08
    уға
    0.08
    Act Density 0.012%

    No Known Activations