INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -1.63
    3
    -1.52
     sayap
    -1.52
     aksesuar
    -1.49
    -1.48
     niektórych
    -1.42
     Houſe
    -1.41
     hırka
    -1.41
     rengi
    -1.40
     konsek
    -1.38
    POSITIVE LOGITS
     to
    2.86
     It
    1.79
     The
    1.62
     There
    1.47
    1.40
    Skocz
    1.34
    Koordinaten
    1.28
    и
    1.28
     $
    1.27
     

    1.26
    Act Density 0.096%

    No Known Activations