INDEX
    Explanations
    Negative Logits
    Token           Logit
    " the"          -1.96
    " also"         -1.88
    " a"            -1.80
    " نیز"          -1.75
    ",’"            -1.59
    " 木製"         -1.58
    " cristian"     -1.58
    "場は"          -1.52
    "!’"            -1.50
    "2"             -1.49
    Positive Logits
    Token                 Logit
    " asegurarse"         1.67
    "."                   1.63
    " that"               1.63
    " particularmente"    1.59
    "参见"                1.57
    "?!”"                 1.56
    "{}"                  1.55
    " mantenerse"         1.54
                          1.51
    " škole"              1.49
    Activation Density: 0.002%

    No Known Activations