INDEX
    Explanations

    phrases indicating examples or instances

    Negative Logits
        Token         Logit
        " demás"      -0.49
        " übrigen"    -0.44
        " other"      -0.40
        " demais"     -0.39
        " otros"      -0.39
        " altri"      -0.39
        " outra"      -0.39
        " lainnya"    -0.38
        " otras"      -0.36
        " beiden"     -0.36
    Positive Logits
        Token         Logit
        " those"      1.05
        " الرياضيه"   0.99
        "those"       0.89
        " ceux"       0.85
        " celles"     0.84
        " كومونز"     0.84
        " namely"     0.83
        " ones"       0.81
        " الحره"      0.81
        " quelli"     0.80
    Activation Density: 0.703%

    No Known Activations