INDEX
    Explanations

    phrases and terms indicating repetition or emphasis on new ideas or information

    New Auto-Interp
    Negative Logits
     autres
    -0.99
     demais
    -0.90
     demás
    -0.83
    other
    -0.81
     other
    -0.80
    others
    -0.78
     others
    -0.78
     lainnya
    -0.71
     còn
    -0.70
    autres
    -0.70
    POSITIVE LOGITS
     couple
    0.74
     dozen
    0.73
     }}"></
    0.70
     layer
    0.70
     important
    0.70
     huge
    0.69
     interesting
    0.68
     paio
    0.66
    worldly
    0.66
     few
    0.65
    Act Density 0.114%

    No Known Activations