INDEX
    Explanations

    instances of the word "another" in various contexts

    New Auto-Interp
    Negative Logits
     autres
    -0.86
     lainnya
    -0.78
     demais
    -0.77
     تضيفلها
    -0.71
     others
    -0.70
    other
    -0.70
    others
    -0.69
     demás
    -0.66
     restantes
    -0.66
     însă
    -0.65
    POSITIVE LOGITS
     layer
    0.81
     similar
    0.79
    worldly
    0.79
     couple
    0.76
     equally
    0.75
     pair
    0.73
     reason
    0.71
     dimension
    0.71
     thing
    0.69
    !("{}",
    0.69
    Act Density 0.097%

    No Known Activations