INDEX
    Explanations

    mentions of the word "both."

    Negative Logits
    Token               Logit
    " Ver"              -0.54
    " favorite"         -0.53
    "Ver"               -0.51
    " ver"              -0.50
    " mer"              -0.49
    "seb"               -0.49
    "TextAppearance"    -0.48
    " wanna"            -0.48
    " pas"              -0.47
    " sos"              -0.47
    Positive Logits
    Token               Logit
    " both"             1.69
    " både"             1.50
    "both"              1.50
    " zowel"            1.46
    "Both"              1.38
    " Both"             1.31
    " sowohl"           1.27
    " zarówno"          1.20
    "tanto"             1.18
    " BOTH"             1.17
    Activation Density: 0.146%

    No Known Activations