INDEX
    Explanations

    occurrences of the word "both."

    New Auto-Interp
    Negative Logits
     ſta
    -0.65
     ſte
    -0.63
     ſur
    -0.58
     ſche
    -0.51
     viſ
    -0.50
     DLA
    -0.50
     ſch
    -0.49
     houſe
    -0.48
     tranſ
    -0.48
     Reſ
    -0.48
    POSITIVE LOGITS
     both
    2.17
    both
    1.99
    Both
    1.98
     Both
    1.97
     BOTH
    1.70
    BOTH
    1.61
     begge
    1.54
     både
    1.52
     Beide
    1.52
     ambos
    1.45
    Act Density 0.295%

    No Known Activations