INDEX
    Explanations

    occurrences of the word "every"

    New Auto-Interp
    Negative Logits
    //
    -0.73
    Bauer
    -0.68
    zt
    -0.66
    Cone
    -0.61
    -0.59
    ässä
    -0.59
    йом
    -0.58
    Monique
    -0.57
     Magdalene
    -0.57
     Bauer
    -0.56
    POSITIVE LOGITS
    every
    1.81
     EVERY
    1.77
     every
    1.76
    EVERY
    1.71
     Every
    1.68
    Every
    1.65
     Ogni
    1.32
     Jedes
    1.24
     everytime
    1.16
     Elke
    1.16
    Act Density 0.063%

    No Known Activations