INDEX
    Explanations

    instances of the word "hit" in various contexts

    New Auto-Interp
    Negative Logits
     Tundra
    -0.69
    🔹
    -0.69
     Dahl
    -0.68
     McCartney
    -0.66
     Jacobsen
    -0.65
    ]_
    -0.65
    esinde
    -0.64
    °)
    -0.63
    ()}
    -0.63
    надцать
    -0.63
    POSITIVE LOGITS
     HIT
    1.56
     Hit
    1.53
     hit
    1.52
    HIT
    1.48
    Hit
    1.46
     hits
    1.46
    hit
    1.45
     Hits
    1.39
     hitting
    1.36
    +#+#
    1.33
    Act Density 0.034%

    No Known Activations