INDEX
    Explanations

    the word "had" in various contexts, indicating a focus on past experiences or completed actions

    New Auto-Interp
    Negative Logits
    woordig
    -0.79
    hlon
    -0.65
     knex
    -0.61
    ಿದೆ
    -0.61
    ocate
    -0.60
     perſon
    -0.60
     Olsson
    -0.60
    blume
    -0.60
    いません
    -0.59
    blumen
    -0.58
    POSITIVE LOGITS
     had
    3.57
    Had
    3.03
    had
    2.93
     Had
    2.91
     HAD
    2.66
    HAD
    2.11
     hadden
    2.06
     hadde
    1.94
     hatte
    1.93
     hatten
    1.93
    Act Density 0.091%

    No Known Activations