INDEX
    Explanations

    instances of the name "Harjit" with varying activation values

    the name "Harjit" and possibly references to "jam."

    New Auto-Interp
    Negative Logits
    bred
    -0.93
    alogue
    -0.78
    tarian
    -0.77
     Beckham
    -0.73
     Undead
    -0.73
     bearer
    -0.70
    vation
    -0.69
     Ultr
    -0.69
     Duchess
    -0.66
    ansk
    -0.66
    POSITIVE LOGITS
    jit
    4.08
    jam
    0.82
    frames
    0.77
    jas
    0.74
    Asset
    0.72
     RAND
    0.71
     Chandra
    0.70
    holes
    0.69
     Jinping
    0.68
    rily
    0.66
    Act Density 0.009%

    No Known Activations