INDEX
Explanations
instances of the name "Harjit" with varying activation values
the name "Harjit" and possibly references to "jam."
New Auto-Interp
Negative Logits
bred
-0.93
alogue
-0.78
tarian
-0.77
Beckham
-0.73
Undead
-0.73
bearer
-0.70
vation
-0.69
Ultr
-0.69
Duchess
-0.66
ansk
-0.66
POSITIVE LOGITS
jit
4.08
jam
0.82
frames
0.77
jas
0.74
Asset
0.72
RAND
0.71
Chandra
0.70
holes
0.69
Jinping
0.68
rily
0.66
Activations Density 0.009%