INDEX
Explanations
proper nouns and hypothetical situations starting with "Had."
instances of the word "Had" in various contexts
New Auto-Interp
Negative Logits
FTWARE
-0.81
outp
-0.69
oshenko
-0.67
scrimmage
-0.64
pept
-0.62
glove
-0.61
honoring
-0.59
tomat
-0.58
hars
-0.57
extingu
-0.56
POSITIVE LOGITS
iths
0.93
hers
0.92
rien
0.91
ibur
0.91
ith
0.88
rons
0.85
vard
0.85
rontal
0.84
bro
0.82
luck
0.82
Activations Density 0.074%