INDEX
Explanations
mentions of specific names and proper nouns
references to individuals with the name "Humphrey."
New Auto-Interp
Negative Logits
Piercing
-0.67
REDACTED
-0.65
Gaza
-0.65
20439
-0.64
Constructed
-0.63
eq
-0.63
ãĤº
-0.62
flashlight
-0.62
代
-0.61
henko
-0.60
POSITIVE LOGITS
reys
1.54
rey
1.26
Humph
1.21
ministic
0.87
rys
0.85
ries
0.85
awei
0.85
ry
0.84
inx
0.84
umph
0.82
Activations Density 0.005%