INDEX
Explanations
mentions of the name "Harold"
the name "Harold" in various contexts
Negative Logits
ongs      -0.90
issan     -0.89
insula    -0.89
eanor     -0.88
illions   -0.83
igger     -0.83
ean       -0.83
psey      -0.82
rison     -0.82
ocrats    -0.81
Positive Logits
Kut       0.78
Vaj       0.74
Wi        0.71
Harold    0.70
Lank      0.69
UID       0.69
Melvin    0.68
Wend      0.68
Bolt      0.67
fur       0.67
Activations Density: 0.020%