INDEX
Explanations
proper nouns
instances of the name "Graham."
Negative Logits
puter       -0.94
merce       -0.78
Rated       -0.78
selves      -0.71
PLA         -0.71
psychiat    -0.69
worldly     -0.69
senal       -0.69
ngth        -0.68
LOAD        -0.67
Positive Logits
Graham      0.91
Hancock     0.85
Graham      0.82
esse        0.77
essen       0.76
Greene      0.76
etti        0.73
etz         0.72
abba        0.70
maxwell     0.70
Activations Density 0.015%
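
As a rough illustration of the density figure, the sketch below assumes that "activations density" means the share of sampled tokens on which the feature fires, that is, has an activation above zero. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: a token counts as "firing" when its activation
    is strictly greater than the threshold (zero by default).
    """
    values = list(activations)
    if not values:
        return 0.0
    firing = sum(1 for a in values if a > threshold)
    return firing / len(values)


# Hypothetical sample: the feature fires on 3 of 20,000 tokens.
sample = [0.0] * 20_000
for position, value in [(5, 1.2), (100, 0.4), (19_999, 3.1)]:
    sample[position] = value

density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.015% corresponds to roughly 15 firing tokens per 100,000 sampled, which fits a feature that responds mainly to a specific name.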