Explanations
words related to a specific proper name, "Lawrence."
instances of the name "Lawrence."
Negative Logits
hedral    -0.92
ramid     -0.88
graded    -0.87
resso     -0.82
cffff     -0.81
raid      -0.79
izoph     -0.77
minist    -0.77
effic     -0.73
arers     -0.72
Positive Logits
Liver     0.96
ville     0.87
Lawrence  0.85
Hague     0.84
ç·        0.83
burg      0.83
Kra       0.82
Berkeley  0.77
rence     0.77
ocity     0.77
Activations Density 0.038%