INDEX
Explanations
references to the name "Lawrence."
mentions of the name "Lawrence"
New Auto-Interp
Negative Logits
ramid
-0.89
raid
-0.88
resso
-0.88
izoph
-0.87
hedral
-0.83
joined
-0.80
cffff
-0.80
tnc
-0.78
arers
-0.75
requ
-0.74
POSITIVE LOGITS
Lawrence
0.94
Liver
0.94
ville
0.91
Berkeley
0.87
Hague
0.79
Foster
0.79
burg
0.78
ç·
0.78
Wel
0.78
Kra
0.77
Activations Density 0.022%