Explanations
proper nouns, specifically the name "Bernard"
mentions of the name "Bernard"
Negative Logits
Token          Logit
stanbul        -0.76
ngth           -0.76
worldly        -0.75
utra           -0.73
perse          -0.72
NetMessage     -0.71
pter           -0.70
licted         -0.70
conservancy    -0.69
dfx            -0.69
Positive Logits
Token          Logit
Bernard        0.94
ians           0.89
ienne          0.78
ique           0.77
ello           0.75
Suarez         0.73
ian            0.73
ardo           0.73
Tasman         0.72
iam            0.72
Activations Density: 0.014%