INDEX
Explanations
names, particularly the name "Bernard"
the names of individuals, particularly those with the first name "Bernard"
New Auto-Interp
Negative Logits
conservancy
-0.83
ngth
-0.77
pter
-0.76
stanbul
-0.73
utra
-0.73
pps
-0.72
TOR
-0.71
perse
-0.71
ndra
-0.70
worldly
-0.69
POSITIVE LOGITS
ians
0.88
Bernard
0.88
iam
0.83
ello
0.83
ique
0.80
ienne
0.80
ella
0.79
ously
0.78
Suarez
0.76
deen
0.76
Activations Density 0.019%