INDEX
Explanations
mentions of the name "Michele" and similar proper nouns
New Auto-Interp
Negative Logits
hip
-0.78
hips
-0.78
holes
-0.77
erker
-0.75
emouth
-0.72
\\\\\\\\
-0.71
ĸļ
-0.71
olver
-0.70
ibaba
-0.69
raint
-0.69
POSITIVE LOGITS
Bach
1.23
Michele
1.00
Pelosi
0.94
Tome
0.84
Fey
0.83
Schmidt
0.79
kson
0.79
Crist
0.77
tti
0.77
tta
0.77
Activations Density 0.004%