INDEX
Explanations
mentions of the name "Anna" in the text
mentions of the name "Anna."
New Auto-Interp
Negative Logits
enegger
-0.83
ember
-0.76
angler
-0.75
aneously
-0.74
inct
-0.73
oday
-0.71
iating
-0.71
rane
-0.71
awar
-0.69
ierrez
-0.68
POSITIVE LOGITS
Karen
0.96
Maria
0.92
uthor
0.86
ette
0.85
Nicole
0.85
Anna
0.84
Kendrick
0.82
isle
0.81
Elsa
0.79
Louise
0.78
Activations Density 0.020%