Explanations
people's names
instances of the name "Maria" in various contexts
Negative Logits
ancies      -0.91
kefeller    -0.90
hire        -0.82
etition     -0.81
ridges      -0.78
¥ŀ          -0.76
unal        -0.76
recy        -0.75
onomy       -0.75
uably       -0.73
Positive Logits
Maria       1.19
Teresa      1.02
Theresa     1.01
Elena       0.92
Maria       0.91
Crist       0.91
Isabel      0.87
ppa         0.86
Lucia       0.85
Clara       0.84
Activations Density: 0.026%
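
A density figure like the one above can be read with a small sketch. It assumes, since the page does not define the term, that activation density means the fraction of sampled token positions on which the feature has a non-zero activation. The function name `activation_density` and the sample data are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float]) -> float:
    """Fraction of token positions with a positive activation.

    Assumed definition: the source page does not state how density is computed.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Hypothetical data: 26 active positions out of 100,000 sampled tokens.
sample = [1.5] * 26 + [0.0] * 99_974
print(f"{activation_density(sample):.3%}")  # prints 0.026%
```

Under that assumed definition, a density of 0.026% would correspond to the feature firing on roughly 26 of every 100,000 tokens.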