Explanations
the name "Kathleen"
mentions of specific individuals, particularly those named Kathleen
Negative Logits
eleph       -0.97
ãĥīãĥ©      -0.79
unin        -0.76
standing    -0.74
icult       -0.73
urity       -0.73
direction   -0.73
ardless     -0.71
imov        -0.71
saf         -0.71

Positive Logits
Wynne       1.07
Yamato      0.92
Upton       0.90
roid        0.85
Kathleen    0.85
rette       0.84
Wil         0.82
Louise      0.81
Sue         0.81
Marie       0.80
Activations Density: 0.009%