INDEX
Explanations
the name "Leonard" with varying levels of activation
the name "Leonard" in various contexts
New Auto-Interp
Negative Logits
skirts
-0.75
tch
-0.73
manship
-0.73
PLA
-0.72
ħĭ
-0.71
exempt
-0.70
illance
-0.69
76561
-0.69
pron
-0.69
sites
-0.67
POSITIVE LOGITS
Leonard
1.04
Lauder
0.88
Cohen
0.88
angelo
0.82
Bernstein
0.78
Hayward
0.78
ergic
0.76
Strauss
0.76
antine
0.76
Bros
0.75
Activations Density 0.008%