Explanations
instances of the name "Leonard."
Negative Logits
skirts     -0.84
glers      -0.80
ħĭ         -0.77
PLA        -0.73
worldly    -0.68
manship    -0.67
selves     -0.67
yrim       -0.66
*/(        -0.65
pron       -0.62
Positive Logits
angelo     0.97
Cohen      0.91
Leonard    0.91
Lauder     0.88
ardo       0.85
ency       0.83
Bernstein  0.82
arious     0.81
Nim        0.80
zman       0.80
Activations Density 0.002%