INDEX
Explanations
the presence of common Spanish articles and pronouns
New Auto-Interp
Negative Logits
myſelf
-1.75
itſelf
-1.71
―――――
-1.52
photolibrary
-1.52
raiſ
-1.50
pleaſure
-1.50
Majefty
-1.47
Houſe
-1.47
purpoſe
-1.46
iſt
-1.46
POSITIVE LOGITS
the
1.57
The
1.52
The
1.32
1.07
A
1.04
la
1.02
a
1.01
THE
1.01
un
1.01
le
0.94
Activations Density 0.012%