INDEX
Explanations
references to the word "my" or variations of it within the text
New Auto-Interp
Negative Logits
itſelf
-1.07
bbene
-0.97
raiſ
-0.94
InjectAttribute
-0.90
abstractmethod
-0.89
Monfieur
-0.89
vectorielles
-0.87
vectorielle
-0.86
Houſe
-0.84
Efq
-0.84
POSITIVE LOGITS
My
1.34
own
1.26
MY
1.18
My
1.13
my
1.10
my
1.02
HIS
1.02
MY
1.00
getMy
0.98
Her
0.93
Activations Density 0.079%