INDEX
Explanations
instances of the word "my."
"my" followed by nouns
Negative Logits
itſelf    -0.78
ſelves    -0.67
faſt      -0.60
ſtand     -0.59
ſtate     -0.59
ſever     -0.59
leſs      -0.57
ſtand     -0.57
ſtance    -0.56
houſe     -0.55
Positive Logits
my        1.34
my        1.11
My        1.09
My        1.06
MY        0.99
minha     0.85
MY        0.84
mijn      0.84
getMy     0.83
Mijn      0.79
Activations Density 0.064%