INDEX
Explanations
possessive pronouns and their corresponding contexts
New Auto-Interp
Negative Logits
incy
-0.16
endum
-0.15
ufe
-0.15
urovision
-0.15
ansom
-0.15
apel
-0.15
imens
-0.14
uyen
-0.14
aille
-0.14
edList
-0.14
POSITIVE LOGITS
own
0.24
first
0.20
dream
0.20
desired
0.19
desired
0.19
esy
0.18
own
0.17
dream
0.17
Own
0.15
OWN
0.15
Activations Density 0.156%