INDEX
Explanations
pronouns that indicate possession or ownership
New Auto-Interp
Negative Logits
invol
-0.06
naÄį
-0.06
kke
-0.06
puter
-0.06
ÑĢÑıд
-0.06
/Instruction
-0.06
avou
-0.06
ONO
-0.06
sembly
-0.06
ÙĪÙħات
-0.06
POSITIVE LOGITS
own
0.11
Own
0.07
own
0.07
iglia
0.07
próp
0.07
OWN
0.07
Own
0.07
ieves
0.07
Advertisement
0.06
CLU
0.06
Activations Density 0.018%