INDEX
Explanations
phrases related to personal ownership and customization
New Auto-Interp
Negative Logits
dle
-0.16
ryn
-0.15
ingly
-0.14
endo
-0.14
dad
-0.14
ailable
-0.14
βά
-0.14
225
-0.13
enas
-0.13
/react
-0.13
POSITIVE LOGITS
own
0.32
Own
0.21
Own
0.21
respective
0.20
próp
0.19
eigenen
0.18
à¹Ģà¸Ńà¸ĩ
0.17
OWN
0.17
propia
0.16
own
0.16
Activations Density 0.026%