INDEX
Explanations
expressions of ownership and individuality
New Auto-Interp
Negative Logits
er
-0.72
faptul
-0.71
novità
-0.70
culoare
-0.67
navideño
-0.60
tuturor
-0.58
året
-0.58
ically
-0.58
involucrados
-0.58
nedeniyle
-0.58
POSITIVE LOGITS
own
1.56
Own
1.34
OWN
1.33
Own
1.27
own
1.14
sendiri
1.06
eigenen
1.02
соб
0.97
OWN
0.95
personal
0.91
Activations Density 0.064%