INDEX
Explanations
mentions of ownership or the concept of being an owner in various contexts
New Auto-Interp
Negative Logits
ute
-0.20
ula
-0.19
ulla
-0.16
ero
-0.16
ëĭ¤
-0.15
uten
-0.15
dozen
-0.15
Ã¥n
-0.15
oning
-0.15
iones
-0.14
POSITIVE LOGITS
/operator
0.31
/operators
0.30
-operator
0.29
/man
0.24
-manager
0.17
trÃŃ
0.17
/manage
0.17
/admin
0.16
-fashioned
0.16
lier
0.16
Activations Density 0.044%