INDEX
Explanations
situations involving domestic life and personal relationships
New Auto-Interp
Negative Logits
lamaz
-0.15
andler
-0.14
Tavern
-0.14
styled
-0.14
anes
-0.14
kup
-0.13
Local
-0.13
å§Ķ
-0.13
lush
-0.13
lez
-0.13
POSITIVE LOGITS
/shared
0.18
living
0.17
shared
0.16
shared
0.16
_SHARED
0.15
åħ±
0.15
Shared
0.15
aurant
0.15
issen
0.15
åĢ«
0.14
Activations Density 0.215%