Explanations
phrases related to social interactions and relationships
Negative Logits
hausen    -0.17
Ø´ÙĪ      -0.17
agos      -0.16
udic      -0.15
ousel     -0.15
ÙĬÙĩ      -0.15
jes       -0.15
udo       -0.14
ptom      -0.14
802       -0.14
Positive Logits
isphere      0.16
unb          0.16
Kız          0.16
mol          0.15
themselves   0.14
selectors    0.14
ahl          0.14
ilo          0.14
oa           0.14
Rac          0.14
Activations Density: 1.609%