INDEX
Explanations
phrases indicating social interactions and family dynamics
New Auto-Interp
Negative Logits
енÑĤа
-0.15
ermint
-0.14
cov
-0.14
VÅ¡
-0.14
@student
-0.14
cest
-0.14
mort
-0.14
ëį
-0.14
θα
-0.14
à¥Ģà¤Ł
-0.13
POSITIVE LOGITS
spontaneous
0.17
Age
0.17
serious
0.17
rencontrer
0.17
NSA
0.16
age
0.16
seeking
0.16
seeks
0.16
single
0.16
-serif
0.15
Activations Density 0.137%