INDEX
Explanations
relationships and personal connections, particularly in the context of romantic partnerships and infidelity
New Auto-Interp
Negative Logits
xin
-0.18
marriages
-0.17
Blond
-0.15
granddaughter
-0.15
ζε
-0.15
widow
-0.14
bat
-0.14
pread
-0.14
Äı
-0.14
daughter
-0.14
POSITIVE LOGITS
significant
0.44
Significant
0.40
partner
0.37
boyfriend
0.35
significant
0.35
BF
0.34
beau
0.30
bf
0.29
lover
0.29
param
0.29
Activations Density 0.213%