Explanations
themes related to complex emotional relationships and interpersonal conflicts
Negative Logits
olean      -0.19
ocab       -0.18
PERT       -0.15
idla       -0.15
ystate     -0.15
ritis      -0.15
-cols      -0.15
edback     -0.15
ieres      -0.15
achelor    -0.14
Positive Logits
crush      0.18
attraction 0.17
plat       0.15
ix         0.15
IX         0.15
Grat       0.15
sexual     0.15
/lang      0.15
warmer     0.14
conquest   0.14
Activations Density: 0.249%