INDEX
Explanations
themes related to dominance and submission dynamics
New Auto-Interp
Negative Logits
Alam
-0.16
asso
-0.16
udd
-0.15
wald
-0.15
erli
-0.15
acho
-0.15
udder
-0.14
oop
-0.14
avorites
-0.14
awner
-0.14
POSITIVE LOGITS
inst
0.15
inel
0.15
hip
0.15
segue
0.14
ucci
0.14
ily
0.14
ç´į
0.14
escal
0.14
rys
0.14
ERY
0.14
Activations Density 0.209%