INDEX
Explanations
references to the title or name "Lady" and variations of it
New Auto-Interp
Negative Logits
ander
-0.17
ters
-0.16
ists
-0.15
maxx
-0.15
elsey
-0.15
ù
-0.15
meisje
-0.15
ISTS
-0.14
krom
-0.14
lers
-0.14
POSITIVE LOGITS
bug
0.30
Gaga
0.29
bugs
0.26
finger
0.24
bird
0.23
/man
0.22
Liberty
0.20
like
0.20
Luck
0.19
aga
0.18
Activations Density 0.010%