INDEX
Explanations
the word "empower" and its variants, indicating a focus on empowerment themes
Negative Logits
inks      -0.16
pst       -0.16
ration    -0.15
extern    -0.15
bum       -0.15
bsp       -0.14
uba       -0.14
inx       -0.14
erne      -0.14
eward     -0.14
Positive Logits
irical    0.34
owered    0.33
owering   0.32
yre       0.26
ower      0.26
LOYEE     0.25
athy      0.25
LOY       0.24
oleon     0.22
orio      0.22
Activations Density 0.010%