INDEX
Explanations
phrases that indicate purpose or intention
New Auto-Interp
Negative Logits
ilm
-0.17
ties
-0.16
gren
-0.15
usercontent
-0.15
adele
-0.15
ively
-0.15
mite
-0.15
ãĤĪãģĨãģª
-0.15
mh
-0.14
ka
-0.14
POSITIVE LOGITS
sake
0.26
bidden
0.26
geries
0.25
-profit
0.24
/by
0.23
instance
0.21
aging
0.20
purposes
0.20
/from
0.19
/about
0.19
Activations Density 0.716%