INDEX
Explanations
phrases and variations of the word "possess."
New Auto-Interp
Negative Logits
adesh
-0.16
bose
-0.16
itage
-0.15
inar
-0.15
utr
-0.14
406
-0.14
avis
-0.14
Ĺi
-0.13
lic
-0.13
lek
-0.13
POSITIVE LOGITS
ively
0.18
entially
0.17
ment
0.17
ãģ¡ãģ¯
0.16
mind
0.16
ãĥ«ãĥī
0.16
ments
0.16
gem
0.16
ful
0.15
possess
0.15
Activations Density 0.015%