INDEX
Explanations
phrases that express possession or existence related to "having."
New Auto-Interp
Negative Logits
undert
-0.14
indr
-0.14
Ple
-0.14
drv
-0.14
rael
-0.14
raquo
-0.13
sop
-0.13
589
-0.13
fen
-0.13
acom
-0.13
POSITIVE LOGITS
297
0.16
eria
0.15
okens
0.15
utow
0.15
IGIN
0.15
urette
0.15
eyim
0.15
obe
0.14
orgen
0.14
ursor
0.13
Activations Density 0.020%