Explanations
phrases related to parenting and family life
Negative Logits
jong                -0.71
Scale               -0.65
ESA                 -0.64
azed                -0.63
externalActionCode  -0.63
itiz                -0.62
gaard               -0.62
anwhile             -0.61
lua                 -0.60
nomine              -0.60
Positive Logits
roots               0.65
extra               0.64
steam               0.63
tml                 0.62
tons                0.62
swat                0.60
some                0.60
tonnes              0.60
stantial            0.60
appearances         0.60
Activations Density 0.855%