Explanations
phrases related to caregiving and parental responsibilities
Negative Logits
gli      -0.17
mans     -0.15
848      -0.15
fol      -0.14
dz       -0.14
ÂĿ       -0.14
793      -0.14
elo      -0.14
aland    -0.14
fen      -0.14
Positive Logits
ORE      0.16
ÏĢη      0.15
ween     0.15
igne     0.15
kees     0.15
plat     0.15
jerne    0.15
ismet    0.14
ÅĽli     0.14
insi     0.14
Activations Density 0.103%