INDEX
Explanations
terms related to caregiving or caretaking
mentions of caregiving or care-related themes
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.82
akedown
-0.64
rand
-0.64
å°Ĩ
-0.63
ãĥ³ãĤ¸
-0.63
UE
-0.60
UES
-0.59
toast
-0.58
debugging
-0.58
uments
-0.58
POSITIVE LOGITS
taker
1.80
giving
1.43
taking
1.33
lessness
1.20
free
1.19
lessly
1.14
fully
1.12
worn
1.06
ening
1.06
ful
0.96
Activations Density 0.029%