INDEX
Explanations
mentions of the word "care" in various contexts, potentially related to healthcare, caregiving, or cautionary actions
references to "care" and its various forms related to healthcare or caregiving contexts
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.78
awe
-0.65
integer
-0.64
RESULTS
-0.62
poppy
-0.61
ument
-0.60
perjury
-0.59
hinder
-0.59
âϦ
-0.59
Redditor
-0.58
POSITIVE LOGITS
taker
1.26
fully
0.97
er
0.92
llan
0.92
ndra
0.91
lli
0.89
giving
0.89
care
0.89
ful
0.89
tta
0.85
Activations Density 0.015%