INDEX
Explanations
phrases related to caring, concern, and interest
the concept of care and concern in various contexts
New Auto-Interp
Negative Logits
ross
-0.78
hiba
-0.77
BuyableInstoreAndOnline
-0.72
Lay
-0.72
aurus
-0.71
MX
-0.70
cession
-0.69
Paper
-0.68
zynski
-0.67
SPONSORED
-0.66
POSITIVE LOGITS
preserving
0.99
improving
0.88
aesthetics
0.85
fairness
0.83
maximizing
0.81
respecting
0.80
integrity
0.78
ĺħ
0.78
protecting
0.77
politics
0.77
Activations Density 0.053%