INDEX
Explanations
words associated with kindness and positive attributes
New Auto-Interp
Negative Logits
NSCoder
-0.88
IsContent
-0.87
makeConstraints
-0.77
Drapeau
-0.75
DockStyle
-0.71
rrggbb
-0.69
GMENT
-0.67
egna
-0.66
AddTagHelper
-0.65
KommentareTeilen
-0.64
POSITIVE LOGITS
kindness
1.16
kindness
0.97
Kindness
0.96
generosity
0.90
generous
0.84
kindly
0.83
charitable
0.80
unkind
0.79
compassionate
0.76
kinder
0.71
Activations Density 0.362%