INDEX
Explanations
verbs and adjectives related to giving positive feedback
expressions of admiration or praise
New Auto-Interp
Negative Logits
ramid
-0.78
bang
-0.71
soDeliveryDate
-0.71
itamin
-0.70
itol
-0.69
agnetic
-0.69
proport
-0.67
claimer
-0.66
abouts
-0.66
ueller
-0.66
POSITIVE LOGITS
ifully
0.85
fully
0.80
him
0.77
virtues
0.76
ably
0.74
them
0.72
enance
0.70
ously
0.68
bravery
0.68
Neh
0.67
Activations Density 0.115%