INDEX
Explanations
expressions of deservingness or worthiness
phrases indicating entitlement or merit
New Auto-Interp
Negative Logits
ullivan
-0.68
cross
-0.66
gdala
-0.65
INS
-0.60
plateau
-0.60
Ou
-0.59
Bohem
-0.59
gap
-0.58
edd
-0.57
law
-0.56
POSITIVE LOGITS
praise
0.87
applause
0.86
consideration
0.81
attention
0.81
credit
0.81
recognition
0.81
precedence
0.80
acknowledgement
0.80
better
0.80
ILY
0.76
Activations Density 0.042%